0710.5636/ms.tex
1: \documentclass[numberedappendix]{emulateapj}
2: %\documentclass[letterpaper]{emulateapj} 
3: %\documentclass[12pt,preprint]{aastex}
4: %\documentclass[manuscript]{aastex}
5: %% preprint2 produces a double-column, single-spaced document:
6: %\documentclass[preprint2]{aastex}
7: %% Sometimes a paper's abstract is too long to fit on the
8: %% title page in preprint2 mode. When that is the case,
9: %% use the longabstract style option.
10: %% \documentclass[preprint2,longabstract]{aastex}
11: 
12: 
13: \usepackage{natbib}
14: \usepackage{verbatim}
15: \renewcommand{\d}{{\rm d}}
16: \newcommand{\beq}{\begin{equation}}
17: \newcommand{\eeq}{\end{equation}}
18: 
19: 
20: %% You can insert a short comment on the title page using the command below.
21: 
22: %\slugcomment{To be submitted, \today}
23: 
24: %% If you wish, you may supply running head information, although
25: %% this information may be modified by the editorial offices.
26: %% The left head contains a list of authors,
27: %% usually a maximum of three (otherwise use et al.).  The right
28: %% head is a modified title of up to roughly 44 characters.
29: %% Running heads will not print in the manuscript style.
30: 
31: \shorttitle{Where is the matter in Abell 2218?}
32: \shortauthors{El\'\i asd\'ottir et al.}
33: 
34: %% This is the end of the preamble.  Indicate the beginning of the
35: %% paper itself with \begin{document}.
36: 
37: \begin{document}
38: \bibliographystyle{apj}
39: %\bibliographystyle{aa}
40: 
41: 
42: 
43: \title{Where is the matter in the Merging Cluster Abell 2218?
44: \thanks{Based on observations made with the Hubble Telescope's Advance Camera for Surveys, Chandra X-ray Observatory, the William Herschel Telescope and LRIS on the KECK telescope.} 
45: }
46: 
47: %% Use \author, \affil, and the \and command to format
48: %% author and affiliation information.
49: %% Note that \email has replaced the old \authoremail command
50: %% from AASTeX v4.0. You can use \email to mark an email address
51: %% anywhere in the paper, not just in the front matter.
52: %% As in the title, use \\ to force line breaks.
53: 
54: %\author{\'Ard\'\i s El\'\i asd\'ottir\altaffilmark{2} and friends :) \altaffilmark{3}}
55: 
56: \author{\'Ard\'\i s El\'\i asd\'ottir\altaffilmark{2}, Marceau Limousin\altaffilmark{2}, Johan Richard\altaffilmark{3}, Jens Hjorth\altaffilmark{2}, Jean-Paul Kneib\altaffilmark{4},  Priya Natarajan\altaffilmark{5,6}, Kristian Pedersen\altaffilmark{2}, Eric Jullo\altaffilmark{7}, Danuta Paraficz\altaffilmark{2,8}}
57: 
58: 
59: \email{ardis@dark-cosmology.dk}
60: 
61: %\and
62: 
63: %\author{bla\altaffilmark{5}}
64: %\affil{bla}
65: 
66: %% Notice that each of these authors has alternate affiliations, which
67: %% are identified by the \altaffilmark after each name.  Specify alternate
68: %% affiliation information with \altaffiltext, with one command per each
69: %% affiliation.
70: 
71: 
72: \altaffiltext{2}{Dark Cosmology Centre, Niels Bohr Institute, University of
73: Copenhagen, Juliane Maries Vej 30, DK-2100 Copenhagen \O, Denmark; 
74: (ardis, marceau, danutas, kp, jens)@dark-cosmology.dk.}
75: \altaffiltext{3}{Department of Astronomy, California Institute of  Technology, 105-24, Pasadena, CA91125; 5(johan,kneib)@astro.caltech.edu}
76: \altaffiltext{4}{OAMP, Laboratoire d'Astrophysique de Marseille UMR 6110 Traverse du Siphon 13012 Marseille, France; Jean-Paul.Kneib@oamp.fr}
77: \altaffiltext{5}{Astronomy Department, Yale University, P.O. Box 208101, New Haven, CT 06520-8101, USA, priya@astro.yale.edu}
78: \altaffiltext{6}{Department of Physics, Yale University, P.O. Box 208101, New Haven, CT 06520-8101; USA}
79: \altaffiltext{7}{European Southern Observatory, Alonso de Cordova 3107, Vitacura, Chile; ejullo@eso.org }
80: \altaffiltext{8}{Nordic Optical Telescope (NOT), Apartado 474, 38700 Santa Cruz de La Palma, Canary Islands, Spain}
81: 
82: 
83: \begin{abstract}
84: We present a parametric strong lensing model of the cluster Abell 2218 based on HST ACS data.  We constrain the lens model using 8 previously known multiply imaged systems, of which 7 have spectroscopically confirmed redshifts.   In addition, we propose five candidate multiply imaged systems and estimate their redshifts using our mass model.  
85: The model parameters are optimized in the source plane by a bayesian Monte Carlo Markov Chain as implemented in the the publicly available software Lenstool.  We find rms$_s=0\farcs12$ for the scatter  of the sources in the source plane, which translates into rms$_i=1\farcs49$ between the predicted and measured image positions in the image plane.
86: We find that the projected mass distribution of Abell 2218 is bimodal, which is supported by an analysis of the light distribution.  We also find evidence for two structures in velocity space, separated by $\sim 1000$~km~$s^{-1}$, corresponding to the two large scale dark matter clumps.  We find that the lensing constraints can not be well reproduced using only dark matter halos associated with the cluster galaxies, but that the dark matter is required to be smoothly distributed in large scale halos.   At $100\arcsec$ ($291$~kpc) the enclosed projected mass is $3.8\times10^{14}$~M$_\sun$.  At that radius, the large scale halos contribute $\sim85\%$ of the mass, the brightest central galaxy (BCG) contributes  $\sim9\%$ while the remaining $\sim6\%$ come from the other cluster galaxies.  We find that the model is not very sensitive to the fainter (and therefore by assumption less massive) galaxy sized halos, unless they locally perturb a given multiply imaged system.  Therefore, dark galaxy sized substructure can be reliably constrained only if it locally perturbs one of the systems.  The massive BCG and galaxies which locally perturb a multiply imaged system are reliably detected in the mass reconstruction.  In an appendix we give a self-contained description of the parametric profile we use, the dual pseudo isothermal elliptical mass distribution (dPIE).  This profile is a two component pseudo isothermal mass distribution (PIEMD) with both a core radii and a scale radii.
87: \end{abstract}
88: 
89: 
90: \keywords{ dark matter --- galaxies: clusters: individual (Abell 2218) --- gravitational lensing}
91: 
92: \section{Introduction}
93: Dark matter dominates over baryonic matter in the universe, but its nature is not known.  The study of the inner parts of dark matter halos can give insight into the nature of the dark matter, as the steepness of the profile is correlated with the interaction between the dark matter itself and with the baryonic matter.  According to $\Lambda$CDM simulations, the mass distribution of galaxy clusters should be dominated by their dark matter halos.  Gravitational lensing, which is sensitive to the total matter distribution, visible or dark, is ideal for studying the mass distribution of clusters.  Strong lensing features, consisting of multiply imaged and strongly distorted background sources, provide constraints on the inner parts of the cluster, while weak lensing features, consisting of weakly distorted singly imaged background sources, provide constraints on the outer slope of the surface mass profile \citep[see e.g.,][]{smail1995a,smail1995b,kneib1996, smail1997,abdelsalam1998,natarajan2002b, bradac2006,gavazzi2007,limousin2007}.
94: 
95: Lensing can therefore provide unique information about the total mass distribution of clusters, from the inner to the outer parts.  In addition, lensing can in principle be used to deduce various cosmological parameters (e.g. $H_0$, $\Omega_\Lambda$, $\Omega_m$).   This has already been extensively applied to lensing on galaxy scales \citep[see e.g.,][]{schechter1997,koopmans2003} and to a smaller degree for lensing on cluster scales \citep[see e.g.,][]{soucail2004, meneghetti2005}, where the accuracy of the mass map is a limiting factor.  The accuracy of the mass map is strongly dependent on the number of multiply imaged systems used to constrain it. Therefore, to construct a robust model of the dark matter distribution, accurate enough for cosmography and for using the cluster as a gravitational telescope, it is important to include as many spectroscopically confirmed multiply imaged systems as possible \citep[see e.g.,][]{ellis2001}.
96: 
97: Abell 2218 is one of the richest clusters in the Abell galaxy cluster catalog \citep{abell1958, abell1989} and has been successfully exploited as a gravitational lens.    
98: A parametric lens model has previously been constructed by \citet{kneib1995,kneib1996} (using 1 and 2 spectroscopically confirmed systems respectively) and by \citet{natarajan2002b, natarajan2007} (using 4 and 5 spectroscopically confirmed systems) building on the model of \citet{kneib1996} and including weak lensing constraints from HST WFPC2 data.  A non-parametric model was constructed by \citet{abdelsalam1998} using three spectroscopically confirmed multiply imaged systems.  In all of these models, a bimodal mass distribution was required to explain the image configurations (i.e. the models include two large scale dark matter clumps), but the number of constraints were not sufficient to accurately constrain the second large scale dark matter clump.  Abell 2218 has also been used as a gravitational telescope, with  \citet{ellis2001} discovering a source at $z=5.6$ and \citet{kneib2004} discovering an even more distant source at $z\sim6.7$, later confirmed by \citet{egami2005} using Spitzer data.  \citet{soucail2004} estimated cosmological parameters based on a lensing model of Abell 2218 using 4 multiply imaged systems.  The latest published lens model of Abell 2218 is by \citet{smith2005}, who incorporated four multiply imaged systems and weak lensing constraints, using the WFPC2 data.  Although the number of constraints has increased from the initial models, all previous models have assumed that the location of the second dark matter clump coincided with the brightest galaxy in the South East, due to a lack of constraints in its vicinity.
99: 
100: The motivation for revisiting the modeling of Abell 2218 comes from the new ACS data which have not been used before for modeling this cluster and are superior in both resolution, sensitivity and field of view to the previous WFPC2 data set.  These new high quality data allow us to identify several subcomponents in previously known multiple images, thus adding more constraints, and in one case, two more multiple images of a known system.
101: In addition, we have measured a spectroscopic redshift for an arc around the second dark matter clump, which our model predicts to be singly imaged.
102: We also have several new candidate multiply imaged systems, which we add as constraints and estimate their redshift with the model.  In total we have 7 multiply imaged systems with measured spectroscopic redshift and 6 multiply imaged systems without spectroscopic data (of which 5 are new candidate systems).  Finally, the lensing code Lenstool, has undergone significant improvements from previous models, with the incorporation of a Monte Carlo Markov Chain (MCMC), which enables us to not only find the best model in the lowest $\chi^2$ sense, but the most likely model as measured by its Evidence \citep{jullo2007}.  The MCMC also allows for a reliable estimate of the uncertainties in the derived model parameters. 
103: 
104: The paper is organized as follows:  In Section~\ref{sec:data}, we give an overview of the data used in this paper.  We compile a list of all currently known and new multiply imaged systems in Abell 2218 and discuss the reliability of the redshift estimate of each one in Section~\ref{sec:systems}.  The methodology of the strong lensing modeling is described in Section~\ref{sec:modeling}.    In Section~\ref{sec:analysis} we present the results of our lensing analysis, and compare them to previous models.  In Section~\ref{sec:degeneracy} we discuss degeneracies in the modeling.  In Section~\ref{sec:reliability} we address how reliable our model is, and discuss the smoothness of the dark matter distribution.
105:  In Section~\ref{sec:bimodal} we interpret the bimodality of our model, along with X-ray measurements and an analysis of the distribution of cluster members in velocity space.  We summarize our main conclusions in Section~\ref{sec:conclusions}.   Throughout the paper, we adopt a flat $\Lambda$-dominated Universe with $\Omega_\Lambda=0.7$, $\Omega_m=0.3$ and $H_{0}=70\ \mathrm{km\,s}^{-1}\,\mathrm{Mpc}^{-1}$.  Following \citet{smith2005} we place the cluster at $z=0.171$.  At this redshift $1\arcsec$ corresponds to $2.91$~kpc in the given cosmology.  
106: 
107: \section{Data}
108: \label{sec:data}
109: We use data from several different sources for our lens modeling and analysis.  The basis of our modeling is Advanced Camera for Surveys (ACS) data from the Hubble Space Telescope, which allows us to identify and accurately locate multiply imaged systems.  Our cluster member catalog is also selected using the ACS data and the magnitude of each cluster member is given in the Ingrid K-band of the William Herschel Telescope.  In addition, we have obtained a spectroscopic redshift of a system using the Keck telescope and archival Chandra X-ray Observatory data have been used to produce an X-ray map of Abell 2218. 
110: 
111: \subsection{The galaxy catalog}
112: \label{sec:galaxycat}
113: Cluster members were selected based on the ACS data using the characteristic cluster red-sequences (V-Z) and (I-Z) in two color-magnitude diagrams and selecting objects lying within the red-sequences of 
114: \begin{figure}
115: \includegraphics[angle=270, scale=0.4]{f1a.eps}\\
116: \includegraphics[angle=270, scale=0.4]{f1b.eps}
117: \caption{Color-magnitude diagrams, (V-Z) and (I-Z), used for selecting the cluster members (see Section~\ref{sec:galaxycat}).  The red quadrilaterals define the red sequence and galaxies lying within are considered cluster members.   \label{fig:redsequence}}
118: \end{figure}
119: \begin{figure*}
120: \epsscale{1.0}
121: \plotone{f2.eps}
122: \caption{A color image of Abell 2218 based on ACS data (F775W, F625W and F475W filters in the red, green and blue channel respectively).  The cluster galaxies are marked in yellow (modeled using scaling relations) or blue (individually fitted).  The multiple images are labeled in green for spectroscopically confirmed systems and red for candidate systems.  The arc for which we have obtained spectroscopic redshift, S8, is labelled in cyan.  Also shown are the critical lines corresponding to $z=0.702$ (cyan), $z=2.515$ (red) and $z=6.7$ (green).  \label{fig:overview}}
123: \end{figure*}
124: both diagrams (see Figure~\ref{fig:redsequence}).  The ACS data were used to select the galaxy members due to their better photometric accuracy compared to the K-band data.  Nevertheless, the catalog is given in the K band since it is more representative of the galaxy population in elliptical galaxies.  This selection gave  $203$ cluster galaxies down to $K=19.6$.  However, we rejected six galaxies from the catalog leaving $197$ cluster members, of which four have measured redshifts showing them to be background galaxies.
125: 
126: \subsection{X-ray data}
127: The X-ray data were taken with the Advanced CCD Imaging Spectrometer (ACIS) on NASA's Chandra X-ray Observatory.  The ACIS is an array of 10 CCD's, capable of simultaneously imaging and measuring the energy of the incoming X-rays.  We combined three images taken at different epochs to construct our final X-ray map of Abell 2218.  Two of the images were taken in October 1999 and one in August 2001, with 5960~s, 11560~s and 49240~s exposure times respectively.  The data reduction was performed using standard pipelines from the Chandra Interactive Analysis of Observations (CIAO) software.  The X-ray data will be used for comparison with the lensing data in Section~\ref{sec:bimodal}.
128: 
129: 
130: 
131: \section{Multiply imaged systems}
132: \label{sec:systems}
133: Multiply imaged systems are the basis of our analysis, and are particularly useful as constraints if their redshifts are accurately determined.  In this section we list the multiple images, with or without spectroscopically confirmed redshifts, used as constraints in our lens modeling.  We propose several new candidate systems found in the ACS data and present a spectroscopic redshift measurement for an arc which lies in the region of the second dark matter clump.  All systems are listed in Table \ref{tab:systems} and shown on Figure~\ref{fig:overview}.
134: 
135: \input{tab1}
136: 
137: 
138: 
139: \subsection{Previously known systems}
140: \label{sec:prev_known}
141: The first system, system S1, is at a redshift of $z=0.702$ as measured by \citet{leborgne1992} (system 359 in their catalog).  Thanks to the new ACS observations, it has seven identified images (previously it had five identified images \citep{kneib1996}), of which two pairs are located close to individual galaxies $\#1028$ and $\#993$ in the galaxy catalog) which affect the lensing signal.
142: 
143: System S2 is a star-forming galaxy at a redshift of $z=2.515$ \citep{ebbels1996}.  It consists of a fold and a counter image (listed as images $\#384$ and $\#468$ in \citet{ebbels1996}).  We separate the fold image into two images and identify four components, giving us four sets of triple images from this system (referred to as S2.1, S2.2, S2.3 and S2.4). 
144: 
145: System S3 is a faint star-forming doubly imaged system at a high redshift $z=5.576$ discovered by \citet{ellis2001}.  
146: 
147: System S4 is a triply imaged submillimeter source at $z=2.515$ reported by \citet{kneib2004b}.  In each image they identify three components labelled $\alpha, \gamma, \beta$.  We identify all three images in the ACS data.  However, we only detect the $\alpha$ and $\beta$ components.  We include both as constraints on the model (referring to them as S4.1 and S4.2 respectively).  Image S4.1 seems to consist of further three subcomponents.  As we do not clearly distinguish them in all three images, we do not add them as separate constraints.
148: 
149: The fifth system, S5, is a triply imaged system, believed to be the triply imaged outskirts of an associated singly imaged galaxy.  It is believed that the galaxy partially crosses the caustic, with the central part being singly imaged and the outskirts being triply imaged.  \citet{kneib2004} measured a redshift of $z=2.515$ for the galaxy (labelled as $\#273$ in their notation) but an independent spectroscopic redshift has not been determined for the faint triply imaged component.  Although it is likely (and consistent with the models) that they belong to the same background source, it is not certain, making the redshift determination for S5 less certain.
150: 
151: The sixth system, S6, consists of an arc in two parts at $z=1.034$ \citep[][called 289 in their notation]{kneib1996, swinbank2003}.  
152: 
153: The seventh system, S7, first reported by \citet{kneib1996}, consists of a merging image and a counter image (also known as \#444 and H6 respectively).  \citet{ebbels1998} measured a spectroscopic redshift at $z=1.03$ for the merging image
154: %, which is consistent with their photometric redshift of $z_{phot}=1.1\pm0.1$, 
155: but this is listed as a tentative identification (as the only identified feature in the spectra was the Fe II doublet at $\lambda\lambda2587$, $2600$).  
156: 
157: The final previously known multiply imaged system, C1, is a high redshift ($z\sim7$) triply imaged system \citep{kneib2004}. \citet{egami2005} further constrained the redshift of the system to lie in the range $6.6-6.8$.  For the purposes of the modeling we will assign $z=6.7$ to this system, noting that changing the redshift within the range $6.6-6.8$ does not affect the model.
158: 
159: 
160: \subsection{New candidate systems}
161: We propose 5 new candidate systems which we have identified in the new ACS data.  We do not have spectroscopic redshift determination for these systems, but estimate their redshifts using the model prediction.  
162: 
163: \begin{figure}
164: \epsscale{.35}
165: \plotone{f3a.eps}
166: \plotone{f3b.eps}
167: \plotone{f3c.eps}
168: \caption{The new candidate system, C2, identified in the ACS images.  The three panels show the three images of C2, marked in Figure~\ref{fig:overview}.  The colors are the same as in Figure~\ref{fig:overview}.  The system is characterized by a central 'yellow' spot, flanked by blue spots.  \label{fig:newsys}}
169: \end{figure}
170: The first candidate is a triply imaged system, which we call C2.  In Figure~\ref{fig:newsys} we show color stamps of the three images, and the locations are given in Table~\ref{tab:systems}.  The morphology of the three images, suggests that it is of the same background source, with a 'yellow' spot in the middle, flanked by a fainter blue on the sides.  
171: 
172: \begin{figure}
173: \epsscale{.35}
174: \plotone{f4a.eps}
175: \plotone{f4b.eps}
176: \plotone{f4c.eps}
177: \caption{The new candidate arcs identified in the ACS images.  {\it Left panel:}  The candidate arc C3 (green circles).  {\it Middle panel:}   The candidate systems C4 (green circles) and C5 (yellow circle).  The red circles are the locations of two of the images of system C1 (at $z=6.7$), while the blue circle shows the location of images belonging to system S4 (components a and b).   {\it Right panel:}  The candidate system C6 (green circles).  The high redshift system S3 ($z=5.6$) is marked with red circles.  \label{fig:newarcs}}
178: \end{figure}
179: We also identify a new faint arc in the north-west part of the cluster, which we call C3 (see Figure~\ref{fig:newarcs}).  
180: In the vicinity of system 6, we further identify two arclike images, which we label C4 and C5.  The first, C4, is a very faint blue double arc, while in the second, C5, two bright spots of a merging arc can be seen.    
181: Finally, a pair of blue extended images, C6, is found in the vicinity of system 3.
182: These candidate systems  are shown in a color image in Figure~\ref{fig:newarcs} and are listed in Table~\ref{tab:systems} along with their redshift estimates (see Section~\ref{sec:redshifts} for details).  The positions of these images are used to constrain the final model, but their redshift is kept free in the modeling. 
183: 
184: \subsection{Spectroscopic redshift for an arc}
185: We have used the Low Resolution Imager and Spectrograph (LRIS, \citet{oke1995}) at Keck to measure the spectroscopic redshift of an arc, S8, to the south-east of the second dark matter clump (see Figure~\ref{fig:overview}).  Two exposures of 1800 seconds were obtained on June 29th 2007 with a $175\arcsec\times1\arcsec$
186:  long slit placed along the
187: brighter components of this arc (Figure~\ref{fig:a2218_S8_slit}).
188: \begin{figure}
189: \epsscale{0.8}
190: \plotone{f5.eps}
191: \caption{The alignment of the slit for the spectra of S8.  The box shows the area used to extract the spectrum (see Figure~\ref{fig:S8_spectrum}).\label{fig:a2218_S8_slit}}
192: \end{figure}
193: A $600$~l~mm$^{-1}$ grism blazed at $4000$~\AA~and a $400$~l~mm$^{-1}$
194: grating blazed at $8500$~\AA~were used in the blue and red
195: channels of the instrument, both lightpaths being separated by a dichroic at $5600$~\AA. The corresponding dispersions are $0.6$/$1.85$~\AA~and
196: resolutions  are $4.0$/$6.5$~\AA~in the blue/red channel, respectively.
197: 
198: \begin{figure}
199: \includegraphics[angle=270, scale=0.3]{f6.eps}
200: \caption{The spectrum of system S8.  The plot shows lines for the arc at $z=2.74$ (blue) and the nearby galaxy (\#617) at $z=0.177$ (red).  The spectrum shows Ly$\alpha$ absorption and emission and a number of absorption lines in the UV which are marked in the figure.  At this redshift, $z=2.74$, the model predicts S8 to be singly imaged.  \label{fig:S8_spectrum}}
201: \end{figure}
202: The resulting spectrum (see Figure~\ref{fig:S8_spectrum}) is dominated by the light coming
203: from the very bright neighbor cluster member at $z=0.177$, yet shows
204: Lyman-$\alpha$ in emission and UV absorption features of SiII, CI, CII and
205: CIV from the arc, giving a redshift $z=2.74$. These features are not compatible with any line at the redshift of the lens galaxy, giving a
206: redshift class of 2 for this spectrum, with $75\%$ probability of being correct, following the classification of \citet{lefevre1995}.  Our model is not consistent with this object being multiply imaged, and when including it as a constraint as multiply imaged (using spots in the arclike structure as independent images), the model predicts additional counter images which are not seen.  We therefore conclude that although this arc is lensed, it is not multiply imaged, and include it as a singly imaged system in the model constraints.
207: 
208: \section{Modeling}
209: \label{sec:modeling}
210: Following e.g., \citet{kneib1996, smith2005, limousin2007} we do a parametric mass reconstruction of Abell 2218.  The multiply imaged systems form the basis of our analysis, with each $n$-tuply imaged system giving $2(n-1)$ constraints if the redshift is known.  
211: As the number of constraints needs to be greater than the number of free parameters in our fit, we are limited by the known multiply imaged systems.  The MCMC sampling and optimization is done using the Lenstool software \citep{kneib1993, jullo2007}.   The optimization is performed in the source plane, as it is faster and has been found to be equivalent to optimizing in the image plane \citep{halkola2006, jullo2007}.  The new version of Lenstool \citep{jullo2007}, gives a distribution of values for each of the parameters, thus making it possible to estimate the uncertainty of the parameters.  In addition, it returns the Evidence of a model, which is a measure of how likely a model is, penalizing unnecessarily complicated models.  Thus, a model with a lower $\chi^2$ but more free parameters, may have a lower Evidence, suggesting that we should choose the simpler model.
212: For each image we find its rms (root-mean-square) value for its position in the source plane, rms$_s$, and image plane, rms$_i$, given by
213: \begin{eqnarray}
214: \mathrm{rms}_s&=&\sqrt{\frac{1}{n}\sum_{j=1}^{n}\left(X^j_s-<X^j_s>\right)^2}\\
215: \mathrm{rms}_i&=&\sqrt{\frac{1}{n}\sum_{j=1}^{n}\left(X^j_{\mathrm{obs}}-X^j\right)^2}
216: \end{eqnarray}
217: where $n$ is the number of images for the system, $X_s$ is the position in the source plane, $X$ the position in the image plane and $X_{\mathrm{obs}}$ the observed position in the image plane.  The overall rms is defined by summing and averaging over all the images for all the systems.
218: A detailed overview of the Lenstool software, including definitions of $\chi^2$ and the Evidence, can be found in \citet{jullo2007}.
219: 
220: \subsection{Model components}
221: \label{sec:dm_clumps}
222: \label{sec:scaling}
223: We refer to the individual components of the model as 'clumps', where each clump is denoted by its position, ellipticity, position angle and the parameters of the profile used to describe it.  The parametric profile we use is the dual Pseudo Isothermal Elliptical mass distribution (dPIE, derived from \citet{kassiola}) and its form and main properties are given in Appendix~\ref{app:piemd}.  The dPIE profile is defined in Lenstool by three parameters,  the core radius $a$, the scale radius $s$ and a fiducial velocity dispersion $\sigma_{\mathrm{dPIE}}$.  For $a<r<s$, the profile behaves as $\rho\sim r^{-2}$, while it falls like $r^{-4}$ in the outer regions.  For a vanishing core radius the scale radius corresponds to the radius containing half the 3D mass.
224: 
225: The clumps are called 'large scale clumps' if their mass within the outermost multiply imaged constraint is greater than $20\%$ of the total mass.  Smaller clumps are referred to as 'galaxy scale clumps', and are in general associated with the cluster members.  Large scale dark matter clumps are optimized independently.  The redshift is fixed at the location of the cluster, but the central position (R.A., Dec.), the ellipticity and the position angle (P.A.) are allowed to vary.  
226: 
227: We associate a galaxy scaled clump with each of the cluster galaxies, fixing the central location, ellipticity and position angle of the mass distribution to that of the light distribution.   A few of the cluster galaxies are fitted individually (optimizing their $a$, $s$ and $\sigma_{\mathrm{dPIE}}$, see Table \ref{tab:model} for an overview), but due to a lack of constraints, most of the galaxies are optimized in a combined way.  
228: The parameters ($a, s, \sigma_{\mathrm{dPIE}}$) are optimized together using the following scaling relations for the luminosity $L$:
229: \begin{eqnarray}
230: a=a^\star \left(\frac{L}{L^\star}\right)^{1/2}\\ s=s^\star \left(\frac{L}{L^\star}\right)^{1/2} \\  \sigma_{\mathrm{dPIE}}=\sigma_{\mathrm{dPIE}}^\star \left(\frac{L}{L^\star}\right)^{1/4}.
231: \label{eq:scaling}
232: \end{eqnarray} 
233: For a discussion of these scaling relations we refer to \citet{limousin2007} and \citet{halkola2007}.  For a given $L^\star$ luminosity, we fix $a^\star=0.25$~kpc, while $\sigma_{\mathrm{dPIE}}^\star$ and $s^\star$ are allowed to vary.   We note that fixing the core radius to be small, makes the profile approximately equivalent to the profile used by \citet{brainerd1996} to describe galaxies (see also Appendix~\ref{app:piemd}).  Following \citet{depropris1999} we take the apparent magnitude of an $L^\star$ in the K-band to be $K^\star=15$ at $z=0.171$ (redshift of Abell 2218).
234: \input{tab2}
235: 
236: \section{The Strong Lensing Mass Distribution} 
237: \label{sec:analysis}
238: \input{tab3}
239: In this section we present our strong lensing model (optimized in the source plane), discuss its implications and compare it to previous results.  All reported error bars correspond to $68$\% confidence levels.  For our best model we find rms$_s=0\farcs12$, which gives rms$_i=1\farcs49$ (see Table~ref{tab:chires}).\footnote{A parameter file containing all the following information, and which can be used with the Lenstool software package, along with a FITS file of the mass map generated from the best-fit model are available at \url{http://archive.dark-cosmology.dk/}.  These can be used to find relevant critical lines for using Abell 2218 as a gravitational telescope and to predict/confirm candidate lensed systems. }
240: 
241: \subsection{A bimodal mass distribution}
242: \label{sec:mass_dist}
243: \begin{figure*}
244: \epsscale{.45}
245: \plotone{f7a.eps}
246: \plotone{f7b.eps}
247: \caption{The mass density map and its contours (black).  The maps are $300\arcsec\times300\arcsec$ and centered on the BCG with North being up and East being left.  The red contours show the light distribution (left panel) and the X-ray distribution (right panel).   The overall shape of the light-contours and the mass map contours are similar, with the light being slightly more 'pear'-shaped, as it broadens in the SE direction.  The overall shape of the light-contours and the mass map contours also agree, although the X-rays become more spherical for  in the outer regions.  See section~\ref{sec:bimodal}.  \label{fig:massmap}}
248: \end{figure*}
249: We show the mass density model in Figure~\ref{fig:massmap} and critical lines predicted by the model at $z=0.702, 2.515, 6.7$ in Figure~\ref{fig:overview}.  The total projected mass as a function of radius, centered on the BCG, is shown in Figure~\ref{fig:m_1d}.  
250: \begin{figure}
251: \epsscale{1.}
252: \plotone{f8.eps}
253: \caption{Total projected mass as a function of aperture radius (centered on the BCG) for different model components.  The two large scale clumps, DM1 and DM2, contribute a similar amount to the mass, but the halo associated with the BCG dominates in the inner regions, and the combined halo of DM1 and the BCG dominates the mass in the region where most of the constraints lie (within $80\arcsec$).  The galaxies (excluding the BCG) only contribute a small amount to the overall mass, of the order of $5-6$\%. \label{fig:m_1d}}
254: \end{figure}
255: We find that the mass distribution is strongly preferred to be bimodal, even for a simple model where all the galaxies are modelled based on the scaling relations, but the dark matter clumps are optimized independently using only systems 1 and 2 as constraints (28 constraints in all).  The second dark matter clump is located to the south east of the central clump for this simple model, near galaxy \#617, which is in agreement with previous models of Abell 2218.  The bimodal mass distribution is also strongly preferred when we add more constraints.  We will refer to the two large scale dark matter clumps as DM1 (for the one associated with the BCG) and DM2 (for the one associated with galaxy \#617).  We also constructed a three clump model, but its bayesian Evidence was worse, leading us to reject it as our best model.
256: 
257: In previous models, the location of the second clump has been fixed at the center of the brightest galaxy in the south east corner (\#617), but we find that when the location is allowed to vary it is offset from this galaxy by $\sim 35\arcsec$.   
258: We also find that DM2 has high ellipticity and is comparable in mass to DM1, although significantly less massive than the DM1 and BCG halos combined (see Figure~\ref{fig:m_1d}).    We note that the light distribution is similar to the derived matter distribution (see Figure~\ref{fig:massmap}), supporting the finding that there is a significant matter component in the vicinity of DM2  (see also section~\ref{sec:bimodal}).
259: 
260: \subsection{Dark matter halos of galaxies}
261: \label{sec:dm_galaxies}
262: The potential contribution of the individual dark matter halos associated with cluster galaxies
263: was first proposed by \citet{natarajan1997}. Typically it is found that contribution of
264: dark matter halos associated with the bright, early-type cluster galaxies in the inner regions are required to explain the positions and geometries of multiply lensed images
265: in the strong lensing regime \citep{meneghetti2007}.
266: 
267: We fit three galaxies individually (for the parameters of these galaxies see Table~\ref{tab:model}), two are the brightest galaxies near the centers of the large scale dark matter clumps ($\#1193$ - the BCG - and $\# 617$), while the third is an elliptical galaxy important to the lensing of system S1.  The halo associated with the BCG is very massive, and in particular in the inner $10\arcsec$ it dominates the mass distribution (see Figure~\ref{fig:m_1d}).  Even at the outermost Einstein radii of a multiply imaged system ($80\arcsec$) it still contributes around $10\%$ of the mass of the cluster. 
268: 
269: System S1 has seven images, and two of the pairs are strongly affected by nearby galaxies (an elliptical $\# 1028$ and a spiral $\# 993$).  Keeping the parameters of the spiral free did not significantly affect the model, while adding the elliptical did.  Although both galaxies would at first glance appear equally important to be fitted individually due to their strong effects on system S1, \begin{figure}
270: \epsscale{1.}
271: \plotone{f9.eps}
272: \caption{The Faber-Jackson relation for the K-band using $\sigma_0$ measurements from \citet{ziegler2001}.  Also plotted is the central velocity dispersion of the BCG ($\# 1193$) as obtained by \citet{jorgensen1999}.   The individually marked galaxies are discussed in section~\ref{sec:scaling} and \ref{sec:dm_galaxies}. \label{fig:fab_jack}}
273: \end{figure}
274: Figure~\ref{fig:fab_jack} provides an important insight to why only $\# 1028$ needs to be included individually.  This is because the scaling relations (see \S~\ref{sec:scaling}) assume that the galaxies follow a Faber-Jackson relation (Equation~\ref{eq:scaling}) without any scatter.  As can be seen from Figure~\ref{fig:fab_jack} and  discussed in \citet{ziegler2001} the galaxies in Abell 2218 show a significant scatter and therefore the above scaling relations can give very unrealistic values for individual galaxies which lie far from the mean behavior.  The elliptical $\# 1028$ is one such galaxy, with a measured central velocity dispersion which is greater than the central velocity dispersion of the BCG as measured by \citet{jorgensen1999}, while $\# 993$, although a spiral galaxy, is more consistent with the mean.
275: 
276: For the other cluster galaxies, they are included in the model via the scaling relations given in Section~\ref{sec:scaling}.  For $L^{\star}$ corresponding to $K=15$ at $z=0.171$ we find $\sigma_{\mathrm{dPIE}}^\star=185^{+10}_{-11}$~km~s$^{-1}$ and $s^\star=2\farcs9^{+0.5}_{-0.3}$, but we note that there is some degeneracy between the two values (see discussion in section~\ref{sec:degeneracy}).
277: 
278: 
279: 
280: \subsection{Comparison with measured velocity dispersions}
281: We use the velocity dispersion measurements, $\sigma_0$, of \citet{ziegler2001}  and \citet{jorgensen1999} for Abell 2218 to compare with the results from the lens model.   \citet{ziegler2001} have a total of 48 galaxies in their sample, of which nearly half fall within the ACS image and are included in our galaxy catalogue, while the  \citet{jorgensen1999}  data contains the BCG and seven other cluster members in our galaxy catalogue.  Using the velocity dispersion measurements for Abell 2218 cluster members from \citet{ziegler2001} we plot the Faber-Jackson relationship for the K-band data in Figure~\ref{fig:fab_jack}.  We then calculate the mean and the standard deviation of galaxies with K-band magnitudes in the range from $14.8$ to $15.2$, excluding the individually fitted galaxy (galaxy $\# 1028$ in our catalog, or $\# 1662$ in the notation used by \citet{ziegler2001}).  This gives $\sigma_{0,\mathrm ziegler}^\star\approx195\pm35$~km~s$^{-1}$ for a typical K=15 galaxy.
282: 
283: \begin{figure}
284: \epsscale{1.}
285: \plotone{f10.eps}
286: \caption{The velocity dispersion of the dPIE, $\sigma_{\mathrm{dPIE}}$ vs. $\sigma_0$ measurements from \citet{ziegler2001} and \citet{jorgensen1999}.  The line is not a fit, but shows the relationship $\sigma_0=0.85\sigma_{\mathrm{dPIE}}$ found in Section~\ref{sec:app_veldisp}. \label{fig:compare_vel_disp}}
287: \end{figure}
288: In Figure~\ref{fig:compare_vel_disp} we plot $\sigma_0$ from \citet{ziegler2001} and \citet{jorgensen1999} vs. the fiducial velocity dispersion, $\sigma_{\mathrm{dPIE}}$ for the BCG, the two individually fitted galaxies and $L^{*}$.  
289:  Although the values from direct velocity dispersion measurements can not be directly related to the values obtained from the dPIE profile (as the measured values are calculated based on an isothermal profile and are not aperture corrected) we find that they are consistent with $\sigma_{\mathrm{dPIE}}$ being related to the measured velocity dispersion by $\sigma_0\approx0.85\sigma_{\mathrm{dPIE}}$ as found in Section~\ref{sec:app_veldisp}.  
290: 
291: 
292: 
293: 
294: \subsection{Redshift estimates of the new candidate systems}
295: \label{sec:redshifts}
296: We estimated the redshifts of the new candidate systems using the model predictions.  The three component candidate system C2 is found to have $z=2.6\pm0.1$.  For the merging arcs we find $z=2.8\pm0.6$ for C3, $z=2.2\pm0.2$ for C4,  $z=2.6\pm0.3$ for C6 while C5 is poorly constrained with $z=2.3\pm0.8$. The estimated redshifts are reported in Table~\ref{tab:systems}.  These redshifts are consistent with a preliminary photometric redshift analysis done using the Hyperz photometric redshift code \citep{bolzonella2000}.
297: 
298: \subsection{A strongly lensed galaxy group at z=2.5}
299: Three multiply imaged systems, S2, S4 and S5, all have the same redshift of $z=2.515$.  To check whether these three systems belong to a background galaxy group, we lens them back to their source plane (see Figure~\ref{fig:2515}).  
300: \begin{figure}
301: \epsscale{1.0}
302: \plotone{f11.eps}
303: \caption{A plot of the source plane at $z=2.515$ showing systems S2, S4 and S5 and the caustic lines.  Their maximum separation is around $130$~kpc, consistent with them belonging to the same group of galaxies.  Also shown are the candidate systems C2, C3, C4, C5 and C6 when their redshift is assumed to be $z=2.515$.  Under that assumption, their locations are consistent with them also belonging to this group of galaxies.\label{fig:2515}  }
304: \end{figure}
305: 
306: We find that S4 is in the middle with S5 and S2 at a separation of $5\farcs4$ and $10\farcs3$ respectively, with the maximum separation being $15\farcs7$, corresponding to $127$~kpc in the source plane ($1\arcsec = 8.06$ kpc at $z=2.515$), suggesting that the three systems belong to the same background group of galaxies.  This is in agreement with the findings of \citet{kneib2004b} who found the maximum separation of the systems in the source plane to be $130$~kpc.  It may be of interest to do a dedicated search for more systems at $z=2.515$, either multiply imaged or singly imaged, to further study this high redshift group of galaxies, and we note that all the candidate systems have a predicted $z$ consistent with $z=2.515$.  If we assume that these candidate systems have $z=2.515$, their location in the source plane is consistent with them belonging to this same group (see Figure~\ref{fig:2515}).
307: 
308: 
309: \subsection{Comparison with previous results and weak lensing}
310: Although the overall results of our model are in agreement with previous models of Abell 2218, they have found the second clump to be less massive \citep[see e.g.,][]{kneib1996, abdelsalam1998, smith2005,natarajan2007}.  There are several possible reasons for the discrepancy, with the first one being the new constraints we use in this model.  We have therefore redone the modeling using only the previously known spectroscopically confirmed systems, but we still find that DM2 is massive and with a large core.  It is also possible, that the previous models constructed using the older version of the lenstool package which did not involve MCMC sampling, have been caught in local minima.  Indeed, we do find when forcing DM2 to be smaller, a comparable $\chi^2$ but with the posterior probability distribution of the core radius $a$ pushing toward the upper limit of the input range.  However, this can not explain the discrepancy found for the non-parametric model of \citet{abdelsalam1998}.  
311: 
312: A third possible explanation is that the models of  \citet{abdelsalam1998, smith2005,natarajan2007} all incorporated weak lensing to 'anchor' the outer part of the mass distribution.  Using our model to predict the weak lensing shear profile at large radii, we find that our profile overestimates the signal compared to the measured signal of \citet{bardeau2007} from ground based observations with the Canada-France-Hawaii Telescope (see Figure~\ref{fig:weak_lens}).  
313: \begin{figure}
314: \epsscale{1.0}
315: \plotone{f12.eps}
316: \caption{ The weak lensing signal predicted by our model (solid line) compared to the weak lensing found by \citet{bardeau2007} (dotted line - $1\sigma$ error bars).   The \citet{bardeau2007} shows a very flat inner profile, characteristic of contamination of the background galaxy catalog by cluster members.  Therefore, we do not expect an agreement in the inner regions.  In the outer regions, where the contamination should be negligible, we find that our model overpredicts the weak lensing signal, but we note that the prediction is an extrapolation of a strong lensing model with constraints within $100\arcsec$.  \label{fig:weak_lens}  }
317: \end{figure}
318: The central part of the \citet{bardeau2007} profile is flat, characteristic of contamination of the background galaxy catalog by foreground cluster members (see \citet{limousin2007} for discussion on contamination).  Therefore, we do not expect to have an agreement between the weak lensing result and the strong lensing result.  At around $\sim300-400\arcsec$ the contamination should be negligible, and there we find that the agreement is better although our model still overpredicts the signal (although they are consistent within $2\sigma$).  We stress however that at this radius, we are extrapolating a strong lensing model based on constraints within $100\arcsec$, and the prediction becomes more uncertain the further we go out.
319: 
320: \section{Degeneracies}
321: \label{sec:degeneracy}
322: In Section~\ref{sec:analysis} we presented the strong lensing model of Abell 2218.
323: In this Section we study the degeneracies of our parametric strong lensing modeling, both those inherent to the parametric profile and the model components.  \citet{jullo2007} have also discussed the various degeneracies of the dPIE profile in lensing, addressing how different image configurations can break some degeneracies.
324: 
325: Lensing most strongly constrains the projected mass, and therefore we expect to see degeneracies arising from Equation~\ref{eq:mass2D}, although ellipticity may complicate that picture further.   In agreement with \citet{jullo2007} we find for the large scale halos, that the scale radius, $s$, is poorly constrained (as it lies beyond the outermost multiply imaged system).  This is also the case for the BCG and the \#617 which have large scale radii.  For the smaller halos, i.e., the scaled galaxies and \#1029, the scale radius, s, is small enough to affect the projected mass, and we find that lower $s$ requires higher $\sigma_{\mathrm{dPIE}}$ to keep the mass constant 
326: \begin{figure}
327: \epsscale{.50}
328: \plotone{f13a.eps}
329: \plotone{f13b.eps}
330: \caption{The degeneracy between the scale radius, $s$, and $\sigma_{\mathrm{dPIE}}$ for \#1028 (left) and the scaled galaxies (right).  These arise from equation \ref{eq:mass2D} which gives the aperture mass, showing that to keep the enclosed mass constant, an increase in the scale radius requires a lower value of $\sigma_{\mathrm{dPIE}}$.  The contours correspond to $1\sigma$, $2\sigma$ and $3\sigma$ confidence levels.  \label{fig:sigma_scale}  }
331: \end{figure}
332: (see Figure~\ref{fig:sigma_scale}).  However, the favored region for $s$ is always small, consistent with the tidal stripping of cluster galaxies proposed by \citet{natarajan1998,natarajan2002a,natarajan2002b,limousin2007AA, limousin2007_trunc}.
333: 
334: 
335: \begin{figure*}
336: \epsscale{.20}
337: \plotone{f14a.eps}
338: \plotone{f14b.eps}
339: \plotone{f14c.eps}
340: \plotone{f14d.eps}
341: \plotone{f14e.eps}
342: \caption{The degeneracy between the core radius, $a$, and $\sigma_{\mathrm{dPIE}}$ for DM1, DM2, BCG, \#1028, \#617 from left to right.  These again arise from Equation~\ref{eq:mass2D} requiring the aperture mass to remain constant.  They show that to keep the mass constant, the $\sigma_{\mathrm{dPIE}}$ must increase when the core radius $a$ is increased.  This is consistent with the findings of \citet{jullo2007} for the dPIE profile and the findings of \citet{kochanek1996} for more general profiles with core.  The contours correspond to $1\sigma$, $2\sigma$ and $3\sigma$ confidence levels.\label{fig:sigma_core}}
343: \end{figure*}
344: Also, in agreement with \citet{jullo2007}, and as discussed by e.g. \citet{kochanek1996} for more general cored profiles, a larger core radius, $a$,  requires higher $\sigma_{\mathrm{dPIE}}$ to keep the mass constant (see Figure~\ref{fig:sigma_core}).  As our model has both a large core radius $a$ and $\sigma_{\mathrm{dPIE}}$ for DM2 compared to previous models of A2218, we have explored whether this degeneracy can reduce both values.  However, when forcing both $a$ and $\sigma_{\mathrm{dPIE}}$ to be smaller, the posterior distribution of $a$ always pushes to the upper limit of the input range.  Therefore, we conclude that this degeneracy is not the explanation for the high $a$ and $\sigma_{\mathrm{dPIE}}$ we find for DM2.  This flat profile is further supported by the 'blind tests' we perform in Section~\ref{sec:blind_test}.
345: 
346: In addition to the degeneracies associated with the profile itself, there may be degeneracies associated with the components we include in the model, i.e., are all the components we include necessary and are they independent of each other?  As mentioned in section~\ref{sec:mass_dist}, the model is strongly preferred to be bimodal.   
347: \begin{figure}
348: \epsscale{.5}
349: \plotone{f15a.eps}
350: \plotone{f15b.eps}\\
351: \plotone{f15c.eps}
352: \plotone{f15d.eps}\\
353: \plotone{f15e.eps}
354: \plotone{f15f.eps}
355: \caption{The 2D posterior distribution of the parameters of DM2 vs. DM1 and BCG.  There is a clear degeneracy between the parameters of the two large scale components and the BCG.   Therefore, although the model prefers the inclusion of all the components, the values of their parameters are not fully independent.  The contours correspond to $1\sigma$, $2\sigma$ and $3\sigma$ confidence levels.\label{fig:DM1vsDM2deg}  }
356: \end{figure}
357: There is however degeneracy between the parameters of DM1 (and the BCG) and DM2 which is visualized in Figure~\ref{fig:DM1vsDM2deg}.  As for the BCG, we find that the Evidence is marginally higher when both the DM1 and BCG  are included in the model, suggesting that the data are sufficient to model both separately. Therefore we conclude, that although the model prefers the inclusion of all three components, the values for their parameters are not fully independent.  Finally, we look at the parameters for the scaled galaxies with respect to DM1 and DM2 
358: \begin{figure}
359: \epsscale{.5}
360: \plotone{f16a.eps}
361: \plotone{f16b.eps}\\
362: \plotone{f16c.eps}
363: \plotone{f16d.eps}
364: \caption{The 2D posterior distribution of the scaled galaxy parameters ($\sigma^\star$, $s^\star$) with respect to the core radii of the large scale dark matter clumps DM1 and DM2.   Although there is a degeneracy between the parameters, the scale radius $s^\star$ is well constrained at low values.  The contours correspond to $1\sigma$, $2\sigma$ and $3\sigma$ confidence levels.\label{fig:POTvsDMdeg}  }
365: \end{figure}
366: (see Figure~\ref{fig:POTvsDMdeg}).  Although there is degeneracy the preferred value, in particular for the scale radius $s^\star$, is well constrained at low values ($\lesssim4\arcsec$), consistent with the tidal stripping scenario.
367: 
368: \begin{figure*}
369: \epsscale{.5}
370: \plotone{f17a.eps}
371: \plotone{f17b.eps}
372: \caption{ {\it Left panel:} The mass map from Section~\ref{sec:need_DM} using only halos associated with the galaxies.   {\it Right panel:} The mass map from Section~\ref{sec:analysis} consisting of large scale dark matter halos and galaxy halos.  The mass map including the large scale dark matter halos, which is much smoother, provides a significantly better fit to the data than the clumpy galaxy-only model (rms$_s=0\farcs12$ vs. rms$_s=1\farcs62$).  The maps are $300\arcsec\times300\arcsec$ and are centered on the BCG with North being up and East being left. \label{fig:need_DM}  }
373: \end{figure*}
374: \section{Reliability of the mass map}
375: \label{sec:reliability}
376: We have presented the mass map inferred from our strong lensing analysis in Section~\ref{sec:analysis} (Figure~\ref{fig:massmap}).
377: In short, we find evidence for a bimodal mass distribution described by two large scale smooth dark matter clumps, on top of which
378: we add some extra mass associated with the cluster galaxies.
379: Moreover, we associate a significant mass concentration with the location of the BCG galaxy.
380: These two conclusions are believed to be robust, and the aim of this section is to perform a series of tests in order to check 
381: our main findings regarding how the dark matter is distributed in Abell~2218.
382: 
383: \subsection{Do we need the smoothly distributed dark matter component?} 
384: \label{sec:need_DM}
385: We wish to test whether the smoothly distributed component is necessary to reproduce a good fit, or whether mass associated with the galaxies alone can provide an equally good fit.  To this end,
386: we construct a model where the mass is only in halos associated with the galaxies, without the inclusion of any smoothly distributed dark matter component.  The galaxies are included in a scaled manner, allowing the $\sigma_{\mathrm{dPIE}}^\star$ and $s^\star$ to move to higher values to increase their mass.
387: As a result of the optimization, we find a very poor fit (rms$_s=1\farcs62$ instead of $0\farcs12$ for the model from Section~\ref{sec:analysis}).     As the model from Section~\ref{sec:analysis} individually fits three of the galaxies, we need to check whether this difference arises from these extra free parameters.  We therefore redo the analysis using only DM1 and DM2, and scaling all the cluster galaxies.  The resulting fit is worse than the original (rms$_s=0\farcs22$) but still significantly better than the fit without any smooth component.
388: 
389: 
390: 
391: The resulting mass map is shown in Figure~\ref{fig:need_DM} alongside our model from Section~\ref{sec:analysis}.  The mass map without a smooth component is very 'clumpy', whereas the latter is smooth.
392: \begin{figure}
393: \epsscale{1.0}
394: \plotone{f18.eps}
395: \caption{The total mass as function of radius (centered on the BCG) for the two models.   Although the total mass within any given radii is similar, the pure-galaxy-mass map gives a significantly worse fit to the data (rms$_{s}=0.12$ vs rms$_{s}=1.62$), confirming the need for a large scale dark matter halo to accurately fit the data (see Section~\ref{sec:need_DM}). \label{fig:need_DM_total}  }
396: \end{figure}
397: As expected, the enclosed mass derived from each model is comparable for any radius where we have observational constraints (see Figure~\ref{fig:need_DM_total}).  Therefore, the poorness of the 'clumpy' fit is not due to the fact that it is not massive enough to reproduce the lensing constraints.  Moreover, it is worth noting that the 'clumpy' model is not very satisfactory in the sense that it describes the 
398: cluster galaxies as being very massive (around a few times $10^{12} M_\sun$ on average), which is not compatible with independent galaxy-galaxy lensing probes of
399: cluster galaxy masses \citep[see e.g., ][]{mandelbaum2005b}.  We therefore interpret the difference in the goodness of fit as evidence for the dark matter being distributed smoothly in the cluster, with only small perturbations from the cluster galaxies.  This is further supported by the X-ray emission (see Section~\ref{sec:bimodal}).
400: 
401: \input{tab4}
402: \subsection{Sensitivity to the galaxy scale perturbers}
403: \label{sec:sens_gal}
404: We have already seen that individual galaxies are important to the overall lens model if they are close to a multiply imaged system (see also Section~\ref{sec:blind_test}), but in general the cluster galaxies only add small perturbations to the overall 
405: mass distribution of the cluster, with $\sim5$-$6$\% of the total mass being associated with the galaxy sized halos (excluding the BCG, see Figure~\ref{fig:m_1d}).  
406: To check whether those small scale perturbations are important, we try using a subset of the galaxy catalog to see if this affects the results.  
407: Our original catalog contains $197$ cluster members down to $K=19.6$ (including the individually fitted galaxies), but we also create catalogs using a cut off magnitude of 
408: $K=19, 18, 17, 16$ with $145, 110, 62, 35$ members respectively.  
409: We find that the overall quality of the fit is only weakly affected (see Table~\ref{tab:gal_cutoff}), suggesting that for the purposes of lensing studies, it is sufficient to include the brightest 
410: galaxy members in the modeling in addition to those which clearly locally perturb given multiply imaged systems.  
411: This point is important in order to save computing time, as increased number of clumps, even if modelled by scaling, significantly increases the required CPU (Central Processing Unit) time.
412: 
413: 
414: \begin{figure*}
415: \epsscale{.4}
416: \plotone{f19a.eps}
417: \plotone{f19b.eps}
418: \caption{Density plots for the positions of the galaxy sized clumps for the models discussed in section~\ref{sec:blind_test}.  The locations of the clumps for the best model are marked with red stars.  The size of the stars is proportional to their velocity dispersion.  The two big black stars show the two large dark matter clumps, their size again proportional to the velocity dispersion.  {\it Left panel:}  Configuration (A):  The model consists of five freely placed galaxy sized halos in addition to DM1 and DM2.   {\it Right panel:}  Configuration (B)  The model consists of ten freely placed galaxy sized halos in addition to DM1 and DM2.   Both configurations place galaxy scaled clumps where the BCG and the galaxies responsible for the local splitting of S1 are located.  The maps are $200\arcsec\times200\arcsec$ and are centered on the BCG with North being up and East being left. \label{fig:blind_test}}
419: \end{figure*}
420: 
421: \subsection{Blind tests - can we localize galaxy scale subtructure?}
422: \label{sec:blind_test}
423: Normally galaxy sized perturbers are added to the model by placing a dark matter clump at the location of a known cluster member.  This method is by construction not able to detect dark substructure directly.  To test how sensitive we are to these substructures, luminous or dark, we perform 'blind tests', where 
424: we model the cluster with (A) five and (B) ten freely placed galaxy sized dark matter clumps, in addition to the two large scale dark matter clumps DM1 and DM2.  
425: 
426: In order to limit the number of free parameters, we fix both $a=1$~kpc and $s=40$~kpc for the galaxy sized dark matter clumps, but we optimize all the parameters of the large scale clumps as before.   We allow the positions of the galaxy scale clumps to vary from $-100\arcsec$ to $100\arcsec$, corresponding roughly to the ACS field of view.  The velocity dispersion, $\sigma_{\mathrm{dPIE}}$, of each clump is allowed to vary from $0$-$500$~km~s$^{-1}$.  
427: 
428: 
429: The results are shown in Figure~\ref{fig:blind_test}, showing a density plot of the position of the clumps for the 1000 best realizations (in the lowest $\chi^2$ sense), for both configurations (A) and (B).  The locations of the clumps corresponding to the best realization are marked by stars.
430: We find that both configurations give a good fit to the data (rms$_s=0.17$ and rms$_s=0.19$ for (A) and (B) respectively).  Configurations (A) and (B) are consistent with each other: setup (A) finds five well defined clumps. Setup (B) finds the same well defined five clumps, whereas it tries to marginalize the
431: five extra clumps that seem not to be needed in the optimization.
432: 
433: 
434: For both configurations, the model places two large halos where the BCG is, clearly demonstrating the need for significant mass at the location of the BCG galaxy.
435: Another common feature, is the lack of a galaxy sized clump in the vicinity of DM2, suggesting a very flat profile is preferred there.  Both configurations place clumps in the positions of galaxies \#1028 and \#993, which are responsible for the local splitting of S1.  This shows that lensing analysis can reliably detect galaxy sized dark substructure if it causes the local splitting of images, although the lensing map is not very sensitive to galaxy sized substructure in general (see also Section~\ref{sec:sens_gal}). 
436: 
437: \section{Bimodality of the Mass Distribution: Evidence of a Merger}
438: \label{sec:bimodal}
439: Our strong lensing analysis shows that Abell 2218 has a bimodal mass distribution: even if the dominant mass component is the BCG  and DM1 halos, the second large scale dark matter clump DM2 is also significant, contributing  around $20\%$ of the total mass within $100\arcsec$.
440: This second smooth mass component is associated with the bright cluster galaxy in the south-east that we call $\#617$.    To further interpret this bimodality, we look at two different available probes of the
441: cluster: the X-ray map and the velocity of cluster members.
442: 
443: \begin{description}
444:  \item[The X-ray map]
445: The X-ray flux map in the central parts of the cluster shows a complex morphology with no outstanding central peak. The offset between the X-ray peak at (R.A., Dec.)=$(248.967, 66.210)$
446: %(alpha=16 35 52.0, delta=+66 12 36.0).
447: and the BCG is significant, $\sim 20\arcsec$, with the X-ray peak located in the direction of DM2, but there is no evident peak in the X-ray emission in the vicinity of DM2.  The X-ray flux map within about 1 arcmin of the peak of the X-ray emission is clearly elongated in the SE-NW direction, but becomes more spherical with increasing distance from the cluster center (see Figure~\ref{fig:massmap}).  Comparing the contours for the X-rays and the mass map we see that the elongation of the two are very similar, although the X-rays become more spherical at large radii.  To get more quantitative values for comparison of the two maps, we fitted a 2-dimensional $\beta$ model \citep{cavaliere1978} to both maps.
448: 
449: For the X-ray map, the best fit beta-model to the inner 2 arcmin by 2 arcmin (centered on the X-ray centroid) has an eccentricity ($\epsilon^\beta\equiv\sqrt{1-(B/A)^2}$) of $\epsilon^\beta_X(\mathrm{inner})=0.27\pm0.12$ and a position angle of $\theta_X(\mathrm{inner})=39\degr\pm16\degr$ (measured anti-clockwise from West). Fitting to the the 4 arcmin by 4 arcmin X-ray map (centered on the X-ray centroid) results in eccentricity of $\epsilon^\beta_X(\mathrm{outer})=0.18\pm0.08$ and a position angle of $\theta_X(\mathrm{outer})=17\degr\pm14\degr$.   A similar analysis for the mass map in a 5 arcmin by 5 arcmin (centered on the BCG) gives an eccentricity of the overall mass map as $\epsilon^\beta_{\mathrm{mass}}\approx0.25$ with $\theta_{\mathrm{mass}}\approx39\degr$, consistent with that found for the X-ray map, in particular in the inner regions.
450: 
451: \item[Distribution in velocity space]
452: If a merger has taken place in Abell~2218, then this should be imprinted
453: in the velocity distribution of the cluster members: we should be able to
454: identify two structures in velocity space; one associated with the BCG galaxy, and 
455: one associated with galaxy $\#617$.  
456: 
457: \citet{girardi1997} studied the structure of Abell 2218 using the spectroscopic data from \citet{leborgne1992}.  They found evidence for two structures (labelled MS1 and MS2)  separated by $2000$~km~s$^{-1}$.  The larger of these two structures, MS2, contains both the BCG and the brightest galaxy in DM2.  The two structures are superimposed along the line of sight and do not correspond exactly to the clumps found for the strong lens modeling.   Such superimposed structures are hard to separate using lensing as it is not sensitive to the 3D distribution of the matter.
458: 
459: To look for substructure associated with the BCG and $\#617$ we have calculated the separation ($\Delta v = c (z_i-z_j)/(1+z_i)$) between them and the other cluster members using spectroscopic data from \citet{leborgne1992}.  
460: Out of the 50 galaxies in the catalog we found 13 with $\Delta v < 500$~km~s$^{-1}$ and 25 with $\Delta v < 1000$~km~s$^{-1}$ for the brightest galaxy in DM2 while for DM1 the numbers were 11 and 21 respectively.  The $\Delta v<500$~km~s$^{-1}$ cut defines two structures without any common members, while the $\Delta v <1000$~km~s$^{-1}$  cut starts mixing the two groups.   
461: While all the galaxies we find with a $\Delta v < 500$~km~s$^{-1}$ for the BCG and $\# 617$ would belong to MS2 found by \citet{girardi1997}, the data suggests that MS2 may be further subdivided into two smaller structures, corresponding to the DM1 and DM2 found by the lens models.
462: \end{description}
463: 
464: The X-ray data and the distribution of the cluster members in velocity space, indicates that the bimodal mass distribution is caused by a merger.   The main cluster is the one associated with the BCG (DM1 and BCG in the lensing analysis) which the second cluster associated with $\#617$ (DM2 in the lensing analysis) has merged with, thus displacing the center of the X-ray peak from the BCG galaxy.  
465: 
466: \section{Conclusions}
467: \label{sec:conclusions}
468: We have reconstructed a mass map of the rich galaxy cluster Abell 2218 using strong lensing constraints.  Our model is based on 7 multiply imaged systems and 1 arc with spectroscopic redshifts,  and 6 systems without  spectroscopic redshifts, of which 5 are new candidate systems proposed in this work.  The model is sampled and optimized in the source plane by a Bayesian Monte Carlo Markov Chain implemented in the publicly available software Lenstool.  Our best model has rms$_s=0\farcs12$ in the source plane, corresponding to rms$_i=1\farcs49$ in the image plane.
469: 
470: We have found, in agreement with previous models of Abell 2218, that the mass distribution is bimodal.
471: We find that DM2 is larger and with a flatter core than previous models have found.  The flatness of the profile near DM2 is further supported by our 'blind tests', which do not place a galaxy sized component near its center.  The BCG and DM1 are the dominant component of the mass model in the inner regions  ($<100\arcsec$) of the cluster.  We have analyzed the distribution of galaxies in velocity space, finding evidence for two substructures, separated by $\sim 1000$~km~s$^{-1}$, corresponding to DM1 and DM2.  Although both the light and the X-ray contours are consistent with the mass map, the center of the X-ray emission is offset from the central peak of the BCG.  We find that the X-ray data and the distribution in velocity space support the interpretation that the bimodal mass distribution arises from a cluster merger.
472: 
473: We have explored the degeneracy of the mass model, both those inherent to the dPIE profile and those arising from the mass model components themselves.   For the dPIE, our results are in agreement with those of \citet{jullo2007}.   We find that the large scale dark matter clumps are a necessary component of the model, i.e., using only the dark matter halos associated with the galaxies does not give a good fit to the data (rms$_s=1\farcs62$ vs. rms$_s=0\farcs12$), even if they are allowed to become much more massive. 
474: 
475: At around $100\arcsec$ the two large scale halos contribute $\sim85\%$ of the enclosed projected mass, while the BCG contributes $\sim9\%$ and the remaining cluster galaxies $\sim6\%$.  We have performed 'blind tests' to check where the model requires a galaxy scale component to reproduce the lensing constraints, and find that both the BCG and galaxies which locally perturb given multiply imaged systems are reliably reclaimed.    However, we find that the inclusion of the cluster galaxies (excluding the BCG) only weakly affects the model unless they locally perturb a multiply imaged system.  Assuming that mass scales with light, this shows that strong lensing constraints can reliable detect substructure, dark or luminous, if the substructure is massive or locally perturbs a system.
476: 
477: The accurate mass map we have presented is made available to the community and can be used to exploit Abell 2218 as a gravitational telescope, probing the high redshift universe.  In this work we have fixed the cosmology, but with the increased number of constraints it may be possible to simultaneously fit it with the mass distribution.  However, the bimodal structure of Abell 2218, and the remaining uncertainty in the mass model (around $\sim1\arcsec$ in the image plane) may make it challenging to reach competitive accuracy in the derived cosmological parameters.
478: 
479: \acknowledgments
480: 
481: We thank S\'ebastien Bardeau and Genevi\`eve Soucail for giving us access to the weak lensing data.  \'A.E. thanks Bo Milvang-Jensen, C\'ecile Faure, Chlo\'e F\'eron, John McKean, Kim Nilsson and Paul Vreeswijk for helpful discussions relating to this work.
482: The Dark Cosmology Centre is funded by the Danish National Research Foundation.  This work was supported by the European Community's Sixth Framework Marie Curie Research Training Network Programme, Contract No. MRTN-CT-2004-505183 "ANGLES".  We thank the Danish Centre for Scientific Computing for granting the computer resources used.  This research was supported in part by the National Science Foundation under Grant No. PHY99-07949.  JR is grateful to the California Institute of Technology for its support.  JPK acknowledges support from CNRS.  KP acknowledges support from IDA. The authors recognize and acknowledge the very significant cultural
483: role and reverence that the summit of Mauna Kea has always had
484: within the indigenous Hawaiian community.  We are most fortunate
485: to have the opportunity to conduct observations from this mountain.
486: 
487: %% To help institutions obtain information on the effectiveness of their
488: %% telescopes, the AAS Journals has created a group of keywords for telescope
489: %% facilities. A common set of keywords will make these types of searches
490: %% significantly easier and more accurate. In addition, they will also be
491: %% useful in linking papers together which utilize the same telescopes
492: %% within the framework of the National Virtual Observatory.
493: %% See the AASTeX Web site at http://www.journals.uchicago.edu/AAS/AASTeX
494: %% for information on obtaining the facility keywords.
495: 
496: %% After the acknowledgments section, use the following syntax and the
497: %% \facility{} macro to list the keywords of facilities used in the research
498: %% for the paper.  Each keyword will be checked against the master list during
499: %% copy editing.  Individual instruments or configurations can be provided 
500: %% in parentheses, after the keyword, but they will not be verified.
501: 
502: %{\it Facilities:} \facility{Nickel}, \facility{HST (STIS)}, \facility{CXO (ASIS)}.
503: 
504: 
505: % Appendix
506: \appendix
507: \include{appa}
508: 
509: % References
510: \include{ref}
511: 
512: 
513: 
514: % Tables
515: 
516: %\clearpage
517: %\input{tab1}
518: %\input{tab2}
519: %\input{tab3}
520: %\input{tab4}
521: 
522: \end{document}
523: 
524: