0807:0807.2868/ms.tex

1: % !iTeXMac(typeset): simpdftex latex --keep-psfile ${iTMInput}

2: % !iTeXMac(compile): "./local Command"

3: %\documentclass[12pt,preprint]{aastex}

4: \documentclass{emulateapj}

5: \usepackage{apjfonts}

6: \bibliographystyle{apj}

7:

8: %\newcommand{\scaleup}{}

9: %\newcommand{\plotter}{\includegraphics[scale=0.70]}

10: %\newcommand{\plotterr}{\includegraphics[scale=0.50]}

11: %\newcommand{\breaker}{\clearpage}

12: %\newcommand{\appendixcolumns}{}

13: \newcommand{\scaleup}{\epsscale{1.1}}

14: \newcommand{\plotter}{\plotone}

15: \newcommand{\plotterr}{\plotone}

16: \newcommand{\breaker}{}

17: \newcommand{\appendixcolumns}{\twocolumngrid}

18:

19: \newcommand{\tableset}{deluxetable}

20: \newcommand{\tableclear}{\clearpage}

21: \newcommand{\Mdot}{\dot{M}}

22: \newcommand{\mdot}{\dot{m}}

23: \newcommand{\etal}{et al.}

24: \newcommand{\mbh}{M_{\rm BH}}

25: \newcommand{\tH}{t_{\rm H}}

26: \newcommand{\mstar}{M_{\ast}}

27: \newcommand{\mhalo}{M_{\rm halo}}

28: \newcommand{\mgal}{M_{\rm gal}}

29: \newcommand{\msun}{M_{\sun}}

30: \newcommand{\lstar}{L_{\ast}}

31: \newcommand{\phistar}{\phi_{\ast}}

32: \newcommand{\pstar}{\phistar}

33: \newcommand{\reducedchi}{\chi^{2}/\nu}

34: \newcommand{\qeos}{q_{\rm eos}}

35: \newcommand{\fgas}{f_{\rm gas}}

36: \newcommand{\mdyn}{M_{\rm dyn}}

37: \newcommand{\re}{R_{\rm e}}

38: \newcommand{\fsb}{f_{\rm sb}}

39: \newcommand{\paperone}{Paper \textrm{I}}

40: \newcommand{\papertwo}{Paper \textrm{II}}

41: \newcommand{\paperthree}{Paper \textrm{III}}

42: \newcommand{\paperfour}{Paper \textrm{IV}}

43:

44: \shorttitle{Evolution in Spheroid Scalings}

45: \shortauthors{Hopkins \etal}

46: \slugcomment{Submitted to ApJ, June 14, 2008}

47: \begin{document}

48:

49: \title{Dissipation and Extra Light in Galactic Nuclei: \textrm{IV}.\

50: Evolution in the Scaling Relations of Spheroids}

51: \author{Philip F. Hopkins\altaffilmark{1},

52: Lars Hernquist\altaffilmark{1},

53: Thomas J. Cox\altaffilmark{1,2},

54: Dusan Keres\altaffilmark{1},

55: \&\ Stijn Wuyts\altaffilmark{1,2}

56: }

57: \altaffiltext{1}{Harvard-Smithsonian Center for Astrophysics,

58: 60 Garden Street, Cambridge, MA 02138}

59: \altaffiltext{2}{W.~M.\ Keck Postdoctoral Fellow at the

60: Harvard-Smithsonian Center for Astrophysics}

61:

62:

63: \begin{abstract}

64:

65: We develop a model for the physical origin and redshift

66: evolution of spheroid scaling relations.

67: We consider spheroid sizes, velocity dispersions, dynamical masses,

68: profile shapes (Sersic indices), stellar and supermassive

69: black hole masses, and their related scalings.

70: Our approach combines advantages of prior observational

71: constraints in halo occupation models and libraries of

72: high-resolution hydrodynamic simulations of galaxy mergers.

73: This allows us to separate

74: the relative roles of dissipation, dry mergers, formation time,

75: and evolution in progenitor properties, and identify their impact on

76: observed scalings at each redshift.

77: We show that, at all redshifts,

78: dissipation is the most important factor determining spheroid sizes

79: and fundamental plane scalings, and can account (at $z=0$) for the observed

80: fundamental plane tilt and differences between observed disk and

81: spheroid scaling relations. Because disks (spheroid progenitors)

82: at high redshift have characteristically larger gas fractions, this predicts

83: more dissipation in mergers, yielding systematically more compact,

84: smaller spheroids. In detail, this gives rise to a mass-dependent evolution

85: in the sizes of spheroids of a given mass, that agrees well with observations.

86: This relates to a subtle weakening of the tilt of the early-type fundamental

87: plane with redshift,

88: important for a number of studies that assume a non-evolving

89: stellar mass fundamental plane.

90: This also predicts evolution in the black hole-host mass relations,

91: towards more massive black holes at higher redshifts.

92: Dry mergers are also significant, but only for

93: large systems which form early -- they

94: originate as compact systems, but undergo a number of dry mergers

95: (consistent with observations) such that they have sizes at any later

96: observed redshift similar to systems of the same mass formed more recently.

97: Most of the observed, compact high-redshift ellipticals will become the

98: cores of present BCGs, and we show how their sizes, velocity dispersions,

99: and black hole masses

100: evolve to become consistent with observations. We also predict

101: what fraction might survive intact from early formation,

102: and identify their characteristic $z=0$ properties.

103: We make predictions for residual correlations as well: e.g.\ the

104: correlation of size and fundamental plane residuals with formation time

105: of a given elliptical, that can be used as additional tests of these models.

106:

107: \end{abstract}

108:

109: \keywords{galaxies: elliptical and lenticular, cD --- galaxies: evolution ---

110: galaxies: formation --- galaxies: nuclei --- galaxies: structure ---

111: cosmology: theory}

112:

113:

114:

115: \section{Introduction}

116: \label{sec:intro}

117:

118: Understanding the scaling relations between the photometric and

119: kinematic properties of galaxy spheroids -- their masses, sizes,

120: velocity dispersions, and luminosities -- is fundamental to explaining

121: their origin. Any model which attempts to account for these

122: correlations must, of course, also determine how they evolve as a

123: function of redshift. In a $\Lambda$CDM universe, objects grow

124: hierarchically, implying that ellipticals do not comprise a monolithic

125: population formed at an early epoch, but have evolved and grown by

126: mergers from $z\gtrsim6$ to $z=0$.

127:

128: Observations are beginning to probe

129: the populations of spheroids at these epochs, and reveal that there is

130: indeed evolution in such scaling relations \citep[see

131: e.g.][]{trujillo:size.evolution,mcintosh:size.evolution,

132: zirm:drg.sizes,vandokkum:z2.sizes}, generally in the sense of massive

133: ellipticals being more compact at high redshifts.  Some of these

134: systems, in fact, appear sufficiently compact \citep[see

135: e.g.][]{zirm:drg.sizes,vandokkum:z2.sizes} that there may be no

136: similar $z=0$ analogs whatsoever, leading to considerable debate

137: regarding the fate of such objects.  Ultimately, the causes of such

138: evolution and its implication for e.g.\ the fundamental plane of

139: ellipticals at those redshifts (and today) are not well understood.

140:

141: Moreover, observations now demonstrate that essentially all massive

142: spheroids host a supermassive black hole at their centers

143: \citep{KormendyRichstone95}. The mass of this black hole is tightly

144: correlated with a variety of structural parameters: the mass

145: \citep{magorrian}, velocity dispersion \citep{FM00,Gebhardt00}, and

146: concentration/Sersic index \citep{graham:concentration,graham:sersic}.

147:

148: Recently, \citet{hopkins:bhfp.obs} and \citet{aller:mbh.esph}

149: demonstrated that these correlations could be understood in terms of a

150: ``fundamental plane''-like relation, essentially one where

151: black hole (BH) mass

152: tracks the binding energy of the stellar

153: bulge. \citet{hopkins:bhfp.theory} showed that this is the natural

154: expectation of theories where some form of feedback from accretion

155: leads to self-regulated growth of the BH.  This has important

156: consequences for e.g.\ quasar lightcurves \citep{hopkins:faint.slope}

157: and lifetimes \citep{hopkins:lifetimes.letter}, as well as for the

158: effects of ``feedback'' on the host galaxy, possibly necessary for

159: ``quenching'' of star formation to create red elliptical galaxies at

160: $z=0$

161: \citep{croton:sam,hopkins:red.galaxies,hopkins:clustering,hopkins:groups.ell}.

162:

163: Observations suggest that there may be evolution in the black hole

164: host-galaxy scalings \citep[e.g.][]{peng:magorrian.evolution}, but the

165: details remain ambiguous \citep[see

166: e.g.][]{woo06:lowz.msigma.evolution,

167: salviander:msigma.evolution,lauer:mbh.bias}.  As pointed out in

168: \citet{hopkins:bhfp.theory}, at least some such evolution should be a

169: natural consequence of evolution in host spheroid scaling relations,

170: provided that there is some fundamental quantity or property

171: traced by black hole mass. Again, if true, this is of basic

172: importance for models of black hole growth, AGN and quasar activity,

173: star formation, and galaxy formation and evolution.

174:

175: A number of important questions therefore arise: If indeed the

176: observations are correct, why are these high-redshift spheroids

177: different from their analogs at $z=0$?  What can this tell us about

178: the process of spheroid formation? Can we use them to constrain and

179: test models for spheroid formation, quenching, and black hole growth?

180: Are such objects typical, or are they the result of complex selection

181: effects? What happens to them by $z=0$? Can we find the remnants or

182: left-overs of the population today?  Are evolution in spheroid

183: structural properties related to evolution in their hosted black

184: holes? And how is this connected to the cosmological process of quasar

185: fueling and black hole growth?

186:

187: In a series of papers, \citet{hopkins:cusps.mergers,hopkins:cusps.ell,

188: hopkins:cores,hopkins:cusps.fp} (hereafter \paperone-\paperfour),

189: we combined libraries of hydrodynamic simulations of galaxy merger remnants

190: and observations of nearby ellipticals spanning the largest available dynamic

191: range in e.g.\ spatial scale, surface brightness, and galaxy properties,

192: and developed a methodology by which we could empirically separate

193: spheroids into their two dominant physical components. First, a dissipationless

194: component -- i.e.\ an ``envelope,'' formed from the violent relaxation/scattering

195: of stars that are

196: already present in merging stellar disks that contribute to the final

197: remnant. Because disks are very extended, with low phase-space density

198: (and collisionless processes in a merger cannot raise this phase-space density),

199: these stars will necessarily dominate the profile at large radii, hence

200: the envelope, with a low central density.

201:

202: Second, a dissipational component -- i.e.\ a dense central relic of

203: starbursts from gas that has been brought into the remnant, lost its

204: angular momentum, and turned into stars in a compact central starburst

205: similar to those observed in e.g.\ local ULIRGs (although these would

206: represent the most extreme cases).  Because gas can radiate, it can

207: collapse to very high densities, and will dominate the profile within

208: radii $\sim0.5-1\,$kpc, accounting for the high central densities in

209: ellipticals. The gas will probably reflect that initially brought in

210: from merging disks, but could in principle also stem from new cooling or

211: stellar mass loss in the elliptical \citep[see

212: e.g.][]{ciottiostriker:recycling}.  In subsequent mergers the two

213: components will act dissipationlessly (they are just stars and dark

214: matter), but the segregation between the two is sufficient that they

215: remain distinct even after multiple major dissipationless or ``dry''

216: re-mergers: i.e.\ one can still, in principle, distinguish the dense

217: central stellar component that is the remnant of the combined

218: dissipational starburst(s) from the less dense outer envelope that is

219: the remnant of low-density disk stars.

220:

221: In \paperone\ we developed a methodology to empirically separate these components

222: in observed systems with data of sufficient quality, and tested this on

223: observed samples of nearby merger remnants from \citet{rj:profiles}.

224: Using e.g.\ comparison with stellar populations, colors, and other properties

225: as well as direct comparison of simulations with observed surface brightness profiles,

226: galaxy shapes, and kinematics, we confirmed that this could reliably extract

227: the dissipational component of the galaxy \citep[see also][and references therein]{jk:profiles}.

228: In \papertwo\ we extended this to

229: observed ellipticals with central ``cusps'' and in \paperthree\ to those with central

230: ``cores'' (believed to have undergone subsequent dry mergers, but as we argue,

231: still retaining evidence for the initial dissipational/dissipationless stellar

232: components), from \citet{jk:profiles} and \citet{lauer:bimodal.profiles}.

233: In \paperfour, we used these results to argue that the relative

234: mass fraction in the dissipational component -- i.e.\ the total mass fraction of

235: the elliptical built up dissipationally, as opposed to brought in as stars in

236: stellar disks, was the most important parameter determining the sizes

237: and densities of ellipticals at $z=0$. We showed that this fraction depends

238: systematically on mass: low mass ellipticals experienced more dissipation

239: than high-mass ones (presumably reflecting the well-established

240: observational fact that low-mass disks are more gas-rich than

241: higher-mass disks). We then empirically

242: demonstrated that this systematic dependence is

243: both necessary and sufficient to explain e.g.\ the difference between

244: the size-mass relation of ellipticals and that of their (ultimate) progenitor disks,

245: and to explain the observed ``tilt'' in the fundamental plane relation of spheroids.

246:

247: There are, however, limitations in our ability to theoretically model these

248: processes using idealized simulations and to understand them

249: observing only $z=0$ ellipticals. Many galaxies are expected to have had complex

250: merger histories (perhaps forming early in a rapid series of multiple, highly dissipational

251: mergers, rather than a single disk-disk merger at late times),

252: which might, in principle, ``smear our'' or bias some of these comparisons or

253: systematically alter some predictions

254: \citep[see e.g.][]{kobayashi:pseudo.monolithic,naab:etg.formation}.

255: Moreover, the gas content in pre-merger disks (and possibly

256: in ellipticals themselves) is both expected and observed

257: \citep[e.g.][]{erb:lbg.gasmasses} to evolve strongly as a function of redshift.

258: The disks building up ellipticals at $z\gtrsim2$ are much more gas

259: rich, so their remnant ellipticals at these times would necessarily be more dissipational,

260: and as such might have different structural properties and obey different

261: scaling relations from their $z=0$ counterparts.

262: It is therefore of particular interest to combine our empirical understanding of the

263: role of dissipation from local samples, and our inferences from idealized simulations,

264: with fully cosmological models that can account for the evolution in elliptical progenitors

265: and merger histories with redshift.

266:

267: Some initial attempts at studying these correlations have been

268: made using cosmological simulations \citep{naab:etg.formation,dimatteo:cosmo.bhs,

269: sijacki:radio,croft:cosmo.morph.density}.

270: It is, unfortunately, prohibitively expensive to simultaneously resolve the

271: $\sim100\,$pc

272: scales necessary to meaningfully predict e.g.\ spheroid scaling

273: lengths, black hole masses, and central velocity dispersions

274: in a fully cosmological simulation which would also contain a volume

275: capable of modeling a large number of galaxies up to masses

276: $\sim10^{12}\,\msun$. Moreover, disk formation remains an

277: unsolved problem in such simulations, with the uncertain physics of

278: e.g.\ star formation and feedback probably important to form disks with

279: the appropriate scale lengths, thicknesses, and bulge-to-disk ratios

280: for their masses \citep{robertson:cosmological.disk.formation,

281: donghia:disk.ang.mom.loss,governato:disk.formation,ceverino:cosmo.sne.fb,

282: zavala:cosmo.disk.vs.fb}.

283: If initial disks do not have the appropriate observed

284: structural properties, then even a perfect model for spheroid formation will

285: have severe systematic inaccuracies.

286:

287: In order to get around some of these limitations, alternative attempts

288: have been made to instead predict scaling resolutions

289: from semi-analytic models of galaxy formation \citep{khochfar:size.evolution.model,

290: almeida:durham.ell.scalings}.

291: Although they could not calibrate the details of their predictions with

292: numerical simulations,

293: \citet{khochfar:size.evolution.model} used this to illustrate, with very simple

294: assumptions, that increasing dissipational fractions in ellipticals at

295: higher redshift leads to the expectation that spheroids should be more

296: compact. \citet{almeida:durham.ell.scalings} explicitly ignored dissipation,

297: and as a result found that none of the models they considered could

298: reproduce the observed $z=0$ fundamental plane correlations of

299: spheroids (nor their redshift evolution).

300:

301: Such models, however, potentially suffer from the same problems

302: regarding the nature of disks (the ``initial conditions''):

303: uncertainties related to modeling spheroid formation are difficult to

304: disentangle from prescriptions for e.g.\ cooling, disk growth, star

305: formation, and feedback.  Moreover, while a few properties can easily

306: be estimated (in a mean sense) from {\em a priori} analytic arguments,

307: many important properties and physical factors affecting spheroid

308: evolution -- the role of dissipation in changing the shape and

309: velocity structure of a spheroid, the detailed density profiles

310: (especially at small radii and in extended envelopes) of ellipticals,

311: and the dark matter to stellar or baryonic mass ratios within various

312: radii in the galaxy -- require detailed, high-resolution numerical

313: simulations to robustly model.

314:

315: One way of circumventing the problems in these cosmological models is

316: to adopt a halo occupation distribution (HOD) approach, in which

317: (beginning from a halo+subhalo distribution determined in numerical

318: simulations) halos are populated with galaxies according to empirical

319: constraints on their clustering properties. The success of such models

320: at reproducing the observed distributions at a variety of redshifts,

321: as a function of various galaxy properties, has now been demonstrated

322: \citep[see e.g.][]{conroy:monotonic.hod}.  This allows us to begin

323: with the empirical knowledge of where galaxies lie, and what their gas

324: properties are, without reference to some (potentially incomplete)

325: {\em a priori} model of e.g.\ star formation and feedback (which we

326: are not, in any case, interested in testing here). Using this approach

327: to identify and follow galaxies and their subsequent mergers, we can

328: then reference each merger to an idealized hydrodynamic simulation

329: with similar properties (e.g.\ initial morphologies, gas fractions,

330: sizes, masses, etc.), and use this to predict what the properties of

331: the remnant should be. Constructing a sample of such remnant spheroids

332: at any given redshift is straightforward, and allows us to compare

333: with observed scaling relations in a direct manner.

334:

335: In this paper, we adopt this method to combine the advantages of

336: high-resolution simulations, semi-analytic models, and observational

337: constraints on spheroid progenitors in order to develop

338: robust predictions for the evolution of spheroid structure and

339: the correlation between black hole mass and host properties.

340: We study how e.g.\ the evolution in disk gas fractions, structural properties,

341: and the interplay between gas-rich and gas-poor mergers

342: and multiple mergers in a hierarchical cosmology relate to

343: observed correlations from $z=0-4$.

344:

345: In \S~\ref{sec:model}, we describe our methodology, including our

346: cosmological models of merger histories

347: (\S~\ref{sec:model:cosmology}), empirical models for progenitor disks

348: (\S~\ref{sec:model:hod}), and library of simulations which we use to

349: determine remnant properties (\S~\ref{sec:model:sims}). In

350: \S~\ref{sec:z0}, we consider and compare observations to the

351: resulting predicted scalings of spheroids at $z=0$, and specifically

352: highlight the effects of dissipation (\S~\ref{sec:z0:diss}) and dry

353: mergers (\S~\ref{sec:z0:dry}).  In

354: \S~\ref{sec:evol} we make a number of predictions and comparisons to

355: observations regarding how these correlations evolve with redshift,

356: and again illustrate the effects of dissipation

357: (\S~\ref{sec:evol:diss}) and dry mergers (\S~\ref{sec:evol:dry}).  In

358: \S~\ref{sec:track} we follow the history of individual systems forming

359: at different redshifts to compare them at later redshifts, and discuss

360: the fates of early-forming galaxies which (initially) reflect these

361: evolved correlations. Finally, in \S~\ref{sec:track}, we summarize our

362: results and outline future observational tests.

363:

364: Throughout, we adopt a WMAP5

365: $(\Omega_{\rm M},\,\Omega_{\Lambda},\,h,\,\sigma_{8},\,n_{s})

366: =(0.274,\,0.726,\,0.705,\,0.812,\,0.960)$ cosmology

367: \citep{komatsu:wmap5}, and normalize all observations and models

368: shown to these parameters. Although the exact choice of

369: cosmology may systematically

370: shift the associated halo masses (and high-redshift evolution)

371: at a given galaxy mass (primarily scaling with $\sigma_{8}$),

372: our comparisons (in terms of stellar mass) are for the most part unchanged (many of these

373: differences are implicitly normalized out in the halo occupation approach).

374: Repeating our calculations for

375: a ``concordance'' $(0.3,\,0.7,\,0.7,\,0.9,\,1.0)$ cosmology or

376: the WMAP1 $(0.27,\,0.73,\,0.71,\,0.84,\,0.96)$ and WMAP 3

377: $(0.268,\,0.732,\,0.704,\,0.776,\,0.947)$

378: results of \citet{spergel:wmap1} and \citet{spergel:wmap3}

379: has little effect on our conclusions.

380: We also adopt a \citet{salpeter64} stellar initial mass function (IMF), and convert all stellar masses

381: and mass-to-light ratios accordingly. Again, the choice of the IMF systematically

382: shifts the normalization of stellar masses herein, but does not substantially change

383: our comparisons.

384: All magnitudes are in the Vega system, unless otherwise specified.

385:

386:

387: \breaker

388: \section{The Simulations and Cosmological Model}

389: \label{sec:model}

390:

391:

392: \subsection{Overview}

393: \label{sec:model:sims:summary}

394:

395: The model we will adopt consists of the following steps, summarized here

396: and described in detail below:

397: \begin{itemize}

398: \item We construct the halo+subhalo mass function, and populate this with galaxies

399: of various masses according to a standard halo occupation model approach. We

400: allow this population to vary within the range allowed by observational constraints,

401: but find this to be a small source of uncertainty.

402: \item We assign un-merged galaxies a scale length and gas fraction

403: appropriate for their stellar mass and redshift according to observational

404: constraints (with appropriate scatter).

405: We also systematically vary assignments to span the range allowed by

406: observational constraints, and find this is the dominant source of uncertainty in

407: our predictions.

408: \item We evolve this model forward in time (following the dark matter) and identify

409: mergers. After a merger, the properties of the remnant galaxy are calculated

410: as a function of the progenitor properties, according to the results from

411: hydrodynamic simulations.

412: \item At some later time, we construct a mock catalogue (uniformly sampling the

413: mock population in stellar mass) and compare the predictions with

414: observed galaxy properties.

415: \end{itemize}

416:

417: The exact implementation

418: of the halo occupation model and merger

419: rate calculations are outlined in the Appendix, for readers

420: interested in reproducing the model calculations. Here we outline the

421: relevant physics and model approach, and highlight the calibrations from

422: simulations used to predict the properties of merger remnants, as

423: well as the important sources of

424: uncertainty for those predictions.

425:

426:

427:

428: \subsection{Cosmological Model}

429: \label{sec:model:cosmology}

430:

431: We wish to track merger histories in a cosmologically motivated

432: manner, in order to follow the growth of spheroids.  As discussed in

433: \S~\ref{sec:intro}, it remains prohibitive to simulate the relevant

434: hydrodynamic processes (with the necessary $\lesssim100\,$pc

435: resolution) in full cosmological simulations (where, in order to probe

436: the massive galaxies of interest here, we would require $\sim{\rm

437: Gpc^{3}}$ box sizes).  In principle, we could begin with a full

438: semi-analytic model: attempt to describe, in an {\em a priori} manner,

439: the entire process of cooling, disk growth, star formation, and

440: feedback.  This has been done by

441: \citet{khochfar:size.evolution.model}, for example, as well as

442: \citet{almeida:durham.ell.scalings}, and we compare with their results

443: below. We have also experimented with this methodology applied to the

444: approach of \citet{somerville:new.sam}, and find qualitatively similar

445: results.

446:

447: However, a full semi-analytic model necessitates an attempt to predict

448: e.g.\ the star formation histories, gas fractions, disk galaxy

449: populations, and clustering of galaxies in an {\em a priori} manner.

450: Although these are certainly worthy of investigation, they are not the

451: quantities we are interested in here -- therefore attempting to model

452: e.g.\ disk formation, star formation, and stellar winds in this manner

453: will introduce additional uncertainties and dependences.

454:

455: For example,

456: \citet{khochfar:size.evolution.model} find that their most massive

457: galaxies are predicted to evolve very rapidly. However, their analysis

458: results in more rapid gas fraction evolution than observed, a problem

459: common to semi-analytic models without strong feedback \citep[see

460: e.g.][]{somerville:sam}.  In fact, over the redshift range $z\sim0-2$,

461: progenitor disk galaxy properties (at least the broad properties of

462: greatest interest to us here, namely their sizes (or the Tully-Fisher

463: relation) and gas fractions) are reasonably well constrained

464: empirically, but theory has had mixed success at best in reproducing

465: them from a fully {\em a priori} framework. If current {\em a priori}

466: models are incorrectly describing the cooling of gas and disk

467: formation (in particular the formation of massive galaxies), then the

468: predicted properties of ellipticals formed in mergers of those

469: progenitors are suspect.

470:

471: We therefore adopt a halo occupation approach -- essentially beginning

472: with the observational constraints on e.g.\ disk gas fractions and the

473: Tully-Fisher relation, and predicting the properties of spheroids from

474: these empirically constrained progenitors. For our purposes, we are

475: not concerned with why a given disk population is e.g.\ gas-rich or

476: gas-poor, or has a particular effective radius -- we need those

477: numbers to compute the properties of the remnant of a major merger of

478: such disks.  This allows us to be as conservative as possible, and to

479: identify the robust prediction for spheroid scaling laws without

480: reference to uncertainties in e.g.\ disk formation models.  We can

481: (and do) include the uncertainties in the observational constraints in

482: our modeling, but we find that these are unimportant.

483:

484: The details of our scheme are described in \citet{hopkins:groups.qso}

485: and \citet{hopkins:groups.ell}, but we briefly review them here. For

486: a particular cosmology, we identify the halo and subhalo populations,

487: and associate each main halo and subhalo with a galaxy in a Monte Carlo

488: fashion. Specifically: at a given redshift, we calculate the halo

489: mass function $n(\mhalo)$ for our adopted cosmology following

490: \citet{shethtormen}. For each halo, we calculate the (weakly mass and

491: redshift dependent) subhalo mass function (or distribution of

492: subhalos, $P[N_{\rm subhalo}\, | \, M_{\rm subhalo},\ \mhalo]$)

493: following \citet{zentner:substructure.sam.hod} and

494: \citet{kravtsov:subhalo.mfs}. Alternatively, we have adopted it

495: directly from \citet{gao:subhalo.mf,nurmi:subhalo.mf} or calculated it

496: following \citet{vandenbosch:subhalo.mf,valeostriker:monotonic.hod},

497: and obtain similar results: the uncertainties at this stage are

498: negligible. Note that the subhalo masses are defined as the masses

499: upon accretion by the parent halo, which makes them a good proxy for

500: the hosted galaxy mass \citep{conroy:monotonic.hod} and removes the

501: uncertainties owing to tidal stripping.

502:

503: We then populate the

504: central galaxies in each halo and subhalo

505: according to an empirical halo occupation model.

506: These empirical models determine both the mean stellar mass and dispersion in stellar masses of

507: galaxies hosted by a given halo/subhalo mass $P(\mgal\,|\,M_{\rm subhalo})$,

508: from fitting the observed galaxy correlation functions as a function of e.g.\ scale and

509: stellar mass or luminosity at a given redshift.

510: Other properties, such as e.g.\ the sizes and gas fractions of galaxies, are determined

511: as a function of their stellar mass (and, according to our adopted model, merger

512: history) as described in \S~\ref{sec:model:hod}-\ref{sec:model:sims:wet} below.

513:

514: Although such models are constrained, by definition, to reproduce the mean

515: properties of the halos occupied by galaxies of a given mass/luminosity, there

516: are known degeneracies between parameterizations that give rise to

517: some (usually small) differences between models.

518: We therefore

519: repeat all our calculations for our ``default'' model

520: \citep[for which we follow the empirical constraints in][]{conroy:monotonic.hod}

521: \citep[see also][]{valeostriker:monotonic.hod} and

522: an alternate halo occupation model

523: \citep{yang:clf} \citep[see also][]{yan:clf.evolution,zheng:hod}, which

524: bracket the range of a number of calculations

525: \citep[e.g.,][]{cooray:highz,cooray:hod.clf,zheng:hod,vandenbosch:concordance.hod,

526: brown:hod.evol}

527: and direct observations of groups \citep{wang:sdss.hod}.

528:

529: We have also compared a variety of prescriptions for the

530: redshift evolution of various components in the halo occupation model:

531: we have adopted that directly fitted by the authors above at various redshifts,

532: we have considered a complete re-derivation

533: of the HOD models of \citet{conroy:monotonic.hod} and

534: \citet{valeostriker:monotonic.hod}

535: at different redshifts from each of the the observed mass functions of

536: \citet{fontana:highz.mfs,bundy:mfs,borch:mfs,blanton:lfs} \citep[see][]{hopkins:groups.qso},

537: and have also found similar results assuming

538: no evolution in $P(\mgal\,|\,M_{\rm subhalo})$ (for star forming galaxies).

539: Indeed, a number of recent

540: studies suggest that there is very little evolution in halo occupation

541: parameters (in terms of mass, or relative to $L_{\ast}$) with

542: redshift \citep{yan:clf.evolution,cooray:highz,

543: conroy:monotonic.hod,brown:hod.evol}, or equivalently that the masses of galaxies hosted in a

544: halo of a given mass are primarily a function of that halo mass, not

545: of redshift \citep{heymans:mhalo-mgal.evol,

546: conroy:mhalo-mgal.evol}. This appears to be especially true for

547: star-forming galaxies \citep{conroy:mhalo-mgal.evol}, unsurprising

548: given that quenching is not strongly operating in those systems to change

549: their mass-to-light ratios, but it also appears to be true for

550: red galaxies at least at moderate redshifts \citep{brown:hod.evol}.

551:

552: Consequently,

553: the differences between any of these choices are

554: small (at least at $z\lesssim3$), and negligible compared to

555: the uncertainties in our predictions from e.g.\ the uncertainties in disk

556: sizes and gas fractions as a function of their stellar mass and redshift.

557: Systems which have not undergone a major merger in the model are assumed

558: to be disk-dominated, and we can populate them either according to a

559: halo occupation model that treats all galaxies in an identical fashion, or as

560: blue/star forming galaxies in a model where the constraints are derived

561: for blue and red galaxies separately (we find it makes little or no difference).

562:

563: Finally, having populated a given halo and its subhalos

564: with galaxies, we follow the evolution of those halos forward and identify

565: major mergers\footnote{In a major merger, tidal forces are

566: sufficiently strong to drive nuclear inflows of gas and build

567: realistic spheroids.  The precise meaning of major merger in this

568: context is blurred by a degeneracy between the progenitor mass ratio

569: and the orbit

570: \citep{hernquist.89,hernquist.mihos:minor.mergers,bournaud:minor.mergers},

571: but both numerical \citep{younger:minor.mergers}

572: and observational

573: \citep{dasyra:mass.ratio.conditions,woods:tidal.triggering} studies

574: indicate that massive inflows of gas and morphological transformation

575: are typical for mass ratios only below $\sim 3:1$.  This is ultimately

576: related to the non-linear scaling of the disk response to a merger

577: of a given mass ratio \citep{hopkins:disk.heating}.

578: Unless otherwise

579: noted, we generally take the term ``mergers'' to refer to major

580: mergers.}. The details of the treatment of subhalo-subhalo and subhalo-halo

581: mergers are described in \citet{hopkins:groups.qso} and \citet{hopkins:groups.ell},

582: but in short the halo merger timescale is straightforward and well defined. When

583: two subhalos are fully merged, we can either assume the galaxies have merged,

584: or allow for some additional merger timescale within the merged halo.

585: We have experimented with a variety of models for this, including e.g.\ the

586: dynamical friction timescale and alternative timescales calibrated from

587: simulations or calculated based on

588: group capture or collisional cross section estimates and angular

589: momentum (orbital cross section) capture estimates. But because these

590: merger times within a halo are always much less than the Hubble time

591: at the relevant redshifts, and we are interested only

592: in statistical properties across populations evolving over a Hubble time,

593: it makes no difference to our calculations what choice we adopt.

594:

595: When galaxies merge, we estimate the properties of the remnant

596: based on those of the progenitor galaxies, according to our

597: numerical simulations, as described in \S~\ref{sec:model:sims}.

598: This approach yields reasonable agreement with global quantities such as the

599: spheroid mass function, the mass density of ellipticals as a function of

600: mass, and the fraction of early-type galaxies as a function of mass

601: \citep{hopkins:groups.ell}.

602: Again, despite the assumptions involved thus far, we

603: stress that the detailed choice of halo occupation model

604: yields negligible differences in progenitor galaxy properties and in our predictions

605: at all the masses and redshifts of interest, where the clustering and abundances

606: of massive galaxies are reasonably well-constrained.

607:

608:

609: \subsection{Progenitors: Laying Galaxies Down}

610: \label{sec:model:hod}

611:

612: Given a host halo mass and disk stellar mass, both determined

613: from the halo occupation model, the two important parameters we must

614: assign to each disk are a size (effective radius $R_{e}$) and gas

615: fraction ($f_{\rm gas}\equiv M_{\rm gas}/(M_{\ast}+M_{\rm

616: gas})$).\footnote{Note that together the disk size, baryonic and halo

617: mass define another parameter of interest, the disk circular velocity

618: or maximum velocity; but this does not explicitly enter into the way

619: in which we calculate the properties of the remnant.}  This is where

620: the dominant uncertainties in our modeling arise -- we will show that

621: e.g.\ the gas fraction in pre-merger disks is an important parameter

622: determining the characteristics of the remnant, so we can only predict

623: those properties to within the uncertainties in disk gas fractions at

624: the appropriate redshifts.

625:

626: To minimize the uncertainties in modeling this,

627: we therefore begin with an empirical estimate of the gas fraction distribution in

628: spiral galaxies as a function of mass and redshift. At $z=0$, there is

629: no significant

630: uncertainty; various measurements

631: \citep[e.g.][]{kennicutt98,belldejong:disk.sfh,kannappan:gfs,mcgaugh.tf.old,

632: mcgaugh:tf} and indirect constraints

633: from star formation and the baryonic Tully-Fisher relation \citep{bell:baryonic.mf,

634: noeske:sfh,calura:sdss.gas.fracs} give

635: consistent estimates of the median gas fraction in spirals as a function of their

636: stellar mass, and the scatter at each mass. We obtain identical results if we

637: use any of these constraints (or the data points themselves), but

638: note for convenience that all of these observations can be well-fitted by the trend

639: \begin{equation}

640: \langle\fgas(z=0)\rangle \approx \frac{1}{1+(M_{\ast}/10^{9.15} M_{\sun})^{0.4}}

641: \label{eqn:fgas.z0}

642: \end{equation}

643: with a constant $\sim0.2-0.25$\,dex scatter at each mass.

644:

645: Similar, albeit less robust, estimates exist at

646: $z=1$ and $z=2$ \citep[see e.g.][respectively]{shapley:z1.abundances,erb:lbg.gasmasses}.

647: We therefore adopt a simple functional form that interpolates between these

648: measurements. Motivated by the power-law form of the observed Kennicutt-Schmidt

649: star formation relation ($\dot{M}_{\ast}\propto \Sigma_{\rm gas}^{1.4}$),

650: we adopt the following interpolation formula:

651: \begin{equation}

652: \langle\fgas(z)\rangle = \fgas(z=0)\,{\Bigl [}1+\tau_{\rm LB}(z)\,{(1-\fgas(z=0)^{\beta})} {\Bigr]}^{-2/3}

653: \label{eqn:fgas.z}

654: \end{equation}

655: where $\beta \approx 3/2$ and $\tau_{\rm LB}(z)$ is the fractional

656: lookback time to redshift $z$.

657: This can be derived, for example, by assuming a simple model where the star formation

658: rate scales according to the observed relation, the disk scale length varies as a

659: power of the baryonic mass, and the net mass accretion rate (inflows minus

660: outflows) is also a power-law

661: function of the mass of the system; given the requirement that the system begin

662: at an initial time $t=0$ with $f_{\rm gas}=1$ and have the observed $f_{\rm gas}$ at

663: $z=0$.

664:

665: In any case, the values above provide a good fit to the observations at $z=0-2$

666: \citep{belldejong:disk.sfh,shapley:z1.abundances,erb:lbg.gasmasses}

667: and a reasonable approximation to results of cosmological simulations

668: \citep{keres:hot.halos,keres:prep} and semi-analytic models \citep{somerville:new.sam},

669: and interpolate smoothly between $z=0$ and

670: high redshift. Lowering $\beta$ systematically weakens the

671: implied redshift evolution in $f_{\rm gas}$ -- in order to represent the uncertainty in the

672: observations, we consider a range $\beta\sim0.5-2.0$ (bracketing the observational

673: errors). Similar prescriptions for $f_{\rm gas}$ evolution can be obtained from

674: various toy model star formation histories including e.g.\ commonly adopted

675: exponential $\tau$-models \citep[see e.g.][]{belldejong:disk.sfh,noeske:sfh}

676: and integration of specific tau models enforcing proportionality between inflow,

677: star formation, and outflow rates \citep{erb:outflow.inflow.masses}; these

678: are all discussed in more detail in \citet{hopkins:bhfp.theory,hopkins:groups.qso},

679: but they generally yield similar results and lie within the uncertainties we consider

680: (differences being much less than the typical observed scatter in $f_{\rm gas}$

681: at each mass and redshift).

682:

683:

684: The other important property

685: we must estimate for progenitor disks is their effective radius.

686: At $z=0$, this is well-measured for all disk masses of interest. We

687: employ the fits from \citet{shen:size.mass} to late-type galaxy sizes

688: as a function of stellar mass (from $\lesssim10^{9}\,\msun$ to

689: $\gtrsim10^{12}\,\msun$), and the scatter in those sizes.

690: Adopting different local estimates or even completely ignoring this

691: scatter makes little difference. Observations are somewhat more

692: ambiguous regarding the redshift evolution of sizes, but fortunately

693: the observational constraints \citep[e.g.][]{trujillo:size.evolution,barden:disk.size.evol,

694: ravindranath:disk.size.evol,rix:gems.gal.evol,ferguson:disk.size.evol,

695: akiyama:lbg.weak.size.evol} and state-of-the-art models

696: \citep{somerville:disk.size.evol} suggest that any evolution is relatively weak.

697: This is also reflected in e.g.\ the baryonic Tully-Fisher

698: relation, which appears to evolve negligibly from

699: $z=0-2$ \citep{conselice:tf.evolution,flores:tf.evolution,

700: kassin:tf.evolution,vandokkum:tf.evolution}.

701:

702: We therefore consider two extremes. In the first case,

703: disk sizes and the baryonic Tully-Fisher relation are completely

704: fixed with redshift, according to their $z=0$ values. In the second,

705: disk sizes evolve according to observational estimates

706: from the various measurements above. A simple power law

707: \begin{equation}

708: R_{e,\rm disk}[\mstar\, | z] = (1+z)^{-\beta_{d}}\, R_{e,\rm disk}[\mstar\, | z=0]

709: \label{eqn:rdisk.evol}

710: \end{equation}

711: where $\beta_{d}\approx0.4$ provides a good fit to the observations and

712: is consistent with all of the measurements, spanning the

713: redshift range $z=0-3$. We adopt this estimate, but obtain a similar

714: result if we use the more detailed cosmologically motivated model

715: in \citet{somerville:disk.size.evol}, which allows disks to form conserving

716: specific angular momentum from halo gas \citep[see][]{momauwhite:disks},

717: yielding (approximately) $R_{d}\propto R_{\rm vir}/c$, where

718: $c\propto1/(1+z)$ is the halo concentration -- although $R_{\rm vir}$ is smaller

719: for halos of a given mass at high redshift, they are less concentrated, yielding

720: a weak size evolution similar to that observed

721: \citep{bullock:concentrations,

722: wechsler:concentration,neto:concentrations,comerford:obs.concentrations,

723: buote:obs.concentrations}.

724:

725: Together this gives us an estimate of our input disk parameters, and we allow for

726: the described range in both the estimated gas fraction and disk size evolution.

727: At low redshift, this introduces little uncertainty (most spheroids at $z=0$

728: have assembled relatively recently, so their structural properties reflect those of

729: low-redshift disk progenitors, where the uncertainties are minimal and the relevant

730: sizes, etc.\ can be directly measured from observations). However, these uncertainties

731: begin to grow rapidly at $z\gtrsim 2-3$, where

732: there are no more direct observational constraints.

733: Moreover, at these redshifts, it is not clear that ``un-merged'' galaxies will

734: necessarily be disks analogous to those well-understood at $z<2$

735: \citep[see e.g.\ the clumpy morphologies and increased dispersions

736: observed in][]{reddy:z2.lbg.spitzer,flores:tf.evolution,

737: bournaud:chain.gal.model}, although higher-resolution observations suggest that

738: at least a significant fraction of this population does exhibit

739: regular rotation and smoother morphologies (albeit most likely still

740: puffier and more dispersion-supported than low-redshift disks) in the rest-frame

741: optical \citep{puech:highz.vsigma.disks,akiyama:lbg.weak.size.evol,

742: genzel:highz.rapid.secular,shapiro:highz.kinematics}.

743:

744: In either case, the uncertainties as this limit is approached

745: can be considered part of the uncertainty in e.g.\ progenitor sizes and

746: structural properties, as observational estimates of this quantity often

747: do not make strict morphological cuts at these redshifts but rather

748: assign gas-rich star-forming systems to a single category

749: (and this is how such uncertainties would enter into our

750: model). In this aspect, the uncertainties from assigning them implicit ``disk'' properties

751: may actually be less than one might expect, because at the point where

752: a galaxy is very gas-rich (increasingly

753: common at these high redshifts), it does not matter what initial configuration this

754: gas is in (whether e.g.\ clumpy, filamentary, or infalling in direct collapse), as

755: it is dissipational and in any case will (in mergers) lose angular momentum

756: and form a dense central starburst.

757: In any case, these uncertainties lead us to limit our predictions to $z<4$.

758: Fortunately, because at any lower redshift most spheroids have assembled

759: relatively recently, with decreasing mass fractions contributed by

760: galaxies assembled at very early times, the uncertainties associated with

761: the lack of constraints at these high redshifts rapidly become

762: less important to our predictions.

763:

764:

765: \subsection{The Simulations}

766: \label{sec:model:sims}

767:

768:

769: In order to determine the effects that a given merger will have on galaxy properties, we

770: require some estimates of how e.g.\ the gas content of initial galaxies (and other properties)

771: translates into properties of the remnant.

772: We derive these prescriptions from a large library of hydrodynamic

773: simulations of galaxy encounters and mergers, described in detail

774: in \citet{robertson:fp,cox:kinematics,younger:minor.mergers} and \paperone.

775: These amount to several hundred unique simulations, spanning a wide

776: range in progenitor galaxy masses, gas fractions, orbital parameters,

777: progenitor structural properties (sizes, concentrations, bulge-to-disk ratios),

778: and redshift.

779:

780: Most of the simulations are major (mass ratios

781: $\sim1:3$-$1:1$), binary encounters between

782: stellar disks (which provide an idealized scenario useful for calibrating

783: how details of the remnant depend on specific progenitor properties),

784: but they include a series of minor mergers (from mass ratios $\sim1:20$ to

785: $\sim1:3$), as well as spheroid-spheroid ``re-mergers'' or ``dry mergers'' (i.e.\ mergers of the

786: elliptical remnants of previous merger simulations),

787: mixed-morphology (spiral-elliptical) mergers \citep[see also][for a

788: detailed study of these mergers]{burkert:mixed.morph.boxiness,

789: johansson:mixed.morph.mbh.sims}, multiple

790: mergers, and rapid series of hierarchical mergers.

791: Our adopted prescriptions are robust to these choices.

792: The simulations usually include accretion and feedback from supermassive

793: black holes, as well as feedback from supernovae and stellar winds.

794: However, we have performed parameter studies in these

795: feedback prescriptions, and find that the

796: structural properties of interest here are relatively insensitive to

797: these effects \citep{cox:winds,hopkins:bhfp.theory,hopkins:cusps.mergers}.

798:

799: The predicted simulation scalings are discussed in detail

800: in \paperone-\paperthree, where we

801: take advantage of high-resolution observations of local merger remnants

802: \citep[from][]{rj:profiles} as well as both nuclear cusp and core ellipticals

803: \citep[from][]{lauer:bimodal.profiles,jk:profiles} to test them observationally.

804: We find good agreement between the predicted and observed scalings

805: with e.g.\ stellar mass and gas content (of the progenitor galaxies)

806: at the time of the merger. If, as is commonly believed, core ellipticals

807: are the product of spheroid-spheroid re-mergers, then

808: our comparisons in \paperthree\ also confirm the scalings

809: derived from simulated re-mergers.

810:

811:

812: \subsubsection{Mergers without Gas}

813: \label{sec:model:sims:dry}

814:

815:

816: In mergers, the stars and dark matter form a collisionless system, and hence

817: gas free (dissipationless or dry) mergers

818: are particularly easy to handle. Furthermore, direct comparison

819: with our simulations suggests that, to the desired accuracy,

820: it is a good approximation to treat the dissipational and dissipationless

821: components of a merger separately, allowing us to apply simple rules to the

822: {\em stellar} remnant

823: in mergers of both gas-rich disks and gas-poor spheroids.

824:

825: It is well established in numerical simulations that, to lowest order, the

826: effect of any merger of dissipationless components

827: is to ``puff them up'' by a uniform factor,

828: while roughly conserving profile shape \citep[at least for

829: certain ``equilibrium'' profile shapes such as

830: the \citet{nfw:profile}, \citet{hernquist:profile}, or \citet{devaucouleurs} profiles;

831: see e.g.][]{barnes:disk.disk.mergers,boylankolchin:mergers.fp,boylankolchin:dry.mergers,

832: hopkins:cores}.

833: The conservation of both energy and phase space

834: density means that, modulo a small normalization offset owing to possibly

835: different profile shapes

836: the remnant of e.g.\ two initial stellar disk or spheroid mergers will have a similar final

837: effective radius.

838:

839: In detail, if we temporarily ignore the

840: halos of the galaxies (a good approximation in most simulations, since

841: the specific binding energy of the galaxy baryonic mass is much larger than

842: that of the halo) and consider isotropic systems, we obtain for the parabolic merger

843: of systems of mass $M_{1}$ and $M_{2}=f\,M_{1}$ ($f\le1$) the energy conservation equation

844: \begin{equation}

845: E_{f} = k_{f}\,(M_{1}+M_{2})\,\sigma_{f}^{2} = E_{i} = k_{1}\,M_{1}\,\sigma_{1}^{2} +

846: k_{2}\,M_{2}\,\sigma_{2}^{2}

847: \label{eqn:egy}

848: \end{equation}

849: where $k$ is a constant that depends weakly on profile shape \citep[for greater

850: detail, see][]{ciotti:dry.vs.wet.mergers}. If profile

851: shape is roughly preserved and $R\propto M/\sigma^{2}$,

852: then we expect

853: \begin{equation}

854: R_{f} = R_{1}\,\frac{(1+f)^{2}}{(1+f^{2}\,\frac{R_{1}}{R_{2}})}.

855: \label{eqn:re.dry}

856: \end{equation}

857: A merger of two perfectly identical spheroids

858: will double $R_{e}$ and conserve $\sigma$.

859: It is straightforward to solve the appropriate energy conservation equation

860: numerically, allowing for arbitrary profile shapes and

861: for halo components (assuming the halo profile follows

862: \citet{nfw:profile} or \citet{hernquist:profile} profiles), but in practice we find that

863: these subtleties make no almost no difference compared to

864: using Equation~(\ref{eqn:re.dry}).

865:

866: In a merger of two stellar disks, violent relaxation will transform the

867: stellar distribution from initially exponential disks into

868: \citet{devaucouleurs}-like profiles, more generally \citet{sersic:profile} profiles (of the form

869: $I_{e}\propto \exp{[-(r/r_{0})^{1/n_{s}}]}$ with $n_{s}\sim2.5-3.5$

870: after a single major disk-disk merger

871: \citep[see \paperone-\papertwo\ and][]{naab:profiles},

872: while the halo profile shape is approximately

873: preserved \citep[\paperfour,][]{boylankolchin:mergers.fp}. These trends in simulations can both

874: be roughly explained by

875: allowing the merger to scatter stars and dark matter

876: (in addition to uniformly puffing up the

877: profile) in their final three-dimensional radii by some lognormal broadening

878: factor $\sigma_{r}\sim0.3-0.4\,$dex (in a $1:1$ merger). In other words,

879: while the median final (post-merger) effective radius of stars at some initial radius $r_{i}$

880: is given by the uniform puffing up or stretching of

881: the profile in Equation~(\ref{eqn:re.dry}),

882: there is a lognormal scatter with dispersion $\sigma_{r}$ in the final radii of stars

883: from that initial radius.

884: Over the relevant dynamic range for

885: most practical observational purposes (from $\sim 0.01\,R_{e}$ to $\gtrsim10\,R_{e}$,

886: or as much as 15 magnitudes in surface brightness),

887: this will effectively transform an initial exponential profile into an

888: $n_{s}\approx3$ ($\sigma_{r}=0.3$) or $n_{s}=4$ ($\sigma_{r}=0.4$)

889: profile.

890:

891: Subsequent spheroid-spheroid or spheroid-disk mergers continue to scatter stars out

892: to larger radii, building an extended envelope of material,

893: and raising the best-fit Sersic index of the dissipationless component of the remnant.

894: We study this in \paperthree, and find to a good approximation that a re-merger of

895: mass ratio $f\leq1$ will raise the $n_{s}$ of the dissipationless component

896: by $\Delta n_{s}\sim f$, or equivalently ``broaden'' the three-dimensional stellar

897: distribution by a lognormal factor $\sim 0.3\,f$ (the prescription we adopt). At

898: $z=0$ we can then determine an ``observed'' Sersic index by projecting this mock

899: profile (with that of the dissipational component) and fitting to the system.

900: If we consider the Sersic indices directly fitted to our simulations --

901: i.e.\ draw a Sersic index for a disk-disk merger remnant randomly from

902: that fitted to our simulated mergers of the same mass ratio, and so on -- we

903: obtain similar predictions.

904:

905: Finally, having the profile of the dissipationless stellar and halo components, we

906: can calculate the velocity dispersion $\sigma$ of stars within some radius

907: (here we take the mass-weighted velocity dispersion of stars within

908: $R_{e}$, and assume isotropic orbits, although this is not a dominant

909: uncertainty in our predictions).

910: We can instead (without any reference to the isotropy assumption)

911: assume that the dynamical mass estimator, $\sigma^{2}\,R_{e}$,

912: is a good tracer of the true enclosed total mass within the stellar effective

913: radius $R_{e}$ (modulo a normalization constant that is the same for

914: most ellipticals) -- a fact seen in our simulations (\paperfour) and

915: observations \citep{cappellari:fp,bolton:fp,bolton:fp.update} --

916: and use the known total mass ($M(<R_{e})$) to predict $\sigma$.

917: The results are similar.

918:

919: We note that halos are tracked with stars in these processes,

920: but between mergers, the halos still grow. We assume at all times

921: that the dark matter halo follows a \citet{hernquist:profile} profile (essentially

922: identical to an \citet{nfw:profile} profile at the radii that matter for the

923: baryonic galaxy, but with finite mass and analytically tractable properties, a

924: more useful approximation since we are considering halos as if in isolation), with a

925: concentration from the mean concentration-halo mass-redshift

926: relation in simulations and observations (see \S~\ref{sec:model:hod}

927: and \citet{springel:models}).

928: We then calculate the halo contribution to quantities such as e.g.\ the

929: dynamical mass within $R_{e}$ and $\sigma$ based on this profile.

930: However, we could also assume the dark matter within the stellar effective

931: radius ``freezes out'' with each major merger (new dark matter within $R_{e}$

932: only being what is brought in by subsequent mergers, within the $R_{e}$ of

933: those galaxies at the time of their first merger) -- it makes relatively

934: little difference, since the mass densities in the cores of halos do not

935: strongly evolve with redshift \citep{bullock:concentrations}.

936:

937:

938: \subsubsection{Mergers with Gas}

939: \label{sec:model:sims:wet}

940:

941:

942: In a gas-rich merger, the gas loses angular momentum and, being dissipative,

943: radiates its energy and collapses to the center of the galaxy. The collapse will,

944: in general, continue until the gas becomes self-gravitating, at which point it

945: rapidly turns into stars as it further contracts \citep[for details, see][]{hopkins:disk.survival}.

946: The remnant will

947: exhibit a ``dissipational component'' -- the dense central stellar concentration

948: which is the relic of these dissipational merger-induced starbursts.

949:

950: \citet{covington:diss.size.expectation} consider the dissipation of energy in

951: mergers and use it to estimate the effect this will have on the size of the merger

952: remnant \citep[see also][]{ciotti:dry.vs.wet.mergers},

953: which we test in \papertwo\ and \paperthree\ and show is a good approximation to

954: our simulations and observed systems, and can be reduced

955: to the approximation

956: \begin{equation}

957: %R_{e}({\rm net}) = \frac{R_{e}({\rm dissipationless})}{1+(f_{\rm gas}/0.1)^{0.8}},

958: %R_{e}({\rm net}) \approx \frac{R_{e}({\rm dissipationless})}{[1+(f_{\rm gas}/0.2)^{1.2}]^{0.8}},

959: R_{e} \approx \frac{R_{e}({\rm dissipationless})}{1+(f_{\rm gas}/f_{0})},

960: \label{eqn:re.wet}

961: \end{equation}

962: where $f_{0}\approx0.25-0.30$, $R_{e}({\rm dissipationless})$ is the effective

963: stellar radius of the final merger remnant if it were entirely dissipationless, and

964: $f_{\rm dissipational}$ is the baryonic

965: mass fraction from the starburst. For most major mergers,

966: gas in the disks at the time of the

967: merger will rapidly lose energy, and star formation is very efficient,

968: so $f_{\rm dissipational}$ reflects the gas fractions of the progenitor

969: disks immediately before the merger $f_{\rm dissipational}({\rm merger})=

970: f_{\rm gas} = M_{\rm gas}({\rm disks})/M_{\ast}({\rm remnant})$

971: (gas blown out by feedback from black holes and stellar winds, even with extreme

972: feedback prescriptions, makes little difference here: it predominantly effects $f_{\rm gas}$

973: of the pre-merger disks over much longer timescales, rather than the consumption

974: in mergers). We show in \citet{hopkins:disk.survival} that this is not strictly true in

975: extremely gas-rich mergers ($f_{\rm gas} > 0.5$), where torques are inefficient at

976: driving dissipation, and ellipticals may not be formed from even major mergers: these

977: extreme cases however are seen only at the lowest observed masses, and do not affect

978: most of our predictions (though we note where this may be important).

979:

980: Given the typical exponential-like ($n_{s}\sim1-2$) character of these dissipational components

981: when they first form (owing to their formation from infalling gas), we can invert this to

982: infer what the effective radius of the dissipational component

983: specifically must be (given that effective radius and profile, we

984: can then construct a mock profile of the entire galaxy).

985: Based on our study of re-merger simulations in \paperthree, we find that

986: the two components (those which were originally dissipational at their formation, and

987: those which were not -- i.e. which originally came from stellar disks) tend to

988: be separately conserved even after several dry or gas-poor spheroid-spheroid re-mergers.

989: In a series of mergers, then, it is a good approximation to treat the two separately:

990: dissipationless components grow and add as described above. Dissipational components

991: form with a mass given by the gas content of merging disks with sizes appropriate for the

992: above relation, and then evolve either by gas poor (spheroid-spheroid) merging

993: (where they will add with

994: other stellar dissipational components formed in earlier mergers, following the rules

995: for dissipationless merging above since both components are stellar), or

996: by more gas-rich (spheroid-disk) merging (in which case new dissipational mass

997: is added given the prescriptions above).

998:

999: Dissipation will also fuel growth of the

1000: central supermassive black hole. Based on our theoretical modeling of

1001: self-regulated black hole growth in simulations \citep{dimatteo:msigma,

1002: hopkins:faint.slope,hopkins:seyferts,hopkins:bhfp.theory}

1003: and that of others

1004: \citep[e.g.][]{silkrees:msigma,murray:momentum.winds, ciottiostriker:recycling}, essentially

1005: any model where the black hole regulates its growth by halting accretion (allowing

1006: some fraction of the energy or momentum of accretion to couple to the surrounding

1007: media) will give rise to a fundamental correlation

1008: between black hole mass and spheroid binding energy \citep{hopkins:bhfp.theory},

1009: more specifically with the binding energy of the gas in the central regions whose

1010: inflow must be suppressed to regulate black hole growth.

1011:

1012: There is direct evidence for this in the observations as well, with \citet{hopkins:bhfp.obs}

1013: finding a $\sim3\,\sigma$ preference in the data for a correlation of the

1014: form $M_{\rm BH}\sim \mstar^{0.5-0.7}\,\sigma^{1.5-2.0}$, similar

1015: to and consistent with the ``fundamental'' correlation being one with bulge

1016: binding energy, which is roughly traced by $E_{b}\sim\mstar\,\sigma^{2}$

1017: \citep[$M_{\rm BH}\propto E_{b}^{0.71}$, see also][]{aller:mbh.esph}.

1018: Indirectly, this can then

1019: explain e.g.\ the observed $M_{\rm BH}-\sigma$ \citep{FM00,Gebhardt00} and

1020: $M_{\rm BH}-\mstar$ \citep{magorrian} correlations.

1021:

1022: Motivated by our simulations and these observational results, we model black

1023: hole growth as follows: in each merger, we allow the black hole to grow by an

1024: amount proportional to the binding energy of

1025: the new dissipational components of the merger (i.e.\ the starburst gas), in addition

1026: to summing the masses of the two progenitor black holes \citep[we do not model

1027: any gravitational kicks that might expel black holes, but theoretical estimates suggest

1028: that by $z=0$ in the massive galaxies we are interested in here, such effects

1029: are small if they are present at all;][]{volonteri:xray.counts,volonteri:recoils}.

1030: We include an intrinsic scatter $\sim0.2\,$dex, again motivated by simulations

1031: (although there will be some scatter in the resulting correlations regardless,

1032: reflecting different gas content and potential depth at the time of merger, with

1033: subsequent evolution to $z=0$).

1034: The exact proportionality constant

1035: is a reflection of the physics of feedback efficiencies and

1036: coupling, but we adopt a constant calibrated in simulations to

1037: match the normalization of the observed $z=0$ $M_{\rm BH}-\sigma$ relation

1038:  \citep[][]{dimatteo:msigma,hopkins:lifetimes.methods,hopkins:bhfp.theory}.

1039: Any other results (e.g.\ the slope

1040: of these correlations, residual correlations or correlations with other

1041: structural parameters, and/or redshift evolution in the correlations)

1042: are genuine predictions of the model, not this calibration.

1043:

1044:

1045:

1046: \breaker

1047: \section{Predicted Scaling Laws at $z=0$}

1048: \label{sec:z0}

1049:

1050: \begin{figure}

1051:     \centering

1052:     %\scaleup

1053:     %\plotone{MC_fp_proj.ps}

1054:     \plotone{f1.ps}

1055:     \caption{Predicted spheroid scaling laws at $z=0$. {\em Top:} Effective

1056:     radius as a function of stellar mass. {\em Left:} Our simulated systems are shown

1057:     (equal numbers of systems per logarithmic interval in mass are simulated and

1058:     plotted, to represent the full dynamic range),

1059:     with color encoding the redshift of their last gas-rich merger

1060:     (from black for $z=0$ to red for $z\gtrsim3$); solid

1061:     lines show the median trends for systems with

1062:     this redshift: $z=0-1$ (black), $1-2$ (blue), $2-3$ (orange), and $>3$ (red).

1063:     {\em Right:} Solid (dashed) line shows the median ($\pm1\sigma$) trend

1064:     from the simulations.

1065:     Observed systems

1066:     from the samples of \citet[][red stars]{jk:profiles} and

1067:     \citet[][purple circles]{lauer:bimodal.profiles} are shown for comparison ({\em right}).

1068:     This notation is used throughout.

1069:     {\em Bottom:} Projected central velocity dispersion as a function of

1070:     stellar mass.

1071:     \label{fig:fp.proj}}

1072: \end{figure}

1073:

1074: \begin{figure}

1075:     \centering

1076:     %\scaleup

1077:     %\plotone{MC_fp.ps}

1078:     \plotone{f2.ps}

1079:     \caption{The fundamental plane. {\em Top:} Dynamical mass as a function of

1080:     stellar mass (style as in Fig. \ref{fig:fp.proj}). {\em Bottom:} Effective radius as a

1081:     function of $\sigma$ and $\mu$ (here converted to stellar surface density $\Sigma$).

1082:     Dotted red line shows the virial relation (no tilt in the fundamental plane).

1083:     \label{fig:fp}}

1084: \end{figure}

1085:

1086: Figures~\ref{fig:fp.proj} \&\ \ref{fig:fp} plot the fundamental plane correlations of

1087: ellipticals at $z=0$, produced by this simple model. Here

1088: (and in subsequent figures) we draw systems randomly

1089: from the halo occupation distribution with the restriction that we draw equal numbers

1090: of simulated systems per logarithmic interval in halo mass (in order to represent

1091: the full dynamic range of interest in the plots). This differs slightly from selecting

1092: e.g.\ a volume-limited sample, which would be dominated by low-mass systems

1093: (less useful for studying the distribution in a parameter at a given mass,

1094: which is our intention here).

1095:

1096: We compare

1097: with a large sample of observed ellipticals: specifically the combination of

1098: the \citet{jk:profiles} compilation of Virgo ellipticals

1099: and the bright elliptical sample of \citet{lauer:bimodal.profiles} (which

1100: extends this dynamic range at the cost of slightly higher uncertainties in the data),

1101: with nuclear {\em HST} observations and ground-based data at large radii

1102: (allowing accurate surface brightness profile measurements from

1103: $\sim 50$\,pc to $\sim50$\,kpc)\footnote{The profiles cover sufficient dynamic

1104: range that effective radii can be determined directly from the integrated surface

1105: brightness profiles; adopting those from fitted profiles changes the results

1106: by a negligible amount. Velocity dispersions are compiled by the authors,

1107: Sersic indices are fitted in \papertwo, and stellar masses are determined from

1108: the integrated photometry using the color-dependent mass-to-light ratios

1109: from \citet{bell:mfs}. We have compared the results given

1110: different observed profiles for the same objects

1111: and different stellar mass estimators, and find that (in a statistical sense) they

1112: are unchanged.}.

1113: The correlations traced by these samples throughout this paper are

1114: statistically indistinguishable from the best-fit correlations (over the range observed)

1115: determined from volume-limited samples of (primarily field) early-type galaxies in

1116: the SDSS, in e.g.\ \citet{shen:size.mass,bernardi:fp,bernardi:bcg.scalings,

1117: gallazzi06:ages,vonderlinden:bcg.scaling.relations}. There may be a slight (not highly

1118: significant) offset between the data-sets, in the sense that the

1119: objects in the Virgo sample may have slightly larger $R_{e}$ and smaller $\sigma$

1120: than the \citet{lauer:bimodal.profiles} sample

1121: at fixed mass; if real, this may relate to the former being a cluster sample,

1122: or to the likely inclusion of a non-negligible S0 population in the latter.

1123: But in any case, the possible offset between the two is smaller than the uncertainties

1124: in our modeling (in terms of normalizing e.g.\ absolute sizes of systems, requiring

1125: accurate priors on their progenitor disk and halo sizes), and within the

1126: predicted and observed scatter, and so reasonably brackets the range of

1127: different observations.  (For more details of the observational samples

1128: and their analysis, we refer to those papers and \papertwo.)

1129:

1130: The correlations are accurately reproduced at $z=0$. Specifically,

1131: the size-mass relation has a steep logarithmic slope $R_{e}\sim \mstar^{0.6}$, compared

1132: to $R_{e}\sim \mstar^{0.3}$ for the progenitor disks -- the sense of this is specifically

1133: that low-mass ellipticals are more compact, relative to their progenitor disks. The

1134: velocity dispersion-mass (Faber-Jackson) relation is also reproduced, although

1135: this correlation is less dramatically different from that obeyed by disks (the

1136: baryonic Tully-Fisher relation). The resulting fundamental plane, expressed

1137: as either $\mdyn(\mstar)$ or $R_{e}(\sigma,\ \mstar)$, is also reproduced; i.e.\ the

1138: predicted ratios of enclosed dark matter to stellar mass within the stellar

1139: $R_{e}$ agree with observations as a function of mass (note that with the adopted

1140: definition of $\mdyn\propto\sigma^{2}\,R_{e}$, $\mdyn$ can be less

1141: than $M_{\ast}$). Specifically,

1142: the tilt of the fundamental plane is reproduced -- rather than $\mdyn\propto\mstar$,

1143: the expectation if systems were perfectly homologous, we predict

1144: $\mdyn\propto\mstar^{1+\alpha}$ (where $\alpha\approx0.2$ is the FP tilt),

1145: namely that lower-mass systems (as a consequence of their being more compact)

1146: are more baryon-dominated within their stellar effective radii.

1147: (The nature of the FP scalings are discussed in much greater detail in

1148: \paperfour; however, we outline the dominant physical effects in

1149: \S~\ref{sec:z0:diss} \&\ \ref{sec:z0:dry} below.)

1150:

1151: We plot the simulated systems, with colors denoting the redshift of their

1152: last gas-rich merger -- i.e.\ approximately the time at which they should

1153: have stopped forming stars. It is clear that, at fixed stellar mass (or

1154: velocity dispersion, given that it more tightly correlates

1155: with stellar mass than e.g.\ effective radius), systems with larger radii (or larger $\mdyn/\mstar$)

1156: tend to be older -- they are often systems that merged at early times and

1157: have grown dissipationlessly since then.

1158: This appears to be borne out in recent observational estimates based on stellar population

1159: constraints in local ellipticals \citep{gallazzi06:ages,graves:prep}.

1160: (We discuss this further below.)

1161:

1162:

1163: \begin{figure}

1164:     \centering

1165:     %\scaleup

1166:     %\plotone{MC_ns.ps}

1167:     \plotone{f3.ps}

1168:     \caption{Correlation between stellar mass and estimated best-fit galaxy

1169:     Sersic index $n_{s}$ (style as in Figure~\ref{fig:fp.proj}).

1170:     {\em Top:} Sersic indices obtained fitting the entire galaxy profile

1171:     to a single Sersic law (over typical observed dynamic range).

1172:     {\em Bottom:} Sersic indices of just the dissipationless (outer, violently relaxed)

1173:     component (not including the dissipational central starburst). Observational

1174:     estimates are from the two-component modeling in \papertwo\ and \paperthree.

1175:     Owing to the

1176:     cosmological dependence of merger history on mass, there is an effective correlation

1177:     between stellar mass and $n_{s}$. This is most directly reflected in the

1178:     outer, dissipationless component. Fitting the systems to a single Sersic index

1179:     represents a more complex combination of merger history and the amount of

1180:     dissipation (both effects can mimic one another).

1181:     %{\bf use a different label for $n_{s}$(outer)? ???}

1182:     \label{fig:ns}}

1183: \end{figure}

1184:

1185: Figure~\ref{fig:ns} plots the

1186: Sersic index of the galaxy light profiles predicted at $z=0$. We stress that these are

1187: rough estimates at best -- individual galaxy profiles show a degree of diversity

1188: representative of many details in their formation and merger history that only

1189: high-resolution numerical simulations can capture. We therefore refer to

1190: \papertwo\ and \paperthree\ for a more detailed study and comparison

1191: of Sersic indices in simulated and observed systems; however, comparison of

1192: our estimates with numerical simulations suggests that the adopted prescriptions

1193: capture at least the median behavior in simulations.

1194:

1195: First, we

1196: consider the Sersic index fitted to the {\em entire} galaxy profile -- this is of course a

1197: useful observational parameter, but it reflects a mix of different {\em physical}

1198: components of the galaxy. In order to transform a galaxy from an exponential

1199: disk profile (a Sersic index $n_{s}=1$) to a typical massive elliptical with a high Sersic

1200: index $\gtrsim4$, both dissipation and violent relaxation are necessary. Violent

1201: relaxation will scatter stars at large radii, building up an extended envelope

1202: (characteristic of high-$n_{s}$ systems), but high-$n_{s}$ profiles rise steeply

1203: at small radii, so dissipation is needed to build up a dense central stellar concentration.

1204: The effects of repeated mergers on the profile at large radii are relatively straightforward,

1205: and (more or less) monotonically increase $n_{s}$, but the

1206: consequences of dissipation are more complex (related not just to the dissipational

1207: mass fraction but to the relative concentration of the dissipative component and

1208: its profile shape) and non-monotonic.

1209:

1210: We construct the best-fit Sersic index

1211: fitting a dynamic range comparable to the observations plotted (representative of

1212: local galaxies for which the highest-quality data is available),

1213: from $\sim50\,$pc to $\sim 30$\,kpc. Because the galaxy profiles are

1214: multi-component, the resulting $n_{s}$ reflects a complex combination of

1215: the observed dynamic range, the relative mass, size, and slope of the dissipational

1216: component, and the merger history.

1217: Together, these effects yield an apparent

1218: steep dependence of Sersic index on mass, in agreement

1219: with the observations, but with large scatter (whereas if the profiles were entirely

1220: dissipationless, the lack of dense stellar concentrations at small radii would restrict them

1221: to smaller Sersic indices and lead to a weaker dependence of $n_{s}$ on

1222: merger history and mass).

1223: Most of the strength of the dependence

1224: here is ultimately a coincidence reflecting particular combinations of parameters

1225: moving into or out of the dynamic range for which the fits are sensitive.

1226:

1227: If we consider

1228: the Sersic indices of just the dissipationless component (i.e.\ the sum of the

1229: stellar components of pre-merger disks that were violently relaxed in the

1230: initial gas-rich mergers that formed each spheroid progenitor),

1231: the dependence on mass is weaker, but it more directly reflects the merger

1232: history of the systems (and depends much less on e.g.\ the dynamic

1233: range of the fit).

1234: We compare this with multi-component decompositions of the same observed

1235: systems, based on a more detailed,

1236: physically motivated approach to studying galaxy profiles which

1237: we discuss in detail in \paperone-\paperthree, where we demonstrate that

1238: the ``dissipationless'' component of observed ellipticals can, in fact, be recovered.

1239: By definition in our simple model, the dependence we predict now is

1240: entirely driven by the mean dependence of merger history on mass -- systems

1241: which have experienced more mergers (and more violent mergers) will have had

1242: more stars scattered to large radii in those mergers, raising the $n_{s}$ of the

1243: outer component.

1244: At low masses,

1245: most systems formed in $\sim1$ major merger at relatively low redshifts

1246: $z<1-2$, so a typical $n_{s}\sim2-3$ is obtained. At high masses, systems often have their

1247: first merger at very high redshifts, involving a large degree of dissipation, forming a

1248: dense core, and then experience a number of dry mergers, building up a more

1249: extended dissipationless outer component and envelope and raising $n_{s}$.

1250:

1251: The apparent trends of Sersic index with mass are

1252: much stronger if dwarf spheroidals are included, as

1253: in many observational samples \citep[e.g.][]{graham:sersic,ferrarese:profiles}. These

1254: systems have most likely not experienced any significant mergers -- as a structural

1255: family, they are clearly related to disks, not ellipticals, and their Sersic indices

1256: (typical $n_{s}\sim1$) demonstrate this; that they dominate at low masses reflects e.g.\ the

1257: dominance of disks and ellipticals, and is again driven by the cosmological

1258: dependence of merger history on mass.

1259:

1260: \begin{figure}

1261:     \centering

1262:     %\scaleup

1263:     %\plotone{MC_bh_corr.ps}

1264:     \plotone{f4.ps}

1265:     \caption{Correlation between black hole mass and host spheroid properties

1266:     (effective radius, velocity dispersion, stellar mass, and the observable proxy for

1267:     bulge binding energy, $\mstar\,\sigma^{2}$, effectively the same as the ``black hole

1268:     fundamental plane'' observed). Style as in Figure~\ref{fig:fp.proj}.

1269:     Magenta stars show observed systems with direct black hole mass measurements,

1270:     compiled in \citet{hopkins:bhfp.obs} \citep[see][]{magorrian,merrittferrarese:msigma,

1271:     tremaine:msigma,marconihunt,haringrix}.

1272:     A simple prescription for black how growth based

1273:     on dissipational binding energy reproduces the $z=0$ correlations.

1274:     \label{fig:mbh}}

1275: \end{figure}

1276:

1277: Figure~\ref{fig:mbh} compares the predicted and observed correlations between

1278: nuclear black hole mass and host bulge/spheroid properties (effective radius,

1279: velocity dispersion, stellar mass, and binding energy). Recall, motivated

1280: by observational and theoretical expectations of a black hole ``fundamental plane''

1281: \citep{hopkins:bhfp.obs,hopkins:bhfp.theory} the fundamental ``assumed''

1282: correlation here is between black hole mass and binding energy of the

1283: dissipational component of the bulge, determined at the time of that dissipation

1284: (time of gas-rich merger). Despite the complex interplay between this and the

1285: total bulge mass and size, dry mergers, and evolution in halo bulge properties,

1286: the $z=0$ correlations

1287: \citep[including the residual correlations observed in][]{hopkins:bhfp.obs})

1288: are accurately reproduced. The curvature seen in the $M_{\rm BH}-R_{e}$

1289: and $M_{\rm BH}-\sigma$ relations reflects a combination of the curvature

1290: seen in the correlations between these properties and stellar mass, and

1291: the increasing importance of dry mergers at higher masses.

1292:

1293: \begin{figure}

1294:     \centering

1295:     %\scaleup

1296:     %\plotone{MC_mbh_ns.ps}

1297:     \plotone{f5.ps}

1298:     \caption{Correlation between black hole mass and estimated best-fit galaxy

1299:     Sersic index $n_{s}$ (style as in Figure~\ref{fig:fp.proj}).

1300:     Magenta stars show observed systems as in Figure~\ref{fig:mbh}.

1301:     Owing to the correlation between $\mbh$ and $\mstar$,

1302:     there is a correlation between $\mbh$ and $n_{s}$, but the

1303:     scatter is significant at large $M_{\rm BH}$ (where there is an

1304:     extended tail towards very high $n_{s}$, sensitive to the dynamic range fitted).

1305:     \label{fig:mbh.ns}}

1306: \end{figure}

1307:

1308: We also consider the correlation between black hole mass and Sersic index in

1309: Figure~\ref{fig:mbh.ns}, comparing to the data compiled in

1310: \citet{hopkins:bhfp.obs} \citep[an update of the fits in][]{graham:concentration,

1311: graham:sersic}. Owing to the

1312: dependence of $n_{s}$ on stellar mass, there is an indirect correlation

1313: between $M_{\rm BH}$ and $n_{s}$, consistent with that claimed

1314: in \citet{graham:sersic}. Those authors note that the correlation is not well-fitted

1315: by a single power-law; here we see such curvature. The reason is that

1316: most objects at $\mstar\lesssim10^{10}\,\msun$ have similar

1317: merger histories, there is little dependence of $n_{s}$ on $\mstar$

1318: or $\mbh$ in this mass range, then a steep dependence, then

1319: again a relatively flat dependence at high $M_{\rm BH}$.

1320: The same caveats regarding these Sersic indices apply as in

1321: Figure~\ref{fig:ns}. In this model, the correlation between Sersic index

1322: and black hole mass may appear reasonably tight over a narrow

1323: range, but this is accidental, and there is no

1324: additional information on the black hole mass

1325: derived from the single Sersic index fits.

1326:

1327:

1328: \subsection{The Effects of Dissipation}

1329: \label{sec:z0:diss}

1330:

1331: \subsubsection{What Drives the Amount of Dissipation?}

1332: \label{sec:z0:diss:driver}

1333:

1334: \begin{figure}

1335:     \centering

1336:     %\scaleup

1337:     %\plotone{MC_fdiss.ps}

1338:     \plotone{f6.ps}

1339:     \caption{Mass fraction formed dissipationally (integrated mass fraction of the

1340:     $z=0$ elliptical formed from gas in progenitor disks at the time of

1341:     gas-rich mergers, in dissipational starbursts) as a function of stellar mass

1342:     (style as in Figure~\ref{fig:fp.proj}).

1343:     We compare ({\em top right}) with empirical estimates of

1344:     this quantity in observed ellipticals from the study of a large sample of observed

1345:     elliptical surface brightness

1346:     profiles and dissipation in \papertwo\ and \paperthree.

1347:     We also compare ({\em bottom}) with observed disk galaxy gas fractions as a function of

1348:     stellar mass, at $z=0$ \citep[][blue diamonds,

1349:     squares respectively]{kannappan:gfs,mcgaugh:tf},

1350:     $z=1$ \citep[][green squares]{shapley:z1.abundances},

1351:     and $z=2$ \citep[][red circles]{erb:lbg.gasmasses}.

1352:     The dissipational fractions at $z=0$ agree well with those observed, and

1353:     reflect the gas fractions of their progenitor disks over the redshift range $z\sim0-2$

1354:     where most systems had their last merger(s).

1355:     \label{fig:diss}}

1356: \end{figure}

1357:

1358: Dissipation is one of the most important factors

1359: controlling the properties of remnant ellipticals.

1360: We demonstrate this robustly with detailed observations and simulations in

1361: \papertwo\ and \paperthree. Here, we discuss how its effects are manifest in a

1362: cosmologically representative population, including systems which have experienced

1363: a number of dry mergers.

1364: Figure~\ref{fig:diss} shows the integrated fraction of our $z=0$ ellipticals which formed

1365: in dissipative starbursts (from gas in disks at the time of each merger), as opposed to

1366: dissipationlessly scattered stellar disks. Recall, dissipation allows gas to collapse to

1367: small scales, building up a dense central component in the stellar mass distribution and

1368: changing the phase-space distribution of the remnant.

1369: We compare with the observationally estimated

1370: dissipational fractions in local ellipticals, determined in \papertwo\ and

1371: \paperthree. The agreement is good at each mass - in other words, the

1372: amount of dissipation observed in ellipticals can be explained by our

1373: simple model.

1374:

1375: What drives the trend of dissipation with mass? We find that it reflects the typical

1376: gas fractions of disks as a function of mass.

1377: That trend (in disks) may, of course, be a complex

1378: consequence of poorly understood processes such as star formation, cold

1379: gas accretion, and stellar feedback; but for our purposes here we are not interested

1380: in how this property arises in disks (simply what the observed properties of disks are,

1381: which are captured by construction in our halo occupation approach).

1382:

1383: Figure~\ref{fig:diss}

1384: compares these integrated dissipational fractions with observationally estimated

1385: gas fractions of disks (as a function of stellar mass)

1386: at $z=0$ \citep[see e.g.][]{belldejong:disk.sfh,

1387: kannappan:gfs,mcgaugh:tf}, $z=1$ \citep{shapley:z1.abundances},

1388: and $z=2$ \citep{erb:lbg.gasmasses}.

1389: There is a systematic evolution in gas fractions with redshift, as expected, but

1390: the gas content of observed disks from $z\sim0-2$ appears to bracket the

1391: range in both our predicted dissipational fractions and the observed dissipational

1392: fractions of ellipticals.

1393:

1394: At the lowest masses, where the

1395: predicted dissipational fractions and observed disk gas fractions approach unity,

1396: the observationally inferred dissipational fractions are somewhat lower, appearing

1397: to reach a maximum near $\sim0.4$. This most likely relates to the fact that extremely

1398: gas-rich mergers will not efficiently dissipate angular momentum and will leave

1399: disk-dominated, rather than elliptical remnants \citep[for details, see][]{hopkins:disk.survival}.

1400: These subtleties, not included in our model here, are important for the evolution of

1401: bulge-to-disk ratios in low-mass galaxies, but do not significantly affect our predictions over

1402: most of the mass range of interest ($M_{\ast}\gtrsim10^{10}\,M_{\sun}$, where

1403: $f_{\rm gas}\lesssim0.5$).

1404:

1405: At fixed mass, systems with earlier formation times tend to have

1406: slightly lower dissipational

1407: fractions; we emphasize that this is true when the systems are observed

1408: at $z=0$. At their time of formation, early-forming systems may have had

1409: very large dissipational fractions, reflecting the characteristically higher

1410: gas fractions of disk progenitors at these redshifts (we show this in

1411: greater detail in \S~\ref{sec:track} below). However, these systems will

1412: also be in the most biased environments, and so (in a $\Lambda$CDM

1413: context) undergo relatively more growth at later times, assembling

1414: much of their $z=0$ mass via mergers with less gas content (systems

1415: that have largely gas-exhausted and/or quenched) and correspondingly

1416: lower dissipational fractions.

1417:

1418: \begin{figure}

1419:     \centering

1420:     %\scaleup

1421:     %\plotone{MC_strongz_n_mgr.ps}

1422:     \plotter{f7.ps}

1423:     \caption{{\em Top:} Fraction of ellipticals at a given stellar mass that have

1424:     had $N$ major mergers from $z=0-6$. {\em Bottom:} Median (solid)

1425:     and $\pm1\sigma$ (dashed) range of redshifts of the last major merger

1426:     for ellipticals of a given stellar mass

1427:     (not the last gas-rich merger; where the number of mergers

1428:     $N\gtrsim2$, this last merger is often dry). Most of the mass in ellipticals (at least at

1429:     $L\lesssim$ a few $\lstar$)

1430:     is assembled by a relatively small number of mergers at relatively low redshifts

1431:     $z\sim0-2$. As a result, dissipational fractions roughly reflect those of disks in

1432:     this redshift interval.

1433:     \label{fig:diss.reasons}}

1434: \end{figure}

1435:

1436: This simple correspondence is expected, because most of the mass in ellipticals is

1437: assembled in a relatively small number of major mergers, at relatively low redshifts

1438: corresponding to this observed range ($z\sim0-2$). Figure~\ref{fig:diss.reasons}

1439: shows the typical number of mergers of ellipticals as a function of stellar mass,

1440: as well as the typical redshifts of the most recent major merger.

1441: The predicted trend is similar to our estimate in \citet{hopkins:groups.ell} as well

1442: as other cosmological models \citep[see e.g.][]{delucia:sam}; most ellipticals

1443: at $\lesssim L_{\ast}$ have had just one major merger, with even massive

1444: ellipticals at $\sim$a few $L_{\ast}$ still having only a small number of such mergers.

1445:

1446: In \citet{hopkins:groups.ell} we show that the fraction of systems with e.g.\

1447: only one major merger in their history versus multiple major mergers, or

1448: for which the last major merger was gas-rich, agrees well with the observed

1449: trend in the fraction of ellipticals with central cusps (power-law nuclear slopes),

1450: rapid rotation, or disky isophotal shapes as opposed to central cores, slow rotation,

1451: or boxy isophotal shapes \citep[indicators of wet versus dry mergers, discussed

1452: in detail in e.g.\ \papertwo\ and][]{faber:ell.centers,naab:gas,cox:kinematics}.

1453: The most recent major mergers, where the final stellar mass

1454: (at least $\sim1/2$ of it) is assembled, occurs at relatively low redshifts, with a

1455: median $z\sim1$ and spanning the range $z\sim0-2$. Of course, the most massive BCGs

1456: form a tail in this distribution with much more complex histories, and for massive galaxies

1457: undergoing $\gtrsim1-2$ major mergers, the most recent mergers are dry,

1458: but in any case the broad point remains: most of the $z=0$ mass in ellipticals is assembled

1459: in a relatively small number of major mergers at $z\lesssim2$, and therefore

1460: their dissipational fractions reflect those of disks over the same interval.

1461:

1462: This general conclusion (that major mergers with mass ratios

1463: $\lesssim$1:3, as opposed to more

1464: numerous minor mergers, dominate the

1465: mass assembly in mergers once gas is exhausted and

1466: star formation ``quenched'')

1467: is supported by calculations from

1468: other halo occupation models \citep{zheng:hod.evolution}

1469: and cosmological simulations \citep{maller:sph.merger.rates}.

1470: However, these calculations, as well as

1471: clustering estimates \citep{masjedi:cross.correlations} and

1472: hydrodynamic simulations \citep{naab:etg.formation}

1473: suggest that as galaxies approach the most massive $M_{\ast}\gtrsim 10^{12}\,M_{\sun}$

1474: BCG mass regimes, growth by a large series of minor mergers becomes more

1475: important than growth by major mergers.

1476:

1477: Roughly speaking, these

1478: calculations suggest that, since most of the mass is in $\sim L_{\ast}$ systems,

1479: mergers with such systems dominate mass growth: so when galaxies

1480: are at masses $\lesssim$ a few $L_{\ast}$, their growth is dominated

1481: by major mergers, and at higher masses, their growth is dominated by

1482: progressively more minor mergers.

1483: Our predictions in this limit should

1484: therefore be treated with some degree of caution, in contrast with

1485: the bulk of the spheroid population at $\lesssim L_{\ast}$

1486: \citep[typically observed in field or low-density environments,

1487: where the expectation for growth by a small number of major

1488: mergers since $z\sim2-4$ is reasonable; see][]{blanton:env,wang:sdss.hod,

1489: masjedi:merger.rates}. On the other hand, it is still unclear whether

1490: such minor mergers actually contribute significantly to mass growth

1491: even in the high-mass regime, or whether

1492: satellites are disrupted in this regime or kicked into orbits where the

1493: merger time is longer than the Hubble time \citep[see e.g.][]{hashimoto03:varying.culomb.log.in.dynfric,

1494: tormen:cluster.subhalos,benson:heating.model,zheng:hod.evolution,

1495: kazantzidis:mw.merger.hist.sim,brown:hod.evol}.

1496:

1497: We, in any case, have experimented with extending our modeling to include

1498: minor mergers, and find that this yields qualitatively similar

1499: conclusions (with moderate quantitative but no qualitative differences

1500: in the predicted

1501: properties of the most massive galaxies, and no significant differences otherwise).

1502: However, our predictions (and those of

1503: other halo-occupation based approaches) are less certain for

1504: minor mergers owing to ambiguities in e.g.\

1505: satellite disruption and weaker constraints on the satellite mass function at small

1506: mass ratios.

1507:

1508:

1509: \subsubsection{What Effects Does this Dissipation Have?}

1510: \label{sec:z0:diss:effects}

1511:

1512:

1513: How does this amount of dissipation change the properties of the $z=0$ ellipticals?

1514: We consider this question in detail in idealized simulations and observed ellipticals

1515: in \paperfour: in short, increasing the amount of dissipation yields remnants

1516: with more of their stellar mass in a compact central starburst remnant. This yields a

1517: stellar remnant with a smaller effective radius $R_{e}$, and therefore overall a more

1518: baryon-dominated object (lower $M_{\rm dyn}/M_{\ast}$). We demonstrate that these

1519: general conclusions remain both qualitatively and quantitatively valid in a more general

1520: scenario here. In order to compare the effects with and without dissipation, we

1521: re-run our model, but without including the effects of dissipation: this effectively amounts to

1522: treating gas identically to stars in mergers, or equivalently treating initial disks as if

1523: they are all gas-free.

1524:

1525: \begin{figure}

1526:     \centering

1527:     %\scaleup

1528:     %\plotone{MC_nodiss_fp_proj.ps}

1529:     \plotone{f8.ps}

1530:     \caption{Same predicted spheroid scaling laws at $z=0$ as Figure~\ref{fig:fp.proj},

1531:     but in a model with no dissipation (gas is treated the same as stars in mergers).

1532:     Without dissipation, the correlations are similar to those observed for disks,

1533:     with a too-shallow size-mass relation ($R_{e}\propto\mstar^{0.3}$).

1534:     Low-mass systems require dissipation to be as compact as observed -- other effects

1535:     (redshift evolution in disk sizes and dry mergers) are insufficient to reproduce this

1536:     effect, and (even tuned) cannot simultaneously reproduce the slope {\em and}

1537:     normalization of the size-mass relation.

1538:     \label{fig:fp.proj.nodiss}}

1539: \end{figure}

1540:

1541: \begin{figure}

1542:     \centering

1543:     %\scaleup

1544:     %\plotone{MC_nodiss_fp.ps}

1545:     \plotone{f9.ps}

1546:     \caption{The fundamental plane, as Figure~\ref{fig:fp}, with no dissipation.

1547:     Without dissipation, there is no tilt (the low-mass ellipticals here

1548:     have too-large an $M_{\rm dyn}/M_{\ast}$, related to their too-large sizes

1549:     in Figure~\ref{fig:fp.proj.nodiss}).

1550:     Fitting $\mdyn\propto\mstar^{1+\alpha}$, we obtain $\alpha \sim -0.1$ to $0$,

1551:     similar to disks, as opposed to $\alpha=0.2$ observed. In this projection,

1552:     dry mergers and redshift evolution in disk/halo sizes cannot give significant

1553:     tilt.  The pile-up in the lower-left panel is related to an effective upper limit

1554:     in the phase space density of disks/halos without dissipation.

1555:     \label{fig:fp.nodiss}}

1556: \end{figure}

1557:

1558:

1559: Figures~\ref{fig:fp.proj.nodiss} \&\ \ref{fig:fp.nodiss}

1560: plot the fundamental plane correlations of

1561: ellipticals at $z=0$, yielded from this alternative model.

1562: Low-mass ellipticals, without dissipation, now have sizes similar to those of their

1563: progenitor disks, and much larger than the sizes observed.

1564: The slope of the size-mass relation, $R_{e}\propto\mstar^{0.3}$, reflects that in

1565: progenitor disks and halos, and disagrees dramatically with

1566: the observed slope for ellipticals $R_{e}\propto\mstar^{0.6}$. As we show in

1567: \papertwo, the trend of increasing dissipation at low masses,

1568: owing to increasing gas fractions in lower-mass disks, drives smaller mass

1569: ellipticals to relatively lower masses, steepening the size-mass relation

1570: and giving rise to the trend in Figure~\ref{fig:fp.proj}. Other trends,

1571: such as e.g.\ the

1572: relation of dry mergers to mass, are insufficient to explain the observed

1573: size-mass relation (this is expected: dissipationless and dry mergers

1574: can only lower the density of objects; they can increase radii at large mass but

1575: cannot make low-mass ellipticals smaller or more dense as needed).

1576: Invoking evolution in disk sizes and high-redshift mergers can make

1577: ellipticals more compact overall, but affects the size-mass slope in the {\em opposite}

1578: sense required (since the highest-mass ellipticals will have their first mergers

1579: at the earliest times, they would be the most compact relative to disks today;

1580: the opposite is observed).

1581:

1582: This difference in our predicted size-mass relation is

1583: also reflected in the velocity dispersion-mass

1584: and fundamental plane relations, albeit more weakly, since the effective

1585: radius of the stars enters only indirectly: much of $\sigma$ is set by the potential

1586: of the halo, which is not altered much by dissipation. Nevertheless, the

1587: Faber-Jackson relation disagrees with that observed (again, in the sense

1588: that ellipticals are not dense enough), and the fundamental plane tilt is

1589: erased (we obtain $\mdyn\propto\mstar^{0.9-1.0}$, essentially identical to

1590: that observed in disks).

1591:

1592: These discrepancies are also apparent from the

1593: predictions in \citet{almeida:durham.ell.scalings},

1594: who use the semi-analytic models from both \citet{baugh:sam} and \citet{bower:sam}

1595: to predict galaxy properties in a similar manner as we have done,

1596: but treating gas identically to stars

1597: (in fact, their prescriptions for galaxy sizes and dark matter scalings

1598: are almost identical to ours in this ``no dissipation'' case). As a direct result, at $z=0$,

1599: their predicted size-mass relation is much too flat ($R_{e}\propto\mstar^{0.3}$)

1600: relative to that observed for

1601: spheroids, and the predicted fundamental plane has no tilt (in

1602: fact it is slightly tilted in the incorrect

1603: sense $M_{\rm dyn}\propto\mstar^{0.8-0.9}$).

1604: They consider the differences between the two semi-analytic models,

1605: as well as the inclusion or removal of adiabatic contraction in the

1606: galaxy halos, supernova and AGN

1607: feedback, self gravity in the baryons, and systematically higher or lower-energy orbits,

1608: and find that none of these significantly change the predicted fundamental

1609: plane scalings. In short, at $z=0$, dissipation (making galaxy stellar

1610: mass distributions more compact) is much more important and more significant to

1611: the fundamental plane correlations than any

1612: differences between various models for galaxy formation and evolution --

1613: only dissipation works as a viable explanation of the observed scalings.

1614:

1615:

1616:

1617: \begin{figure}

1618:     \centering

1619:     %\scaleup

1620:     %\plotone{MC_nodiss_ns.ps}

1621:     \plotone{f10.ps}

1622:     \caption{Correlation between stellar mass and estimated best-fit galaxy

1623:     Sersic index $n_{s}$, as Figure~\ref{fig:ns}, with no dissipation.

1624:     Without

1625:     dissipation, Sersic indices remain too low at all masses (central densities

1626:     cannot rise to the values observed) -- the Sersic indices fitted to the whole galaxy

1627:     are the same as those of just the dissipationless outer component ({\em bottom});

1628:     i.e.\ they can match the outer regions of elliptical profiles but fail to match the

1629:     central surface brightness of ellipticals (not dense enough at the center).

1630:     \label{fig:ns.nodiss}}

1631: \end{figure}

1632:

1633: The Sersic profiles in the absence of dissipation will be those of the

1634: outer/dissipationless components, already shown in Figure~\ref{fig:ns}

1635: (in this case, both the Sersic index of the entire profile and the Sersic index

1636: of just the dissipationless component will be the same, and be

1637: identical to that of just the dissipationless component in our full model). We

1638: directly compare this with the observed Sersic indices in Figure~\ref{fig:ns.nodiss}.

1639: Although this matches the outer profiles of observed systems, it clearly is

1640: not the same as the Sersic indices obtained fitting the observed galaxies to a

1641: single Sersic law over the entire dynamic range -- the high Sersic values (for

1642: the entire profile) $n_{s}\sim5-10$ observed can only be reproduced with

1643: some dissipation to make a dense central concentration of light (matching

1644: the steep rise at small radii of large-$n_{s}$ Sersic profiles).

1645: This ultimately points to the same issue as the size-mass relations: dissipation

1646: is needed to raise the central stellar densities in order to match observed

1647: correlations.

1648:

1649: Without dissipation, there is of course no way to grow the central

1650: black hole; there is no correlation between black hole mass and host galaxy

1651: properties for us to predict. We could in principle assume that black hole

1652: mass is dissipationlessly conserved in mergers, but initially set by disk masses --

1653: however, this would predict that it is disk, not bulge properties that set the

1654: black hole mass, in direct contradiction with observations. We could

1655: assume that $\mbh$ strictly traces the stellar mass or velocity dispersion

1656: in the bulge (or in the dissipationless component of the bulge),

1657: but as demonstrated in \citet{hopkins:bhfp.obs}, this fails to explain

1658: residual correlations between black hole mass and bulge binding energy

1659: at fixed $\mstar$ or $\sigma$.

1660:

1661:

1662: %\begin{figure}

1663: %    \centering

1664: %    %\scaleup

1665: %    \plotone{MC_nodiss_ns.ps}

1666: %    \caption{Correlation between stellar or black hole mass and estimated best-fit galaxy

1667: %    Sersic index $n_{s}$, as Figure~\ref{fig:ns}, with no dissipation. Without

1668: %    dissipation, Sersic indices remain too low at all masses (central densities

1669: %    cannot rise to the values observed).

1670: %    \label{fig:ns.nodiss}}

1671: %\end{figure}

1672: %Figure~\ref{fig:ns.nodiss} shows the Sersic indices

1673:

1674:

1675: \subsection{The Effects of Dry Mergers}

1676: \label{sec:z0:dry}

1677:

1678: Having analyzed the effects of dissipation, we now turn to the effects of dissipationless

1679: or dry mergers. Since essentially all disk-dominated galaxies

1680: do, in fact, have significant gas fractions,

1681: this in practice refers to subsequent spheroid-spheroid re-mergers. What effect do

1682: such mergers have on the $z=0$ scaling relations of ellipticals?

1683: In order to study this, we re-run our model, including dissipation

1684: but excluding such dry mergers

1685: (we have also considered allowing them to occur in the sense that they

1686: add stellar mass, but not allowing them to alter the structural properties of the

1687: galaxy; the results are similar).

1688:

1689:

1690: \begin{figure}

1691:     \centering

1692:     %\scaleup

1693:     %\plotone{MC_nodry_fp_proj.ps}

1694:     \plotone{f11.ps}

1695:     \caption{Same predicted spheroid scaling laws at $z=0$ as Figure~\ref{fig:fp.proj},

1696:     but in a model with no dry mergers (structural change in

1697:     spheroid-spheroid mergers is suppressed).

1698:     Dry mergers do not affect the predictions

1699:     at low mass, but at high mass ($\mstar\gg 10^{11}\,\msun$),

1700:     a lack of dry mergers means that ellipticals are not as large as observed.

1701:     The sense of residual correlations at high mass is now reversed -- the oldest systems

1702:     at fixed mass or $\sigma$ are now the smallest, in contrast to observations

1703:     \citep{gallazzi06:ages,graves:prep}

1704:     \label{fig:fp.proj.nodry}}

1705: \end{figure}

1706:

1707: \begin{figure}

1708:     \centering

1709:     %\scaleup

1710:     %\plotone{MC_nodry_fp.ps}

1711:     \plotone{f12.ps}

1712:     \caption{Fundamental plane, as Figure~\ref{fig:fp}, with no dry mergers.

1713:    The difference is minimal (although there is slightly less tilt),

1714:    since dry mergers tend to move systems

1715:    parallel to the FP.

1716:     \label{fig:fp.nodry}}

1717: \end{figure}

1718:

1719:

1720: Figures~\ref{fig:fp.proj.nodry} \&\ \ref{fig:fp.nodry} plot the fundamental plane correlations of

1721: ellipticals at $z=0$ from this alternative model. At intermediate and low

1722: masses, the number of mergers in our full model is small (see Figure~\ref{fig:diss.reasons}),

1723: and dissipation is dominant, so the correlations predicted here are essentially

1724: identical to those in Figures~\ref{fig:fp.proj} \&\ \ref{fig:fp}. At high masses or

1725: luminosities ($\gtrsim$a few $\lstar$), however, there is an appreciable effect

1726: on the projections of the FP. The highest mass systems have their first mergers at early times,

1727: when their progenitors are more gas-rich, and are therefore compact when first formed.

1728: Without dry mergers to puff them up at later times, they do not have effective radii

1729: as large as observed. The difference in the scaling laws over the dynamic range observed

1730: in Figure~\ref{fig:fp.proj.nodry} is subtle, becoming dramatic only at the highest masses.

1731:

1732: There is, however, a significant effect in the residual correlations with e.g.\ size and

1733: merger history at fixed stellar mass or velocity dispersion. Without dry mergers,

1734: then (because of the same effect), the oldest systems at fixed mass would be the

1735: smallest. This is the opposite of observed trends

1736: \citep{gallazzi06:ages,graves:prep}: older BCGs

1737: are extremely clustered, and so (unless CDM models are seriously incorrect)

1738: had their first mergers at early times, where spheroids are observed to be

1739: compact (see \S~\ref{sec:evol}). The observations that e.g.\ older

1740: BCGs tend to be the most extended systems

1741: \citep{vonderlinden:bcg.scaling.relations,

1742: bernardi:bcg.scalings}

1743: therefore demands some dry mergers

1744: in their history \citep{hausman:mergers,hernquist:phasespace}.

1745: However, as shown in Figure~\ref{fig:diss.reasons}, the number of

1746: such dry mergers (especially at $z\lesssim1-2$) is still relatively small, so

1747: this is consistent with the $\sim1$ major dry merger since $z\sim1$ which direct observations

1748: of dry merging pairs \citep{vandokkum:dry.mergers,bell:dry.mergers,lin:mergers.by.type}

1749: and constraints from mass function evolution

1750: \citep{bundy:mfs,borch:mfs,perezgonzalez:mf.compilation,

1751: brown:mf.evolution,hopkins:transition.mass,zheng:hod.evolution} suggest.

1752:

1753: The fundamental plane itself is less affected by dry mergers. This is not surprising;

1754: numerical simulations demonstrate that both major and

1755: minor dry mergers tend to move systems parallel to the fundamental plane

1756: \citep[see \paperfour\ and][]{boylankolchin:mergers.fp,robertson:fp}. Where an individual

1757: galaxy ends up on the FP is therefore affected by its dry merger history, but

1758: the FP itself is not very much altered. There is a very weak effect because of the

1759: evolution in the FP discussed in \S~\ref{sec:evol}, but it is much less apparent

1760: here than in e.g.\ the size-mass relation.

1761:

1762: \begin{figure}

1763:     \centering

1764:     %\scaleup

1765:     %\plotone{MC_nodry_ns.ps}

1766:     \plotone{f13.ps}

1767:     \caption{Correlation between stellar mass and estimated best-fit galaxy

1768:     Sersic index $n_{s}$, as Figure~\ref{fig:ns}, with no dry mergers. Without

1769:     dry mergers, Sersic indices

1770:     remain flat at the values characteristic of gas-rich merger

1771:     remnants. The large Sersic indices at high masses (reflecting the extended envelopes

1772:     of the most massive ellipticals) are not reproduced.

1773:     \label{fig:ns.nodry}}

1774: \end{figure}

1775:

1776: Figure~\ref{fig:ns.nodry} shows the Sersic indices as a function of mass

1777: in the absence of dry mergers. Without repeated dry mergers to scatter

1778: stars at large radii and build up a more extended envelope, the profile

1779: of the outer/dissipationless component is self-similar at all masses (failing

1780: to reproduce the weak, but significant dependence observed). Fitting the

1781: entire profile to a single Sersic index, the effects of dissipation still

1782: contribute to a dependence of this index on mass (again emphasizing that

1783: with such a single-component fit, the Sersic index does not robustly track merger

1784: history as it does in a two-component decomposition).

1785: Both the effects of dry mergers and dissipation (in particular

1786: combinations) are needed to reproduce the observed trends.

1787:

1788:

1789: \begin{figure}

1790:     \centering

1791:     %\scaleup

1792:     %\plotone{MC_nodry_bh_corr.ps}

1793:     \plotone{f14.ps}

1794:     \caption{Correlation between black hole mass and host spheroid properties,

1795:     as Figure~\ref{fig:mbh}, with no dry mergers. There is fairly little effect here --

1796:     dry mergers conserve both spheroid binding energy and total black hole mass,

1797:     so move systems parallel on the black hole fundamental plane correlations.

1798:     The (weak) change along projected correlations (e.g.\

1799:     $M_{\rm BH}-\sigma$) reflects the small structural changes of the

1800:     remnants in Figure~\ref{fig:fp.proj.nodry}.

1801:     \label{fig:mbh.nodry}}

1802: \end{figure}

1803:

1804: Figure~\ref{fig:mbh.nodry} shows the correlations between

1805: BH mass and host properties in this model. The

1806: effects here are quite weak.

1807: Since stellar mass and black hole mass are conserved in dry mergers

1808: (and the projected correlation is roughly $M_{\rm BH}\propto\mstar$) there

1809: is almost no change in that correlation with or without dry mergers.

1810: Total spheroid binding energy is also conserved (which leads to

1811: velocity dispersion being nearly constant) in dry mergers, but the

1812: correlation here is not linear (although it is the more fundamental correlation

1813: in our models). If we assume the ``initial'' correlation is the

1814: power-law $M_{\rm BH}\propto E_{\rm bul}^{0.7}$ (or $M_{\rm BH}\propto \sigma^{4}$

1815: for the $M_{\rm BH}-\sigma$ relation), then (assuming $\mstar\propto\sigma^{4}$,

1816: which gives

1817: $E_{\rm bul}\propto \mstar^{3/2}$) after a merger of mass ratio

1818: $f$ ($f\le1$) two systems initially on the correlation will move a

1819: factor $(1+f)/(1+f^{3/2})^{0.71}$ off the correlation (likewise for

1820: the $M_{\rm BH}-\sigma$ relation.

1821: This amounts to a $\sim20\%$ ($\sim0.1$\,dex) effect in major mergers --

1822: so we expect (given the $\sim0.3$\,dex intrinsic scatter in the correlations)

1823: it will only be noticeable at the most massive end where systems

1824: have undergone a large number of dry mergers.

1825: We do see this, but over the dynamic range of the present data,

1826: there is little distinction that can be clearly drawn -- the scatter

1827: in various properties (and the fact that e.g.\ neither $M_{\rm BH}-\sigma$

1828: nor $M_{\rm BH}-\mstar$ is the most fundamental correlation in this model)

1829: mostly washes it out.

1830: In terms of the correlations with effective radius $R_{e}$,

1831: there are more noticeable changes, directly reflecting

1832: the different size-mass relation in Figure~\ref{fig:fp.proj.nodry}.

1833:

1834:

1835: \breaker

1836: \section{Evolution of Scaling Laws as a Function of Redshift}

1837: \label{sec:evol}

1838:

1839: We have shown how dissipation and dry mergers alter the correlations of

1840: ellipticals at $z=0$. Of course, gas fractions of their progenitors evolve with

1841: redshift, and dry mergers preferentially occur at low redshift. In addition, other

1842: properties of the progenitors, such as disk and halo sizes, will evolve.

1843: We therefore expect there could be evolution in these scaling laws, and

1844: make predictions for this here. We adopt our standard model, but construct

1845: samples as they would be observed at a given non-zero redshift.

1846:

1847: \subsection{Predicted Evolution In Our Standard Model}

1848: \label{sec:evol:pred}

1849:

1850: \subsubsection{Spheroid Sizes}

1851: \label{sec:evol:sizes}

1852:

1853: \begin{figure}

1854:     \centering

1855:     %\scaleup

1856:     %\plotone{r_z_mbins.ps}

1857:     \plotone{f15.ps}

1858:     \caption{Predicted evolution in spheroid sizes with redshift. We show the

1859:     predicted median size of spheroids at a given stellar mass, relative to the

1860:     median size of objects of that mass at $z=0$. Shaded range is the uncertainty owing

1861:     to both cosmology and how rapidly progenitor (disk and halo) properties evolve.

1862:     Solid lines are simple power-law fits (Equation~\ref{eqn:re.z}).

1863:     Dashed line shows the power-law fit corrected (approximately) to the $R_{e}$

1864:     that would be measured in optical/UV bands (as opposed to the half-mass radius:

1865:     stellar population effects tend to make very young ellipticals with

1866:     recent central starbursts more concentrated

1867:     in short-wavelength light).

1868:     Dotted line is

1869:     the prediction from the semi-analytic model of \citet{khochfar:size.evolution.model}.

1870:     Observational estimates for galaxies in the appropriate stellar mass intervals

1871:     are shown from

1872:     \citet[][circles]{mcintosh:size.evolution}, \citet[][stars]{trujillo:size.evolution},

1873:     \citet[][square]{zirm:drg.sizes}, and \citet[][diamond]{vandokkum:z2.sizes}.

1874:     Progenitors are more gas-rich

1875:     at high-$z$, yielding mergers with more dissipation and therefore smaller remnants.

1876:     At high masses, low-$z$ disks are gas poor (and dry mergers have been more important),

1877:     so the relative difference is larger; at low masses, there is little room for gas fractions

1878:     to increase, hence weak evolution.

1879:     \label{fig:reff.z}}

1880: \end{figure}

1881:

1882: Figure~\ref{fig:reff.z} shows how the median effective radii of spheroids

1883: at fixed stellar mass are predicted to evolve with redshift. We vary the assumptions

1884: described in \S~\ref{sec:model}, most importantly (other uncertainties being

1885: relatively negligible at the redshifts of interest here) whether (and how strongly,

1886: within the observational limits) disk sizes gas fractions evolve with redshift,

1887: and show the resulting allowed range. We compare with observations of

1888: the spheroid size-mass relation from \citet{trujillo:size.evolution},

1889: \citet{mcintosh:size.evolution}, \citet{vandokkum:z2.sizes}

1890: and \citet{zirm:drg.sizes} \citep[see also][]{franx:size.evol,damjanov:red.nuggets}.

1891: Although the observational

1892: constraints are weak, they agree with our predictions at redshifts

1893: $z\sim0-3$.

1894:

1895: Evolution in galaxy gas fraction, which directly translates to evolution in the amount

1896: of dissipation in mergers, drives this behavior.

1897: This is expected -- in \S~\ref{sec:z0:diss},

1898: and in greater detail in \papertwo, we show that the degree of

1899: dissipation is the dominant factor determining the stellar effective radius.

1900: Disk gas fractions are larger at high redshift, so in mergers, a larger mass fraction will

1901: be formed in a compact dissipational starburst, yielding a smaller stellar remnant.

1902: Dry mergers have also had less time to act and puff up remnants at high redshift.

1903: If disks are more compact at high redshift, this drives further evolution; however,

1904: observational constraints suggest that this is relatively weak

1905: (see \S~\ref{sec:model:hod}), and so it is less

1906: important than the evolution in gas fractions.

1907:

1908: This also predicts that the relative size evolution

1909: should be stronger in high-mass systems.

1910: Fitting these predictions to simple scaling laws of the form

1911: \begin{equation}

1912: \langle R_{e}(M_{\ast}\,|\,z) \rangle = (1+z)^{-\beta}\,\langle R_{e}(M_{\ast}\,|\,z=0) \rangle

1913: \label{eqn:re.z}

1914: \end{equation}

1915: we obtain $\beta=(0.0-0.2,\,0.24,\,0.48,\,0.64)$ ($\pm0.05$ or so for each) for

1916: $M_{\ast}\sim(10^{9},\,10^{10},\,10^{11}\,10^{12})\,\msun$

1917: (roughly $\beta\approx {\rm MAX}[0.23\,\log{(M_{\ast}/10^{9}\,\msun),\ 0.2]}$).

1918: % fit to exp(-z^beta) -> 0.00, 0.11, 0.23, 0.31 %

1919: Low mass disks are still gas-rich at $z=0$ (and have generally

1920: not experienced many dry mergers), so there is limited room for

1921: evolution (the evolution we do predict is almost entirely driven by

1922: whatever evolution in progenitor disk sizes is assumed, hence the

1923: quoted range $0.0-0.2$ in the lowest-mass bin).

1924: At high mass, however, dry mergers have been increasing spheroid sizes

1925: for a significant redshift interval. Furthermore, disks at high mass at low redshift

1926: are relatively gas-poor; by $z\sim3$, they have doubled or tripled their gas

1927: fractions \citep{erb:lbg.gasmasses}. The relative size

1928: difference is therefore much more pronounced.

1929:

1930: These predictions are similar to those made by \citet{khochfar:size.evolution.model},

1931: who adopted a full semi-analytic model to estimate the history of disk galaxies,

1932: accretion, and star formation, and then

1933: modeled disk sizes by assuming effective radius is proportional to

1934: the mass fraction in the dissipationless (non-starburst) component:

1935: $R_{e}(M_{\ast},\,z)=R_{e}(M_{\ast},\,0)\,\times\,(f_{\rm dissipationless}[z]/f_{\rm dissipationless}[0])$.

1936: For comparison, we show their predictions in Figure~\ref{fig:reff.z}.

1937: At intermediate and low masses, they are similar to ours.

1938: This is because their simple model captures the key qualitative aspect which drives

1939: our predicted evolution, namely the evolution in disk gas fractions with redshift,

1940: making high-redshift ellipticals more dissipational than their low-redshift counterparts.

1941: Ultimately, their determination of the size evolution with dissipational fraction is

1942: not a bad approximation over intermediate gas fractions (to the accuracy here).

1943: However, there is a significant difference in the predictions at the highest masses --

1944: \citet{khochfar:size.evolution.model} predicted much stronger evolution than we do. This is

1945: the regime where our attempts to improve on their estimates are of particular importance.

1946:

1947: First, their predictions are tied to some assumptions and modeling of disk galaxies

1948: in their semi-analytic model; this is precisely why we have tried to be as empirical

1949: as possible in constructing our progenitor galaxies. They

1950: note that their strong evolution in this mass bin is the

1951: result of a high-redshift overcooling problem in

1952: describing massive galaxies in their

1953: semi-analytic model (a well-known difficulty without strong feedback).

1954: As a consequence, their disk gas fraction evolution to $z\sim2$ is much steeper than observed --

1955: i.e.\ ``progenitor'' disks at these masses and redshifts are demonstrably too gas-rich and

1956: in halos of too low a mass. \citet{erb:lbg.gasmasses} see typical gas

1957: fractions for $M_{\ast}\gtrsim10^{11}\,\msun$

1958: galaxies at $z\sim2$ of $\sim20\%$ (whereas \citet{khochfar:size.evolution.model}

1959: predict $\sim70-80\%$)

1960:  -- even if the ellipticals observed at $z=2$ formed from

1961: higher-redshift mergers with gas fractions say, double this value, this predicts

1962: factor $\lesssim2$ size evolution at $z=2$, similar to our estimates. By using a halo

1963: occupation approach, we avoid this problem.

1964:

1965: Second,

1966: our size estimates are calibrated directly to numerical simulations: we track the

1967: evolution of multiple galaxy components in merger simulations and use

1968: this to design our prescriptions. Of particular importance, we include the effects of

1969: dry mergers appropriately and extrapolate our prescriptions to arbitrary dissipational

1970: fractions: at high $f_{\rm dissipational}$, the simple size estimator from

1971: \citet{khochfar:size.evolution.model} clearly must break down, as galaxies will still have

1972: finite sizes. This direct calibration with simulations

1973: avoids ambiguity in e.g.\ degeneracies between

1974: cosmological effects and assumptions about how spheroid sizes scale, and

1975: allows for more robust predictions.

1976:

1977: Nevertheless the reason for our predicted evolution, and its sense,

1978: is essentially the same as that identified in \citet{khochfar:size.evolution.model}.

1979: Ellipticals at

1980: high redshift have, on average, formed a larger fraction of their mass in dissipational

1981: starbursts (owing to their progenitors being, on average, more gas-rich)

1982: at these redshifts. Dissipation allows this component to be significantly more compact

1983: than the dissipationlessly violently relaxed stars from the progenitor disks,

1984: giving rise to a more compact remnant.

1985:

1986:

1987: At the highest masses ($\gg 10^{11}\,\msun$), the evolution we predict is somewhat weaker

1988: than that observationally inferred by

1989: \citet{zirm:drg.sizes,vandokkum:z2.sizes,franx:size.evol}. However, as demonstrated in \paperone,

1990: this can be explained by

1991: an additional bias -- if systems at high $z$ are very young

1992: (ages $\lesssim0.5-1\,$Gyr)

1993: remnants of

1994: gas-rich mergers \citep[as we predict precisely these systems are,

1995: and as observations of their stellar populations suggest, see][]{kriek:drg.seds} and observed in

1996: even rest-frame optical wavelengths (let along rest-frame UV),

1997: the gradients in $M_{\ast}/L$ introduce a bias towards smaller $R_{e}$

1998: \citep[there may also be some related, but weak, biases in these stellar population parameters

1999: and stellar masses, see e.g.][]{wuyts:irac.drg.colors}.

2000: The younger stellar population from the starburst -- i.e.\ the dissipational

2001: component -- is significantly brighter in $B$-band at ages $\lesssim1\,$Gyr,

2002: so this dominates the fit, yielding a smaller $R_{e}$ (equivalent

2003: to artificially further increasing the dissipational mass fraction -- this is why

2004: it actually looks as if the \citet{khochfar:size.evolution.model} prediction provides an

2005: acceptable match to these objects).

2006:

2007: In line with this expectation, the sizes

2008: we obtain if we ignore the dissipationless components of our predicted

2009: $z>2-3$ systems are consistent with the observations

2010: from \citet{zirm:drg.sizes,vandokkum:z2.sizes,franx:size.evol,damjanov:red.nuggets}.

2011: Essentially, this effect can yield a bias of an additional factor $\sim2$

2012: (see Figure~20 in \paperone) for gas-rich systems.

2013: It will not be a problem at lower redshifts, both because systems are older

2014: (the effect is negligible at starburst ages $>1\,$Gyr, or even earlier, as

2015: induced metallicity gradients from the starburst can partially cancel it out),

2016: and because gas fractions are lower (we expect significant bias only

2017: for $f_{\rm dissipational} \gtrsim 0.3-0.4$). Furthermore, there is no bias {\em within}

2018: the starburst light (there is not a strong gradient within the starburst itself),

2019: so for low-mass systems (which are largely dissipational at every redshift),

2020: there is no significant effect. It is precisely for

2021: large mass systems at high redshift

2022: that we expect this to be an important concern, and measurements that could

2023: ultimately estimate e.g.\ color gradients and stellar population gradients in

2024: these systems will be necessary for more detailed constraints.

2025:

2026:

2027: \subsubsection{Velocity Dispersions}

2028: \label{sec:evol:sigma}

2029:

2030: \begin{figure}

2031:     \centering

2032:     %\scaleup

2033:     %\plotone{s_z_mbins.ps}

2034:     \plotone{f16.ps}

2035:     \caption{Predicted evolution in spheroid velocity dispersions with redshift

2036:     (style as Figure~\ref{fig:reff.z}). Despite significant evolution in $R_{e}$ at

2037:     fixed mass, the role of dark matter halos (which evolve more weakly) in setting

2038:     $\sigma$ yields much less evolution in $\sigma$ for the same systems.

2039:     Lines are the simple prediction given the fitted scaling of

2040:     $R_{e}$ with $z$ (Figure~\ref{fig:reff.z}) and Equation~(\ref{eqn:sigma.z}).

2041:     \label{fig:sigma.z}}

2042: \end{figure}

2043:

2044: Figure~\ref{fig:sigma.z} shows the corresponding evolution in velocity

2045: dispersion with redshift. The evolution is much weaker than might be

2046: intuitively expected: if $\sigma^{2}\sim G\,M/R_{e}$, then the factor

2047: $\sim3$ evolution in size in massive systems in Figure~\ref{fig:reff.z} would

2048: translate to factor $\sim\sqrt{3}$ evolution in $\sigma$. Instead, we predict

2049: such systems have roughly the same or at most a factor $\sim1.3$ larger

2050: $\sigma$. This is because, at $z=0$, dark matter plays an important role

2051: in setting the velocity dispersion -- the potential at $r=0$ has a contribution

2052: $\sim G\, \mhalo/R_{\rm halo}$ (here $R_{\rm halo}$ is the effective radius of

2053: the halo, {\em not} the virial radius; $R_{\rm halo}\sim R_{\rm vir}/c$, where $c$

2054: is the halo concentration), which is comparable to or larger than

2055: $\sim G\,\mstar/R_{e}$

2056: in most systems (although $R_{\rm halo}\gg R_{e}$,

2057: $M_{\rm halo}\gg \mstar$ by about the same factor). This is especially true

2058: for the high-mass ellipticals (for which the evolution in $R_{e}$ is strongest)

2059: which, at $z=0$, are the most dark-matter dominated

2060: within their stellar $R_{e}$.

2061: At a given halo mass, $R_{\rm halo}$ evolves weakly -- the evolution in halo

2062: concentrations

2063: almost completely offsets the evolution in virial radii \citep[this reflects

2064: the fact seen in most simulations that massive halos build inside out -- the

2065: central potential is set first and then the outer halo builds up;

2066: see][]{bullock:concentrations, wechsler:concentration},

2067: so assuming the ratio $\mstar/\mhalo$ does not

2068: evolve much with redshift (as inferred in most observations and

2069: halo occupation models; see \S~\ref{sec:model:cosmology}), then the contribution of the halo to

2070: $\sigma$ ($\sigma_{\rm halo}$) evolves weakly at fixed $\mstar$. If anything, it

2071: can decrease -- if the baryonic component is more compact at high-$z$,

2072: then the enclosed dark matter mass in the stellar $R_{e}$ is smaller.

2073:

2074: Roughly, then, if we consider the observed velocity dispersion to reflect a

2075: galaxy potential and halo potential which add linearly, we obtain

2076: $\sigma^{2}\propto (\mstar/R_{e} + \mhalo/R_{\rm halo})$. If we consider

2077: systems of fixed $\mstar$, and $\mhalo(\mstar)$ and $R_{\rm halo}(\mhalo)$

2078: evolve weakly, then we obtain

2079: \begin{equation}

2080: \frac{\langle\sigma(z\, |\,\mstar)\rangle}{\langle\sigma(0\, |\,\mstar)\rangle}

2081: =\frac{1}{\sqrt{1+\gamma}}

2082: \,\sqrt{\gamma + \frac{\langle R_{e}(0) \rangle}{\langle R_{e}(z) \rangle}},

2083: \label{eqn:sigma.z}

2084: \end{equation}

2085: where $\gamma\equiv (\mhalo/R_{\rm halo})/(\mstar/R_{e})\sim 1-2$ is the

2086: relative fraction of the central potential contributed by the dark matter at $z=0$.

2087: For the values of $\gamma$ we estimate ($\sim1$ at $\mstar\sim10^{11}\,\msun$

2088: and $\sim2$ at $\mstar\sim10^{12}\,\msun$), this simple expectation fits

2089: the observed trends well.

2090:

2091:

2092: \subsubsection{The Fundamental Plane}

2093: \label{sec:evol:fp}

2094:

2095: \begin{figure}

2096:     \centering

2097:     %\scaleup

2098:     %\plotone{md_ms_z_mbins.ps}

2099:     \plotone{f17.ps}

2100:     \caption{Predicted evolution in the ratio of dynamical mass estimator

2101:     $\mdyn=k\,\sigma^{2}\,R_{e}/G$ to stellar mass $\mstar$ with redshift

2102:     (as Figure~\ref{fig:reff.z}).

2103:     Comparing Figures~\ref{fig:reff.z} \&\ \ref{fig:sigma.z}, the evolution is

2104:     primarily driven by evolving $R_{e}$ -- more dissipational, compact

2105:     stellar remnants are increasingly baryon-dominated inside their

2106:     stellar $R_{e}$. Again, low-mass systems are already highly dissipational

2107:     and baryon-dominated inside $R_{e}$ at $z=0$, so there is little room

2108:     for evolution.

2109:     \label{fig:mdyn.z}}

2110: \end{figure}

2111:

2112: Combining these trends, we anticipate the evolution in $\mdyn/\mstar$ shown

2113: in Figure~\ref{fig:mdyn.z}. Since $\mdyn\equiv k\,\sigma^{2}\,R_{e}/G$,

2114: the stronger evolution in $R_{e}$ largely drives the evolution in $\mdyn$.

2115: We emphasize that this reflects a real difference in the enclosed matter

2116: within the stellar $R_{e}$ (as we have defined it, there are no significant structural or

2117: kinematic non-homology effects). The decrease in $\mdyn$ at fixed $\mstar$ in

2118: the highest-mass systems is a consequence of the fact that at low-$z$, they have

2119: small dissipational content and have larger stellar $R_{e}$, so have larger

2120: dark matter masses enclosed in $R_{e}$ (and thus higher enclosed mass in $R_{e}$).

2121: At high-$z$, their progenitors are gas-rich, so they are formed in highly dissipational

2122: mergers and have compact stellar distributions, which enclose less dark matter

2123: in the stellar $R_{e}$.

2124:

2125: \begin{figure}

2126:     \centering

2127:     %\scaleup

2128:     %\plotone{fp_z.ps}

2129:     \plotter{f18.ps}

2130:     \caption{Best-fit fundamental plane ($\mdyn\propto\mstar^{1+\alpha}$, where

2131:     $\alpha$ is the FP tilt) as a function of redshift.

2132:     The evolution in Figure~\ref{fig:mdyn.z}

2133:     is reflected here: at high-$z$, even large-mass disks are gas-rich so high-mass

2134:     systems experience significant dissipation, bringing their stellar $R_{e}$ in and

2135:     making the remnant more baryon-dominated within $R_{e}$ (lowering the

2136:     dark matter fraction and therefore $\mdyn$ within $R_{e}$, at fixed $\mstar$).

2137:     The tilt therefore weakens slightly with redshift (there is

2138:     less difference in the degree

2139:     of dissipation -- the driver of tilt -- at high-$z$). Solid and dashed lines

2140:     allow (do not allow) for progenitor disk size and stronger (weaker)

2141:     disk gas fraction evolution, bracketing the range allowed by observations

2142:     (representing the uncertainty in our predictions).

2143:     \label{fig:fp.z}}

2144: \end{figure}

2145:

2146: Figure~\ref{fig:fp.z} shows the implications of this evolution for the FP

2147: as a function of redshift.

2148: At low redshift, the observed tilt ($\alpha\approx0.2$) is recovered. At high redshift,

2149: however, all progenitors are gas-rich,

2150: so there is not much difference in the dissipational content at low and high masses -- meaning

2151: their compactness (relative to e.g.\ their dark matter halos or progenitor disks) is no longer a

2152: strong function of mass. The high-mass systems, being nearly as dissipational as low-mass

2153: systems, are similarly baryon-dominated in their stellar $R_{e}$, and there is

2154: less tilt.

2155:

2156: At the level predicted here, it may be difficult to observe the predicted FP evolution.

2157: Robust velocity dispersions and effective radii in the same rest-frame band (avoiding a

2158: bias towards smaller $R_{e}$ as observations probe closer to the rest-frame UV) may be

2159: obtainable in large samples at $z\lesssim1$, but the predicted evolution in that interval

2160: is weak -- within the systematic uncertainties at $z\sim0$.

2161:

2162: This is consistent with observational

2163: constraints in this redshift interval from

2164: weak lensing \citep{heymans:mhalo-mgal.evol}

2165: and optical studies \citep[][]{alighieri:fp.evolution,

2166: treu:fp.evolution,vanderwel:fp.evolution,vandokkum:fp.evol,brown:hod.evol}. These observations do see

2167: evolution in optical bands, (in the opposite sense to that predicted here, namely an increasing

2168: apparent tilt), but find that this owes to stellar population effects

2169: \citep{alighieri:fp.evolution} -- the evolution observed in terms of $\mdyn$ and $\mstar$ is

2170: either negligible or slightly negative (as predicted here).

2171: Any model where the stellar populations of

2172: lower-mass ellipticals are systematically younger

2173: (as is predicted here; see \citet{hopkins:groups.ell} for details) will predict the

2174: observed trend in optical bands: if, say, a

2175: low-mass system at $z=0$ has a stellar population age of $\sim7\,$Gyr whereas a

2176: high-mass system has an age of $\sim12\,$Gyr, as observed

2177: \citep[see e.g.][]{trager:ages,nelan05:ages,thomas05:ages,gallazzi06:ages},

2178: then inverting passive evolution to $z=0.7$, the low-mass (younger) system would have

2179: a lower $M_{\ast}/L_{B}$ by a factor $\gtrsim10$, whereas the high-mass system

2180: would have increased $M_{\ast}/L_{B}$ by a factor of just $\approx2$. It is therefore

2181: challenging to observe the weak evolution here ($\sim0.1$\,dex), given expected

2182: stellar population effects of magnitude $\sim0.5-1$\,dex.

2183:

2184: However, this degree of evolution is important for a number of subtle effects:

2185: for example, \citet{vandokkum:imf.evol} uses the evolution of the optical FP with redshift to

2186: infer evolution in elliptical mass-to-light ratios (under the assumption that

2187: the physical -- i.e.\ stellar mass -- FP is invariant). This is broadly fine, as the optical

2188: $M/L$ for any reasonable stellar population age evolves much more rapidly than

2189: the physical FP evolution in Figure~\ref{fig:fp.z}. But the authors then compare this

2190: change in $M/L$ with time to the mean colors of ellipticals as a function of time,

2191: and argue that the relation between the two suggests evolution in the stellar

2192: initial mass function.

2193: At this level (as the authors acknowledge), the comparison is sensitive to

2194: evolution in the physical (stellar mass) FP at the $\sim0.1\,$dex level -- if massive galaxies

2195: at $z\sim1$ have lower $M_{\rm dyn}/M_{\ast}$ by $\sim0.1\,$\,dex (similar to what we

2196: predict in Figure~\ref{fig:mdyn.z}) this will bias the inferred $M/L$ ratios

2197: (under the assumption of no physical FP evolution), but not, obviously the galaxy colors,

2198: by an amount comparable to the significance of the effect seen (in other words, allowing

2199: for this evolution, the evidence for any evolution in the stellar IMF may be less strong).

2200:

2201: At this level of detail, physical evolution in the FP

2202: in the manner predicted here may change this comparison, and should be

2203: considered in future, more detailed comparisons with observations. However,

2204: the effects here may not be important so long as the observed sample is

2205: appropriately selected: \citet{vandokkum:imf.evol} focus on satellites in rich

2206: clusters, which are not likely to undergo significant subsequent merging

2207: (especially as compared to the central galaxies modeled here, particularly those

2208: in moderately dense field and group environments that contribute

2209: much of the ``lever arm'' to the FP tilt evolution predicted). This emphasizes the

2210: importance of appropriate sample selection when comparing these quantities,

2211: and the need for further study to determine exactly how our predictions

2212: generalize to populations with more complex or distinct histories (such as

2213: satellites) and therefore pertain to the observations.

2214:

2215:

2216: \subsubsection{Black Hole Masses}

2217: \label{sec:evol:bh}

2218:

2219: \begin{figure}

2220:     \centering

2221:     %\scaleup

2222:     %\plotone{mbh_ms_z_mbins.ps}

2223:     \plotone{f19.ps}

2224:     \caption{Predicted evolution in $\mbh$ at fixed stellar mass with redshift

2225:     (style as Figure~\ref{fig:reff.z}). Red circles with arrows show the upper limits

2226:     from observed evolution in spheroid mass

2227:     density in \citet{hopkins:old.age} \citep[see also][]{merloni:magorrian.evolution},

2228:     applicable to $\sim M_{\ast}$

2229:     galaxies. Blue squares show the observational estimates of evolution for

2230:     massive black holes and galaxies from \citet{peng:magorrian.evolution}. More dissipation

2231:     at high redshifts means more work for the black hole to do before its

2232:     growth self-regulates, giving rise to larger $M_{\rm BH}$ (for fixed

2233:     feedback efficiency); however, the net effect is relatively weak.

2234:     Selection effects \citep{lauer:mbh.bias} may explain

2235:     the small difference between $M_{\rm BH}/M_{\ast}$ observed and predicted.

2236:     The trend is mass-dependent in the same manner

2237:     as Figure~\ref{fig:reff.z}.

2238:     \label{fig:mbh.ms.z}}

2239: \end{figure}

2240:

2241: \begin{figure}

2242:     \centering

2243:     %\scaleup

2244:     %\plotone{mbh_sig_z_mbins.ps}

2245:     \plotone{f20.ps}

2246:     \caption{Predicted evolution in $\mbh$ at fixed velocity dispersion

2247:     $\sigma$ with redshift (style as Figure~\ref{fig:reff.z}).

2248:     The complex interplay between evolution in $M_{\rm BH}$, $R_{e}$,

2249:     and $\sigma$ yields weak trends here that are not necessarily

2250:     monotonic. In general the predicted $M_{\rm BH}/\sigma$ evolution is

2251:     weaker than (or inverse to) that in $M_{\rm BH}/ M_{\ast}$,

2252:     but at $z\sim2-3$ massive black holes may be overmassive

2253:     relative to $\sigma$ by a factor $\lesssim2-3$ (owing to the evolution

2254:     in $M_{\rm BH}/ M_{\ast}$ and weak evolution in $\sigma/M_{\ast}$).

2255:     \label{fig:mbh.sigma.z}}

2256: \end{figure}

2257:

2258: Figure~\ref{fig:mbh.ms.z} \&\ \ref{fig:mbh.sigma.z} show the evolution in

2259: BH masses at fixed stellar mass and velocity dispersion, respectively.

2260: Because systems are more dissipational at high redshifts, the binding energy of

2261: the baryonic material which must be supported or expelled to halt accretion is

2262: higher (at fixed stellar mass). As a consequence, the typical BH masses

2263: are larger. We discuss this evolution, its physical causes, and consequences,

2264: in greater detail in \citet{hopkins:bhfp.theory}. Here, we note that the same conclusions

2265: hold, adopting a more complex cosmological model than was considered in that

2266: paper.

2267:

2268: The evolution is similar to that suggested by a number of recent observations

2269: \citep[e.g.][]{shields06:msigma.evolution,

2270: walter04:z6.msigma.evolution,peng:magorrian.evolution,woo06:lowz.msigma.evolution,

2271: salviander:msigma.evolution}

2272: and indirect estimations \citep{hopkins:old.age,merloni:magorrian.evolution,

2273: adelbergersteidel:magorrian.evolution,

2274: wyithe:magorrian.clustering,hopkins:clustering,lidz:clustering}

2275: but we emphasize that it is not strong (it is in fact on the

2276: lower $\sim1\sigma$ end of the estimated evolution) -- typical BHs are

2277: a factor $\lesssim2$ larger at $z\sim2-3$. Larger differences from the

2278: local relations (in particular in very luminous quasar populations) may

2279: result from selection biases \citep{lauer:mbh.bias}, where the most

2280: luminous quasars (most massive BHs) are likely to represent the upper

2281: end of the scatter in the BH-host correlations. The combination of such a

2282: selection effect with our predicted evolution is a very good match to the

2283: evolution in \citet{peng:magorrian.evolution}.

2284:

2285: We also compare with the constraints

2286: in \citet{hopkins:old.age}, derived from observations of the evolution in the

2287: spheroid mass density with redshift (since the black hole mass density cannot, in

2288: any reasonable model, decrease, the maximum evolution in the

2289: typical ratio $\mbh/\mstar$ is limited by the fact that the resulting black hole

2290: mass density, predicted from the observed spheroid mass functions, must

2291: not be higher than that observed at $z=0$). This constraint

2292: is most directly applicable to $\sim L_{\ast}$ galaxies, since this is where most of the

2293: mass density of the universe resides. These constraints limit evolution to a

2294: factor $\lesssim 2$ at $z=2$, consistent with our predictions. Comparing

2295: to similar estimates from \citet{merloni:magorrian.evolution}, who adopt a similar approach

2296: but model the evolution in the black hole mass density from quasar luminosity functions,

2297: yields a similar constraint and expectation of

2298: weak evolution, $M_{\rm BH}/M_{\ast}\sim(1+z)^{0.4-0.6}$,

2299: consistent with our predictions.

2300:

2301: The evolution in $M_{\rm BH}/\sigma$ is somewhat more complex, reflecting the

2302: full interplay between evolution in $M_{\rm BH}$ with spheroid binding energy

2303: and evolution in $\sigma$ with redshift (discussed above). Because

2304: deeper potentials will, in general, be reflected in higher $\sigma$ values,

2305: we expect weaker (or even inverse) evolution here, relative to $M_{\rm BH}/M_{\ast}$

2306: (where $M_{\ast}$ was not a measure of the central potential depth, so

2307: deeper potentials at fixed $M_{\ast}$ translated to higher $M_{\rm BH}$).

2308: However, because $\sigma$ reflects a combination of the halo and galaxy over a

2309: significant dynamic range, the results are non-trivial and can be non-monotonic in

2310: redshift. For example, in our most massive bin, the predicted evolution in

2311: black hole masses is sufficiently strong (and evolution in $\sigma$ relatively weak

2312: as a reflection of the tradeoff between dark matter and baryonic potential)

2313: that evolution to larger $\mbh/\sigma$ (albeit at the factor $\lesssim2-3$ level)

2314: may be expected, consistent with recent observations

2315: \citep{shields06:msigma.evolution,salviander:msigma.evolution}, although

2316: again selection effects are probably important and may explain

2317: cases where the inferred evolution appears to be much larger.

2318:

2319:

2320:

2321: \subsection{Effects of Dissipation}

2322: \label{sec:evol:diss}

2323:

2324: \begin{figure}

2325:     \centering

2326:     %\scaleup

2327:     %\plotone{r_z_mbins_nodiss.ps}

2328:     \plotone{f21.ps}

2329:     \caption{Predicted evolution in spheroid sizes with redshift, as

2330:     Figure~\ref{fig:reff.z}, but in a model with no dissipation (gas is treated

2331:     the same as stars in mergers). Even allowing for evolution in disk

2332:     sizes with redshift, there is essentially no evolution predicted in

2333:     spheroid sizes without dissipation

2334:     (other effects can make the evolution in spheroid sizes

2335:     {\em weaker} than that in disk sizes, which is already observed to be weak).

2336:     \label{fig:reff.z.nodiss}}

2337: \end{figure}

2338:

2339: We have argued that these effects are largely driven by varying degrees of dissipation

2340: as a function of redshift. We briefly illustrate this here by repeating our predictions

2341: for the size evolution of ellipticals (which, as we showed in \S~\ref{sec:evol}, is the

2342: most pronounced evolution) in the version of our model where we ignore

2343: dissipation or treat gas identically to stars (as in \S~\ref{sec:z0:diss:effects}).

2344: Figure~\ref{fig:reff.z.nodiss} shows the results. Without

2345: evolution in gas fractions driving evolution in

2346: the degree of dissipation, only the (weaker) evolution in progenitor disk sizes

2347: has a noticeable effect, and the evolution is substantially suppressed.

2348: In fact, the predicted evolution can, in this case, be even weaker than

2349: that in the progenitor disks, because ellipticals with a given final mass

2350: tend to undergo their mergers over a similar redshift range -- so their final sizes

2351: will reflect whatever the sizes of disks were at that time, whether or not

2352: those disk sizes evolved later.

2353:

2354:

2355: \begin{figure}

2356:     \centering

2357:     %\scaleup

2358:     %\plotone{mbh_ms_z_mbins_nodiss.ps}

2359:     \plotone{f22.ps}

2360:     \caption{Predicted evolution in $\mbh$ at fixed stellar mass with redshift,

2361:     as Figure~\ref{fig:mbh.ms.z}, but in a model with no dissipation.

2362:     The predicted evolution for massive black holes is almost entirely

2363:     a consequence of dissipation.

2364:     \label{fig:mbh.ms.z.nodiss}}

2365: \end{figure}

2366:

2367: A similar

2368: (relative) effect is seen for the evolution in $\mdyn$ and the FP, as well as

2369: BH masses. We show the evolution in black hole mass at fixed stellar mass

2370: predicted without dissipation in Figure~\ref{fig:mbh.ms.z.nodiss}. As expected

2371: based on the evolution in $R_{e}$, the evolution here is essentially

2372: eliminated without dissipation.

2373:

2374:

2375: \subsection{Effects of Dry Mergers}

2376: \label{sec:evol:dry}

2377:

2378: \begin{figure}

2379:     \centering

2380:     %\scaleup

2381:     %\plotone{r_z_mbins_nodry.ps}

2382:     \plotone{f23.ps}

2383:     \caption{Predicted evolution in spheroid sizes with redshift, as

2384:     Figure~\ref{fig:reff.z}, but in a model with no dry mergers (structural change in

2385:     spheroid-spheroid mergers is suppressed). The predicted evolution is

2386:     essentially identical at $M_{\ast}\lesssim10^{11}\,\msun$, but weaker

2387:     for very massive systems, which can no longer be puffed up at

2388:     low redshift to match observations.

2389:     \label{fig:reff.z.nodry}}

2390: \end{figure}

2391:

2392: At the high-mass end, there is also some evolution

2393: because systems merging early will have dry mergers puff up their sizes at

2394: later times. Figure~\ref{fig:reff.z.nodry} shows the evolution in the size-mass relation, with

2395: dry mergers suppressed (as in \S~\ref{sec:z0:dry}). The difference here is

2396: real, but small. Dry mergers -- while important for the size

2397: evolution of individual objects --

2398: do not have a dramatic effect on the overall evolution

2399: of the size-mass relation. If the ``pre dry-merger''

2400: size-mass relation is a power-law, $R_{e}\propto\mstar^{\alpha}$, then

2401: by the energy conservation argument in Equation~(\ref{eqn:egy}), a merger with

2402: mass ratio $f$ ($f\le1$) of two systems initially on this

2403: correlation will have a final effective radius a factor $(1+f)^{2}/(1+f^{2-\alpha})$

2404: larger than the

2405: primary. Relative to the size-mass relation, the system has moved above

2406: the relation by a factor $(1+f)^{2-\alpha}/(1+f^{2-\alpha})$.

2407:

2408: For observed values of $\alpha$ in disks or ellipticals ($\sim0.3$ and $\sim0.6$, respectively)

2409: this amounts to only $\sim0.1$\,dex evolution per

2410: major ($f\gtrsim1/3$) merger, compared to e.g.\ $\sim0.3$ dex scatter in the correlation.

2411: This is why, in Figure~\ref{fig:fp.proj.nodry}, the absence

2412: of dry mergers makes a difference only

2413: at the highest end of the mass function, where the number of major dry mergers

2414: might be large. Most of the population did not experience substantial dry merging, and

2415: what did, experienced it mainly at late times. As a result, the inferred redshift evolution

2416: without dry mergers is largely the same, with only an offset in the rate of size evolution

2417: at $z\lesssim1-2$ for the most massive systems (leading to an offset in the

2418: relative size prediction at $z\sim3$).

2419:

2420: It should be emphasized that the difference seen at the highest masses and redshifts

2421: in Figure~\ref{fig:reff.z.nodry} pertains by definition to those ellipticals which are

2422: {\em already} that massive at those redshifts -- i.e.\ systems which have formed and

2423: assembled $\sim10^{12}\,\msun$ by $z\gtrsim2-3$. These systems obviously

2424: live in the most dense environments, and are very rare -- their evolution is strong

2425: because they are expected to undergo a large number of dry mergers, as well

2426: as mergers with later-forming gas-poor disks and spheroids and minor

2427: mergers (see \S~\ref{sec:track}

2428: below) that will build up an extended envelope and rapidly increase their sizes.

2429: The typical system with the same mass observed at lower redshifts

2430: was not fully assembled at these early times, but forms in less

2431: overdense regions and (although star formation in progenitors may have ceased

2432: at early times) assembled more recently, in a smaller number of dry mergers.

2433: So the evolution seen should not be taken to imply a large number of dry mergers

2434: for the average massive galaxy, but rather to imply a large number  of

2435: subsequent mergers (``large'' being $\sim$ a few; again see \S~\ref{sec:track}

2436: for discussion of how this

2437: can efficiently increase the spheroid size) for, in particular, galaxies

2438: in extreme environments that are already massive at early times.

2439:

2440:

2441: \breaker

2442: \section{Evolution of Individual Systems: What Happens to the Systems

2443: Formed at High Redshift?}

2444: \label{sec:track}

2445:

2446: We have predicted that high-$z$ massive ellipticals should be substantially more compact

2447: than low-$z$ ellipticals of the same mass, in apparent agreement with

2448: observations. If this is true, then what happens to these early-forming, compact ellipticals

2449: as they evolve to $z=0$?

2450:

2451: \begin{figure*}

2452:     \centering

2453:     %\scaleup

2454:     %\plotone{mc_demo_hist.ps}

2455:     \plotone{f24.ps}

2456:     \caption{Median size ({\em left}), velocity dispersion ({\em middle}), and

2457:     instantaneous stellar mass ({\em right})

2458:     as a function of redshift, for various galaxies

2459:     that have similar $z=0$ stellar mass (labeled).

2460:     These median tracks

2461:     are shown for systems that have their first major merger (that first form a

2462:     spheroid) within narrow intervals in $z$ (shown where each line begins: e.g.\ the line

2463:     beginning at $z\sim3.5$ shows the median track of all systems which have their first

2464:     major merger at $z=3.5\pm0.25$ and will have the same labeled stellar mass

2465:     at $z=0$). Different colors show different variations in our model, and reflect the

2466:     uncertainties in redshift evolution (as the shaded range in Figure~\ref{fig:reff.z}).

2467:     Systems which form early are relatively compact, but grow in mass owing to

2468:     subsequent mergers with gas-poor disks and other spheroids.

2469:     \label{fig:tracks}}

2470: \end{figure*}

2471:

2472:

2473: \begin{figure}

2474:     \centering

2475:     \scaleup

2476:     %\plotone{mc_demo_sizemassplane.ps}

2477:     \plotterr{f25.ps}

2478:     \caption{Tracks followed by early-forming systems from

2479:     Figure~\ref{fig:tracks} in the size-stellar mass plane.

2480:     {\em Top:}

2481:     Symbol shape denotes the $z=0$ stellar mass (labeled),

2482:     and color denotes redshift (from black at $z=0$ to red at $z=4$, as labeled).

2483:     Points show the median tracks as in Figure~\ref{fig:tracks} for systems which

2484:     first formed at a given redshift (the beginning of each plotted track).

2485:     Lines show the $z=0$ size mass relation (solid), with slope

2486:     $R_{e}\propto M_{\ast}^{0.6}$, and the same divided by a factor of 2 (dashed)

2487:     and 4 (dotted).

2488:     {\em Bottom:} Vectors illustrate motion in the plane owing to different

2489:     types of mergers, for a system that will have a mass $M_{\ast}\sim10^{12}\,\msun$

2490:     at $z=0$ but first forms as a compact $M_{\ast}\sim10^{11}\,\msun$

2491:     spheroid ($\sim1/4$ the size of a typical similar-mass system at $z=0$)

2492:     at $z\sim3-4$ in a gas-rich merger.

2493:     As most systems are still gas-rich at these times, the next merger ($z\sim3-4$)

2494:     may be similarly gas-rich, moving the system along the

2495:     $R_{e}\propto M_{\ast}^{0.6}$ slope (maintaining the relative

2496:     compact-ness; {\em red}). In biased regions, nearby systems

2497:     are also rapidly transformed into ellipticals (which, having their initial mergers

2498:     at early times from gas-rich disks, are also compact). A 1:1 merger with

2499:     an identical elliptical will double $M_{\ast}$ and $R_{e}$

2500:     ($R_{e}\propto M_{\ast}^{1.0}$; {\em green}). At later times, systems

2501:     merging will be newly accreted (into the halo),

2502:     evolved (relatively gas-poor) disks, or ellipticals made

2503:     at later times from such evolved disks (which, being formed in more

2504:     gas-poor mergers, will be less compact). Such a dry/mixed merger

2505:     (with a less compact system) moves the system up more rapidly than

2506:     dry mergers with identical systems ($R_{e}\propto M_{\ast}^{1.4-1.8}$; {\em blue}).

2507:     \label{fig:tracks.sizemass}}

2508: \end{figure}

2509:

2510: We are now interested in the evolution of individual systems, from their formation redshift

2511: to $z=0$, rather than the evolution of the population at the same mass (recall,

2512: the systems with $\mstar\sim10^{11}\,\msun$ at $z\sim2-3$ are {\em not}, generally,

2513: the same galaxies that have $\mstar\sim10^{11}\,\msun$ at $z=0$).

2514: Figure~\ref{fig:tracks} shows such a case study, in fact a set of them, tracking the

2515: median (averaged over many Monte Carlo realizations) evolution in sizes of

2516: individual systems that are formed at a given redshift and will have a

2517: specific stellar mass at $z=0$.

2518: For two $z=0$ stellar masses where evolution is significant, we

2519: show the evolution in effective radii, velocity dispersions, and stellar masses.

2520: Figure~\ref{fig:tracks.sizemass} shows the same, but projects a

2521: few of the representative median

2522: tracks (for early-forming systems)

2523: into the size-stellar mass plane, to show how the systems move as a

2524: function of time and merger history

2525: in this space relative to e.g.\ the observed $z=0$ correlation.

2526:

2527:

2528: The systems which have their first major merger at some early time are indeed much

2529: more compact than their $z=0$ descendents. Partly, this is because they begin life

2530: as lower-mass systems, but even for their (instantaneous)

2531: stellar mass they are more compact than $z=0$ objects of that mass,

2532: reflecting the higher degree of dissipation (the evolution in Figure~\ref{fig:reff.z}).

2533: However, by $z=0$, systems of the same stellar mass from different formation

2534: epochs all have effective radii within a narrow range (a factor of $\sim2$).

2535: The systems which form late ($z\lesssim1$)

2536: evolve weakly -- they rarely experience any dry mergers, so their $z=0$ sizes

2537: largely reflect their sizes at formation. However, the systems which form

2538: early ($z\gtrsim3$) evolve strongly -- they experience a number of

2539: both mergers with lower-redshift, larger and much less gas-rich disks,

2540: as well as other spheroids (true spheroid-spheroid dry mergers).

2541: Such mergers increase the sizes of the ellipticals both in absolute

2542: terms and in relative terms (moving them above the size-mass relation

2543: representative of the time when they formed). Although the

2544: number of major mergers for most systems is relatively small,

2545: it is a function of their formation time. The systems which first merge at

2546: earliest times do so because they live in highly biased environments -- they

2547: are precisely those expected to undergo the most dry merging.

2548: As a result, the effects of evolving dissipational content and the effects of

2549: dry mergers appear to conspire such that, despite a significant evolution in the

2550: size-mass relation with redshift, by $z=0$ ellipticals formed

2551: at early times will have broadly similar sizes to those forming around $z\sim0$.

2552:

2553: If we examine Figures~\ref{fig:tracks}-\ref{fig:tracks.sizemass}, we can plot out the likely

2554: course of evolution for the compact, massive passive galaxies seen at

2555: $z\sim3$ in \citet{kriek:drg.seds,

2556: zirm:drg.sizes,labbe05:drgs,wuyts:irac.drg.colors,

2557: daddi05:drgs,vandokkum:z2.sizes,franx:size.evol,damjanov:red.nuggets}.

2558: At the redshifts observed, these are systems which have formed

2559: recently (systems which form earlier will be even more rare) in a

2560: gas-rich merger \citep[in agreement with their young observed ages,

2561: relative to $z=0$ spheroids of the same mass;][]{kriek:drg.seds},

2562: very high gas fractions $\gtrsim30-40\%$ characteristic of

2563: disks at these and higher redshifts.

2564: The large dissipational fraction

2565: yields a massive galaxy ($M_{\ast}\gtrsim10^{11}\,\msun$)

2566: with a very small effective radius $\sim1$\,kpc. As discussed in \S~\ref{sec:evol}

2567: above, however, the velocity dispersion is not

2568: extremely large (if we took $\sigma(z)=\sigma(\mstar,0)\times\,R_{e}(0)/R_{e}(z)$,

2569: we would estimate $\sigma\sim600-800\,{\rm km\,s^{-1}}$;

2570: however, the predicted velocity dispersion is in fact $\sim250-300\,{\rm km\,s^{-1}}$).

2571: Because the system is so much more compact, the dark matter fraction within

2572: $R_{e}$ is much smaller than in a typical $z=0$ spheroid of the same mass,

2573: so its contribution to the velocity dispersion is much smaller.

2574:

2575: The system,

2576: having formed in a high density environment \citep[consistent with

2577: their observed clustering and number densities; e.g.][]{quadri:highz.color.density,

2578: vandokkum06:drgs},

2579: will undergo a

2580: significant number of mergers between this time and $z\sim1-2$, where

2581: the merger rate begins to level off. These mergers are with a mix of

2582: disks and ellipticals. At times shortly after formation, successive mergers

2583: may be with other gas-rich disks, yielding similar amounts of

2584: dissipation and keeping the system relatively compact.

2585: This is illustrated in Figure~\ref{fig:tracks.sizemass} as a

2586: ``similar gas-rich merger''; which will move the system parallel to

2587: the size-mass relation ($R_{e}\propto M_{\ast}^{0.6}$).

2588: Soon, nearby systems will (given the biased environment) be

2589: largely elliptical -- but they may be similar, compact ellipticals.

2590: Spheroid-spheroid dry mergers will puff up the system,

2591: moving it up somewhat relative to the initial size-mass relation at

2592: formation. This is shown as a ``dry merger with similar

2593: (compact) elliptical'' in Figure~\ref{fig:tracks.sizemass} --

2594: here merging two identical (equally compact) ellipticals doubles

2595: both $R_{e}$ and $M_{\ast}$ (see \S~\ref{sec:model:sims:dry}),

2596: moving the system ``up'' relative to the size-mass

2597: relation along the steeper axis $R_{e}\propto M_{\ast}^{1.0}$.

2598:

2599: As time goes by (in a hierarchical scenario),

2600: the galaxies merging will have formed in a more extended

2601: physical region over longer periods of time (being recently accreted into

2602: the parent halo); merging disks at later times will be progressively

2603: more gas-poor (characteristic of low-$z$ disks, which have consumed their

2604: gas in star formation), and thus contribute dissipationless stars with

2605: very large effective radii, building up the stellar component

2606: (an extended, violently relaxed envelope) at large radii. Likewise, pure

2607: spheroid-spheroid mergers will include spheroids that formed

2608: later from such less gas-rich disks (and are therefore less compact),

2609: and these will serve to puff up the system even more efficiently

2610: and build up an envelope consistent with that typically observed in

2611: BCGs at $z=0$ (the expected location of such systems at $z=0$,

2612: based on their clustering properties and inferred halo masses).

2613: This is denoted in Figure~\ref{fig:tracks.sizemass} by the label

2614: ``dry/mixed mergers with gas-poor disks/later-forming (less compact)

2615: ellipticals,'' and allows the system to move more rapidly

2616: ``up'' in size relative to the size-mass relation (along an

2617: axis $R_{e}\propto M_{\ast}^{1.4-1.8}$, depending on the details of the

2618: type of mergers involved). This enables systems that form compact

2619: for their mass to migrate to the $z=0$ size-mass relation in a

2620: relatively small number of mergers

2621: (in the illustrative example in Figure~\ref{fig:tracks.sizemass}, the

2622: system moves from being $1/4$ the size of a $z=0$ analogue to

2623: lying on the local relation in just two equal-mass

2624: dry mergers, one with a similarly compact elliptical, one

2625: with a later-forming, less compact elliptical; the latter merger

2626: builds a more extended envelope and therefore moves the

2627: system more rapidly towards agreement with the $z=0$ relation).

2628:

2629: At sufficiently large masses (especially for galaxies at the center of

2630: massive clusters), minor mergers will increasingly dominate the

2631: growth of the galaxy \citep[see e.g.][]{maller:sph.merger.rates,zheng:hod.evolution,

2632: masjedi:cross.correlations}; these will have similar effects to those described

2633: here, and may even further increase the size at $z=0$ (especially if small

2634: merging systems are disrupted as they merge; in this case they will

2635: not contribute much to the central structure of the galaxy, but will increasingly

2636: build up an extended envelope at large radii). More detailed modeling of the structure

2637: of e.g.\ BCGs and the most massive ($M_{\ast}\gtrsim10^{12}\,M_{\sun}$)

2638: galaxies should account for both major and minor mergers in dense environments.

2639:

2640: The effective radius therefore rapidly grows, and by $z=0$ the system

2641: lies on (or even somewhat above) the median size-mass relation for

2642: systems of the same $z=0$ stellar mass. The velocity dispersion grows

2643: slightly as mass is accumulated, but since most of this mass is contributed

2644: from dissipationless components to the extended envelope (recall,

2645: a merger of two identical spheroids will leave $\sigma$ unchanged), it

2646: does not contribute much, and the $z=0$ galaxy has a peak

2647: (central) velocity dispersion $\approx400\,{\rm km\,s^{-1}}$, large but only

2648: $30\%$ larger than its initial central velocity dispersion and

2649: completely consistent with those observed in the most massive galaxies

2650: today \citep{bernardi:most.massive}.

2651: In detail, in fact, the predicted descendants of

2652: early compact systems here have similar abundances,

2653: velocity dispersions, and remarkably similar predicted locations in e.g.\ the

2654: $z=0$ size-mass, fundamental plane, and Faber-Jackson relations

2655: to the sample in \citet{bernardi:most.massive}, suggesting that

2656: many of those systems may be the products of this process.

2657: For the most part, then, these extreme systems

2658: are completely consistent in all the properties we can predict here

2659: (effective radius, mass, velocity dispersion, black hole mass, and

2660: profile shape/Sersic index) with their constituting the central $\sim10-30\%$ of

2661: the mass in a significant fraction of the brightest group or cluster galaxies

2662: today.

2663:

2664:

2665:

2666: \begin{figure}

2667:     \centering

2668:     %\scaleup

2669:     %\plotone{MC_z_last.ps}

2670:     \plotone{f26.ps}

2671:     \caption{Fraction of $z=0$ spheroids of a given stellar mass which had their

2672:     last major merger (gas rich or gas poor) above a given redshift.

2673:     A few percent of massive systems today are expected to have survived

2674:     from a much earlier formation time $z\sim2-3$ without any subsequent

2675:     mergers modifying their structure -- i.e.\ be remnants of the era where

2676:     scaling relations were different. These ellipticals would generally be compact

2677:     for their mass, with old ages, and strongly clustered.

2678:     \label{fig:surviving.frac}}

2679: \end{figure}

2680:

2681:

2682: This can explain why such objects are not ubiquitous in the

2683: local Universe. However, we could still

2684: ask what fraction of such systems might be expected to survive to $z=0$,

2685: without disappearing into a larger $z=0$ galaxy. In

2686: short, should any of this massive and compact population exist at $z=0$, which

2687: might represent the direct remnants of high-redshift, compact ellipticals?

2688:

2689: Figure~\ref{fig:surviving.frac} shows the fraction (at a given $z=0$ stellar mass) of objects

2690: which had their last major merger of any kind (which will largely

2691: set their observed size) at (or above) a given redshift. At

2692: masses corresponding to those observed in high-redshift compact galaxy populations

2693: ($\sim10^{11}\,\msun$), the fraction of the population today which has survived

2694: relatively intact since $z\gtrsim2$ is expected to be small, $\sim1-10\%$.

2695: This translates to a $z=0$ space density of such systems

2696: $\sim10^{-4}\,{\rm Mpc^{-3}}$: not large, but not completely negligible.

2697: This is an upper limit to the number density of such systems: if they have experienced a

2698: sufficiently large number of minor mergers, they may be significantly altered

2699: by $z=0$ (although many of the same reasons that allow some systems to avoid

2700: major mergers will allow them to avoid a large number of minor mergers).

2701: At any redshift, however, there is a scatter in elliptical sizes owing to a scatter in

2702: e.g.\ the dissipational fractions at the time of their formation (reflecting scatter

2703: in disk gas fractions). Since disk gas fractions have significant (factor $\sim1.5-2$)

2704: scatter, that is actually expected to dominate the scatter in e.g.\ the size-mass relation

2705: at $z=0$ (rather than scatter in the formation times of ellipticals). That is

2706: to say, there could be a similar fraction of

2707: ellipticals that are {\it too compact} present at $z=0$

2708: which formed recently (not at $z>2$), from the most anomalously gas-rich disks

2709: at lower redshifts (say the $\sim2\,\sigma$ outliers

2710: in the disk gas fraction-mass relations). This probably will not explain systems

2711: as small as $\sim1\,$kpc at these stellar masses, but it illustrates that some additional

2712: indicator is needed to distinguish those possibilities. If stellar population ages could

2713: be measured, this is straightforward: the system which formed at low redshift will have

2714: significant recent star formation (in fact, quite a lot, since we are interested in

2715: compact systems with large dissipational fractions). So it should be possible, in principle,

2716: to identify the descendents of the early-forming ellipticals.

2717:

2718: One important caveat to this discussion is that we have focused here on the

2719: role of major mergers with mass ratios below 1:3.

2720: Our calculation and other halo occupation models \citep{zheng:hod.evolution},

2721: clustering measurements \citep{masjedi:cross.correlations},

2722: and cosmological simulations \citep{maller:sph.merger.rates}

2723: suggest that as galaxies approach the extreme $M_{\ast}\gg 10^{12}\,M_{\sun}$

2724: BCG populations, growth by a large series of minor mergers becomes

2725: progressively more

2726: important than growth by major mergers. Our predictions in this limit should

2727: therefore be treated with some degree of caution,

2728: in contrast with the bulk of the elliptical population at masses

2729: $\lesssim$ a few $L_{\ast}$, the vast majority of which still live in less

2730: extreme environments \citep[e.g.][]{blanton:env,wang:sdss.hod,

2731: masjedi:merger.rates} and are expected to

2732: have experienced a small number ($\sim$ a couple) of major re-mergers

2733: since redshifts $z\sim2-4$. Our modeling can, in principle, be extended to

2734: include minor mergers, and doing so we find qualitatively similar conclusions

2735: (because merger histories are roughly self-similar, the relative growth

2736: and courses of evolution seen in Figure~\ref{fig:tracks} are conserved).

2737: However, as demonstrated in \citet{hopkins:groups.ell}, our predictions

2738: (and those of other halo occupation-based approaches) become considerably

2739: less robust in the regime of minor mergers (mass ratios $\sim$1:10 or so),

2740: owing to greater uncertainty in the satellite mass function at these

2741: mass ratios and weaker observational constraints on the halo occupation of

2742: small galaxies.

2743:

2744: In any case, because early-forming, compact systems were massive and experienced

2745: mergers at such early times, many will represent the primary galaxies

2746: in the highest density peaks, being BCGs today. Indeed, this is true for most of

2747: the massive high-$z$ spheroid population. However, in order to

2748: {\em survive} to $z=0$ from this population, mergers must be avoided (by our

2749: definition) -- something very hard for BCGs. We would still expect to find

2750: such objects around high-density peaks (there is no getting around

2751: the fact that they will only form so early and so massive in high-density peaks),

2752: but those that survive, in many cases, would be expected to do so

2753: precisely because they were {\em not} the central galaxy or BCG. (In detail, they

2754: probably were the central galaxy or BCG at their time of formation, but

2755: were displaced from this position in subsequent halo-halo mergers,

2756: becoming a satellite in a still larger system, suppressing their merger history).

2757:

2758: There may be some interesting candidates for such objects

2759: among the so-called ``compact elliptical'' family. For example

2760: (although we intend this only as an illustrative case), NGC 4486B is a

2761: satellite of NGC 4486 in Virgo. It has a stellar mass $\gtrsim10^{10}\,\msun$

2762: (\papertwo) and

2763: velocity dispersion $\sigma=200\,{\rm km\,s^{-1}}$, but an effective

2764: radius of only $200$\,pc \citep[see e.g.][]{jk:profiles}. This is somewhat smaller than

2765: the $\sim10^{11}\,\msun$ which we just described, but the scenario

2766: can be generalized to masses about this low.

2767: The stellar population age is quite old, $10.5$\,Gyr

2768: \citep[perhaps even up to $\sim13$\,Gyr,

2769: depending on the indicator used;][]{caldwell:ssps}, implying a time of

2770: last significant star formation $z\gtrsim2$.

2771: This is further supported by a corresponding $\alpha$-enrichment and metallicities.

2772: The system has high rotation and ellipticity suggestive of a very gas-rich formation

2773: \citep{bender:88.shapes,bender:ell.kinematics}.

2774: The estimated dynamical friction time is sufficiently long that the system could have become

2775: a satellite in the Virgo progenitor as early as $z\sim2$ and still not merged with the

2776: central galaxy. All of these lines of evidence suggest it may have

2777: survived from an early formation

2778: time as a very compact system. Unfortunately, except for the direct stellar population

2779: age measurements, these arguments are mostly circumstantial, and those

2780: stellar population estimates are still quite uncertain. Better determinations of the

2781: star formation history in this object would be an important test of this hypothesis.

2782:

2783: \breaker

2784: \section{Conclusions}

2785: \label{sec:discuss}

2786:

2787: We have combined the results from a large library of numerical simulations with

2788: simple, observationally constrained models for the halo occupation

2789: (and corresponding merger history) of galaxies in order to predict

2790: how spheroid scaling laws evolve. We demonstrate that, despite

2791: a variety of progenitor properties, complex merger histories, and evolution in

2792: e.g.\ progenitor sizes and gas content with redshift, the most important

2793: property driving the structural scaling relations of spheroids at each redshift

2794: (and, in particular, making these scaling relations {\em different} from those

2795: obeyed by disk galaxies) is the amount of dissipation involved in the

2796: formation of the spheroid. At the highest masses, dry mergers at relatively

2797: late times are also important, but are few, consistent with

2798: observational constraints. Together, these predict a significant mass-dependent

2799: evolution in spheroid scaling laws, which has important consequences

2800: for the fundamental plane, spheroid densities, and the formation histories of

2801: massive galaxies.

2802:

2803: At $z=0$, a simple model accounting for the effects of dissipation in mergers

2804: is able to explain the size-mass, velocity dispersion-mass, and fundamental

2805: plane (dynamical-stellar mass) correlations, the

2806: relation between galaxy profile shape (Sersic index) and mass,

2807: the correlation between black hole mass and various host properties

2808: (bulge binding energy, mass, velocity dispersion, and profile shape),

2809: the relative mass fractions in compact dissipational components in ellipticals,

2810: and the abundance of cuspy/disky/rapidly rotating versus cored/boxy/slowly

2811: rotating ellipticals \citep[see][]{hopkins:groups.ell}.

2812: Dissipation in the initial disk-disk mergers that form the spheroid

2813: progenitors is the most important parameter controlling these relations.

2814:

2815: In particular, at high masses, where disks are gas-poor, the sizes, densities,

2816: and dynamical mass to stellar mass ratios of ellipticals reflect those of their

2817: progenitor disks. At lower masses, where disks are gas-rich, mergers involve a

2818: significant amount of new star formation in dissipational nuclear starbursts triggered

2819: by loss of energy and angular momentum in the gas in the merger, building up a

2820: compact central stellar distribution (the ``dissipational'' component)

2821: that yields a smaller, more dense remnant. This mass-dependence in the

2822: dissipational content of progenitor disks makes the size-mass relation of ellipticals

2823: steeper than that of disks and gives the observed tilt in the fundamental plane

2824: (essentially the statement that low-mass disks, being more dominated by this

2825: nuclear dissipational component, have higher baryon-to-dark matter fractions within

2826: their {\em stellar} effective radii). Black holes are formed with a mass that, in a simple

2827: feedback-regulated model, corresponds to the binding energy of the

2828: dissipational component of bulge at the time of this dissipation, which the black hole

2829: must work against in order to self-regulate its growth. This matches the

2830: detailed observations of a black hole fundamental plane in \citet{hopkins:bhfp.obs}

2831: and gives rise to the secondary (indirect) correlations between black hole

2832: mass and spheroid velocity dispersion, mass, size, and Sersic index (all of which

2833: agree with those observed).

2834:

2835: To lowest order, the dissipational fractions that give rise to these trends at

2836: $z=0$ reflect the observed gas fractions of disk galaxies at

2837: moderate redshifts $z\sim0-2$. This is because most ellipticals are formed in a

2838: relatively small number of major mergers at low redshifts in this range.

2839: It is only when one considers the most massive populations of e.g.\ BCGs and

2840: massive cored, boxy, slowly rotating ellipticals that dry mergers become important.

2841: Their effect is to puff up these systems, producing extended envelopes and

2842: raising their outer Sersic indices. This is sufficient to explain the observed weak dependence

2843: of outer Sersic index on mass in elliptical galaxies, following the

2844: detailed multi-component decompositions of observed systems

2845: in \papertwo\ and \paperthree. When systems are fitted to a single

2846: Sersic index, the result is a steeper dependence that reflects a complex

2847: combination of e.g.\ the outer Sersic index (primarily driven by merger history),

2848: the mass fraction in a dissipational component (yielding rising densities

2849: at small radius), the shape of the two components where they are comparable,

2850: and the dynamic range of the fit (since real ellipticals are not perfect Sersic profiles,

2851: there is some dependence, and in particular when fitting the entire profile to a single

2852: Sersic law, the dependence can be significant).

2853:

2854: The detailed interplay of dissipation and dry mergers also gives rise to

2855: important residual trends that can be used as tests of these models. For example,

2856: at a given stellar mass, the oldest systems will have the largest effective

2857: radii and dynamical masses. This is driven by two effects: first, disks which have

2858: high star formation efficiencies (earlier star formation times)

2859: will be more gas-poor at the time of their mergers -- i.e.\ will have less mass in a

2860: dissipational component. Second, systems which may be gas-rich but form very

2861: early (i.e.\ have a high redshift of their last gas-rich merger) tend to do so because

2862: they are in dense environments, and will therefore experience subsequent

2863: mergers with both more gas-poor disks and other spheroids, increasing their

2864: size and dynamical-to-stellar mass ratios.

2865:

2866: At higher redshifts, disk gas fractions are expected and observed to be

2867: systematically larger. As a result, spheroids observed at these redshifts

2868: are expected to have formed in more gas-rich mergers and

2869: to have a larger mass fraction in their dissipational component, making

2870: them more compact.

2871: We predict that elliptical sizes evolve in a mass-dependent fashion:

2872: since low-mass disks are still quite gas-rich at $z=0$,

2873: there is little room for them to become more so at high redshift, so

2874: they cannot evolve much. High mass disks, on the other hand, could in principle

2875: be much more gas rich than their observed $z=0$ $\lesssim10\%$ values.

2876: As well, at high masses, dry mergers become important in the

2877: predicted size evolution. Consequently, the highest mass ellipticals

2878: are expected to evolve to become a factor $\sim2-3$ smaller at $z\gtrsim2$.

2879: In terms of the mean effective radius of spheroids of a given

2880: stellar mass, relative to that at $z=0$, we find evolution of the form

2881: $(1+z)^{-\beta}$ where $\beta\approx0.23\,\log{(M_{\ast}/10^{9}\,\msun)}$

2882: (of course enforcing $\beta>0$ or whatever minimum evolution is set

2883: by observed evolution in progenitor disk sizes at these masses).

2884:

2885: This is consistent with

2886: observations from e.g.\ \citet{trujillo:size.evolution,

2887: mcintosh:size.evolution,zirm:drg.sizes,vandokkum:z2.sizes},

2888: although we highlight important biases arising from

2889: stellar population gradients which are relevant when attempting to estimate

2890: the sizes of the most massive galaxies at $z\gtrsim2$.

2891: We demonstrate that this size evolution owes almost entirely to

2892: the change in dissipation with redshift, although dry mergers

2893: are of some importance for the most massive systems. Whether

2894: we explicitly include it or not, the evolution cannot be explained

2895: by simple scaling of disk sizes with redshift \citep[which, in any case, is

2896: observed to be weak; see e.g.][and references therein]{somerville:disk.size.evol}.

2897:

2898: This evolution has important implications for the other fundamental plane correlations.

2899: Because the fundamental plane tilt (at $z=0$) arises owing to systematically

2900: higher dissipational fractions in low-mass systems (making them more

2901: compact and yielding lower dark matter fractions within the stellar $R_{e}$),

2902: at high redshift (where even high-mass systems must be highly dissipational,

2903: since disk gas fractions are much higher) we expect the tilt to be weaker.

2904: Note that we refer here to the {\em stellar mass} fundamental plane,

2905: i.e.\ $M_{\rm dyn}$ versus $\mstar$ -- there will of course be stellar population

2906: effects introducing normalization and tilt changes in various observed bands.

2907: The predicted change is within the $z=0$ uncertainties at $z\le1$, but by $z\gtrsim3$,

2908: the tilt is predicted to largely disappear. This is important both as a test of these

2909: models, but also as a caution to observational attempts to use the fundamental plane

2910: as a detailed stellar population probe. It is certainly much less significant than the

2911: expected evolution in mass-to-light ratios in optical bands, but could be important

2912: for detailed stellar population probes such as that in \citet{vandokkum:imf.evol}

2913: which assume a fixed stellar mass fundamental plane (although selection of

2914: e.g.\ satellite galaxies in clusters, which are less likely to undergo significant future

2915: merging relative to central galaxies in the field or groups, may be sufficient to mitigate

2916: these evolutionary effects).

2917: % and compare different

2918: % stellar population indicators and mass-to-light ratios (basically this physical evolution could

2919: % be mistaken for an ``excess'' or ``deficit'' of $\sim0.1$\,dex in stellar population

2920: % mass-to-light ratio evolution).

2921:

2922: We argue that velocity dispersions will

2923: change weakly with redshift, despite the evolution in

2924: effective radii, because systems with higher dissipational fractions (smaller $R_{e}$)

2925: are more baryon-dominated in their stellar effective radii (i.e.\ the velocity dispersion

2926: includes a weaker contribution from the halo). This is important, because velocity dispersion

2927: is in general preserved or increases in subsequent re-mergers. If high-redshift

2928: massive galaxies had extreme velocity dispersions $\gtrsim600\,{\rm km\,s^{-1}}$

2929: (corresponding to the simple assumption that $\sigma$ is inversely

2930: proportional to just the stellar effective radius), then the appropriate number of

2931: systems with equal or higher velocity dispersions would have to exist at $z=0$

2932: \citep[and they do not, even in survey volumes of the SDSS;][]{bernardi:most.massive}.

2933: However, accounting for the interplay between

2934: dark matter and baryonic mass, our predicted evolution in velocity dispersions is

2935: completely consistent with observational constraints.

2936:

2937: Furthermore, because of the increasing dissipational components and

2938: deepening potential wells at high redshift, we expect that, in any feedback-regulated

2939: model of black hole growth, black holes must become at least somewhat more

2940: massive (relative to their host spheroid stellar mass) at high redshift. We predict a

2941: similar mass-dependent evolution in $M_{\rm BH}/M_{\ast}$, with little or

2942: no significant evolution at $M_{\ast}\lesssim10^{10}\,\msun$ (corresponding to

2943: black holes with masses $\lesssim10^{7}\,\msun$), and factor

2944: $\sim2$ increase to $z\sim2-3$ in the masses of the black holes in the most massive

2945: systems ($M_{\ast}\gg10^{11}\,\msun$, corresponding to $\mbh\gtrsim10^{9}\,\msun$).

2946: Likewise, we predict the evolution in black hole mass relative to host velocity dispersion,

2947: which exhibits a more complex behavior.

2948: The physical details of driving these trends are discussed in more detail in

2949: \citet{hopkins:bhfp.theory}, but here we embed those models in a more fully

2950: cosmological context. The evolution predicted is consistent with recent

2951: observations \citep{peng:magorrian.evolution}

2952: and with indirect constraints based on the relative

2953: evolution of black hole mass density and spheroid mass density \citep{hopkins:old.age,

2954: merloni:magorrian.evolution}, however it is still relatively weak.

2955: Selection effects \citep{lauer:mbh.bias}

2956: would be expected to explain the difference between some observational

2957: estimates that infer very strong \citep[e.g.][]{walter04:z6.msigma.evolution} or rapid

2958: low-redshift \citep{woo06:lowz.msigma.evolution} evolution

2959: and our predictions.

2960:

2961: We outline the history of these massive systems which are different

2962: when first formed at high redshift (where they are very dissipational) and

2963: observed at low redshift. Most of these compact, early forming ellipticals will

2964: undergo a significant number of dry mergers, or mergers with lower-redshift,

2965: more gas-depleted and larger disks, which will serve to puff up the remnant, building

2966: up some stellar mass and an extended stellar envelope (although changing

2967: the central velocity dispersion by a relatively small amount). This raises

2968: their effective radii more rapidly than their stellar mass, in the sense of

2969: moving them up in the size-mass relation. Interestingly, the evolution in disk

2970: gas fractions and the effects of such dry mergers almost exactly offset one another

2971: in massive systems. In other words, a $\sim10^{12}\,\msun$ galaxy which

2972: first formed from a merger at $z=3$ (which had a high gas content, leading to a very

2973: compact $\gtrsim10^{11}\,\msun$ initial galaxy with $R_{e}\sim1$\,kpc)

2974: will have almost the same size at $z=0$ as a

2975: galaxy of the same observed stellar mass which formed at much later time,

2976: say $z=1$, from a less dissipational merger (because disk gas fractions have

2977: decreased in that time interval). The system which formed earlier was initially more

2978: compact, but experienced more mergers in the intervening time, so for a given

2979: stellar mass, the scatter in $R_{e}$ is not large. Likewise, velocities dispersions

2980: and other fundamental plane properties have small scatter at a given stellar mass

2981: at any observed redshift because of this cancellation effect.

2982:

2983: This is not to say that most systems will undergo many dry mergers -- the overall

2984: density of dry mergers predicted here is low, consistent with observational

2985: constraints from e.g.\ \citet{bell:dry.mergers,vandokkum:dry.mergers,lin:mergers.by.type},

2986: and as we noted above most

2987: (especially $\lesssim L_{\ast}$) ellipticals today formed in a few

2988: (or just one) gas-rich mergers at relatively low redshifts. However, the systems

2989: which form earliest do so because they live in the most dense environments,

2990: which evolve the most rapidly, and are expected to undergo the most subsequent

2991: merging activity. We predict, as a result,

2992: that most of these compact systems observed at high redshifts

2993: will end up constituting some of the central dense stellar mass in

2994: observed brightest cluster or group galaxies at $z=0$; this is consistent with

2995: the observed sizes \citep{vonderlinden:bcg.scaling.relations},

2996: surface brightness profile shapes (extended envelopes

2997: build up through their subsequent dry merging) \citep[see e.g.\

2998: \paperthree\ and][]{gallagherostriker72,

2999: naab:dry.mergers},

3000: and masses and velocity dispersions \citep{bernardi:bcg.scalings} of these $z=0$ systems,

3001: and also expected given the clustering properties of high redshift massive

3002: systems \citep{quadri:highz.color.density}.

3003:

3004: We estimate what

3005: fraction, as a function of stellar mass, of

3006: the massive galaxy population today may have survived from these early times un-remerged

3007: and thus reflect the smaller sizes and higher dissipational content characteristic of

3008: systems formed at that time, finding that e.g.\ $\sim1-5\%$ of $z=0$

3009: $\sim10^{11}\,\msun$ systems may have survived since $z\sim2$. This is small,

3010: but not negligible, and by identifying compact ellipticals with old stellar population

3011: ages, it should be possible to locate the surviving remnants of this population.

3012: We predict that these will be massive, compact (with other indicators of gas-rich

3013: origin and high inferred dissipational fractions from their stellar profiles),

3014: with old stellar population ages. Because they still generally would initially form in

3015: high-density environments, but have avoided merging since then, we

3016: would roughly expect them to be in dense environments (rich groups and

3017: clusters) today, but {\em not} to be the central galaxy (again, most early-forming

3018: systems will be the central galaxy -- but if they are all the way until $z=0$,

3019: it is unlikely that they will have avoided further mergers, and thus would

3020: not be

3021: un-remerged). There may be some observed candidate members of this

3022: population in the so-called ``compact elliptical'' class, galaxies similar to

3023: NGC 4486B in Virgo which appears to satisfy these criteria. Further observations

3024: to test this hypothesis could prove extremely valuable as nearby probes of

3025: high-redshift galaxy evolution.

3026:

3027: Future work should extend these models to include the (at present

3028: more uncertain) role of particularly minor mergers

3029: (where e.g.\ satellite disruption may be important to explain the extended halos in

3030: massive BCGs) and other sources of dissipation, such as stellar mass

3031: loss \citep[see e.g.][]{ciottiostriker:recycling}. However,

3032: we show that other effects, e.g.\ evolution in the sizes of disk galaxies,

3033: dry mergers alone, and evolution in galaxy host halo properties, are insufficient to

3034: explain effects like those predicted here, in particular to account for

3035: e.g.\ the tilt of the fundamental plane, the relative sizes of ellipticals (versus those of

3036: disks) as a function of mass, the scaling of elliptical galaxy profile shapes with mass,

3037: and the redshift evolution in the fundamental plane

3038: correlations. Therefore, these predictions are important tests of any model

3039: in which dissipational star formation is a significant influence on galaxy formation.

3040:

3041: Because dissipation is necessary in a basic sense to reconcile the

3042: densities of ellipticals and spirals, this represents an important test of

3043: the merger hypothesis itself, as well as a test of our understanding of galaxy merger

3044: histories and the evolution in different galaxy components with redshift.

3045: In addition, we provide a unified

3046: theoretical lens through which to interpret a number of

3047: observations of spheroid and black hole-host

3048: scalings at both $z=0$ and high redshift, in which spheroids are fundamentally

3049: multi-component objects (with dissipational components originally formed

3050: in central starbursts triggered in the initial gas-rich interactions that formed

3051: the system, and dissipationless components representing the scattered,

3052: violently relaxed stars from the pre-initial merger stellar disks).

3053: As multi-component objects, it is primarily the relative mass fraction in the

3054: dissipational component, reflecting the gas content of spheroid progenitors,

3055: that drives their structural properties and the evolution of those structural

3056: properties with redshift.

3057:

3058:

3059: \acknowledgments We thank Tod Lauer, John Kormendy, Sadegh Khochfar,

3060: Marijn Franx, Ivo Labbe, Norm Murray, Chien Peng,

3061: and Barry Rothberg for helpful discussions and contributed data sets

3062: used in this paper. This work

3063: was supported in part by NSF grants ACI 96-19019, AST 00-71019, AST

3064: 02-06299, and AST 03-07690, and NASA ATP grants NAG5-12140,

3065: NAG5-13292, and NAG5-13381. Support for

3066: TJC and SW was provided by the W.~M.\ Keck

3067: Foundation.

3068:

3069: \bibliography{/Users/phopkins/Documents/lars_galaxies/papers/ms}

3070:

3071:

3072: %\clearpage

3073:

3074:

3075: \begin{appendix}

3076: %\appendixcolumns

3077:

3078: \section{Implementation of the Halo Occupation Model}

3079: \label{sec:appendix}

3080:

3081: Here, we present a condensed outline of the simplest implementation of the model

3082: used to make the predictions in this paper. As described in the text

3083: (\S~\ref{sec:model}), we have experimented with a wide variety of

3084: modifications to the model elements and methodology -- but our intention here

3085: is to summarize

3086: the basic framework upon which these modifications represent increasing

3087: layers of complexity. We therefore leave the

3088: description of these experiments to \S~\ref{sec:model} and

3089: \citet{hopkins:groups.qso}.

3090:

3091: As outlined in \S~\ref{sec:model:sims:summary}, our methodology consists of a

3092: few key steps:

3093:

3094: We construct a Monte Carlo sample of halos at some observed redshift

3095: $z_{\rm obs}$, sampling according to the halo mass function at that

3096: redshift \citep[constructed in standard fashion for the adopted cosmology

3097: following][]{shethtormen}.

3098:

3099: For each halo, we determine a mock growth history, i.e.\

3100: $M_{\rm halo}(z)$ for all $z>z_{\rm obs}$ up to some

3101: initial (maximum) redshift $z_{\rm init}$. There are several

3102: ways to do this: using the extended Press-Schechter formalism,

3103: adopting fits to individual halo growth histories in simulations

3104: \citep[taken from e.g.][]{delucia:ell.formation,

3105: stewart:mw.minor.accretion} or directly tracking the main-branch

3106: progenitor halo mass of each $z=0$ halo in the simulations,

3107: integrating over analytic fits to mean merger histories from

3108: initial seed populations \citep{fakhouri:halo.merger.rates}, or adopting the mean

3109: $M_{\rm halo}(z | M_{\rm halo}[z=z_{\rm obs}])$ for a given $z=z_{\rm obs}$

3110: population of halos

3111: fitted in e.g.\ \citet{wechsler:concentration,wechsler:assembly.bias,

3112: neistein:natural.downsizing}.

3113: Because we are considering the population in a

3114: {\em statistical} fashion, it makes no difference which of these

3115: approaches we adopt so long as they yield a similar

3116: median (and the scatter in galaxy properties in a given halo is,

3117: in any case, larger than the scatter at different times between

3118: these methodologies). The simplest approach, then, is to

3119: consider each halo in the Monte Carlo sample to have a mass

3120: at each $z>z_{\rm obs}$ given by the mean growth

3121: history \citep[adopting the analytic fits in][in

3122: their Equation~11 and Appendices]{neistein:natural.downsizing}.

3123:

3124: Now we have an average $M_{\rm halo}(z)$ for each halo in

3125: our representative Monte Carlo sample.

3126: We then begin at some initial redshift $z_{\rm initial}$

3127: and evolve the system forward in timesteps of

3128: some $\Delta z$.\footnote{Here, we choose $z_{\rm initial}=6$ and

3129: $\Delta z=0.01$, but in general we find that

3130: our results converge with respect to $\Delta z$ for values

3131: $\Delta z \lesssim 0.1$, and the predictions at any given

3132: $z_{\rm obs}$ converge rapidly once $z_{\rm initial}$

3133: is larger by some difference $z_{\rm initial}-z_{\rm obs}\gtrsim 1-2$.

3134: The reasons for this are discussed in the text (\S~\ref{sec:model:hod}) and below.}

3135: We ignore halos until they reach a mass (our effective resolution limit,

3136: corresponding to resolution limits in many of the simulations

3137: on which the HOD calculations are based) of

3138: $M_{\rm halo}=10^{10}\,\msun$. This corresponds, given a typical HOD, to

3139: an extremely small stellar mass $M_{\ast}\lesssim10^{7}\,\msun$, so

3140: is irrelevant for the final mass of all the galaxies considered here, and

3141: is a source of little uncertainty.

3142:

3143: At each timestep, all galaxies that have not yet experienced a major merger

3144: above the resolution limit are initialized according to the halo occupation

3145: model (specifically, we

3146: initialize the galaxy stellar mass $M_{\ast}(M_{\rm halo})$ and

3147: then determine $R_{e}(M_{\ast})$ and $f_{\rm gas}(M_{\ast})$).

3148: As described in the text, we have experimented with a variety of

3149: observational constraints regarding the implementation of the HOD

3150: \citep[see][for a more detailed comparison]{hopkins:groups.qso}. It is possible,

3151: for example, to use the quoted fits from \citet{conroy:monotonic.hod}

3152: at each of several redshifts, and linearly interpolate between each redshift

3153: where the HOD was fitted to apply it to our model.

3154: It is also possible to use the methodology in that

3155: paper and in \citet{valeostriker:monotonic.hod} -- monotonically ranking galaxies

3156: in stellar mass and halos in either mass or circular velocity, and assigning

3157: them to one another in one-to-one correlation -- together with

3158: an analytic redshift-dependent fit to the galaxy stellar mass function

3159: (extending to high redshift), such as that in \citet{fontana:highz.mfs}, to

3160: obtain the HOD at each redshift. In practice, all of these applications yield

3161: similar results -- these observational comparisons and other

3162: direct measurements \citep[see e.g.][]{yan:clf.evolution,cooray:highz,

3163: conroy:monotonic.hod,heymans:mhalo-mgal.evol,

3164: conroy:mhalo-mgal.evol,brown:hod.evol}

3165: imply that the evolution in $M_{\ast}(M_{\rm halo})$ with

3166: redshift is weak (and only significant at the highest masses,

3167: where number densities drop sufficiently rapidly at high redshift

3168: as to make them a negligible contribution to the lower-redshift population).

3169: For this reason, we obtain nearly identical results using the simplest possible

3170: prescription: assuming $M_{\ast}(M_{\rm halo})$ is redshift-independent

3171: and applying the observed $z=0$ relation, assigning each halo

3172: galaxy a stellar mass in Monte Carlo fashion

3173: \citep[compare the redshift-dependent fits in][who reach a similar conclusion]{conroy:hod.vs.z}. Specifically, we consider the fits from \citet{wang:sdss.hod}, for central galaxies

3174: \begin{equation}

3175: M_{\ast} =  M_{1}\,{\Bigl [}(M_{\rm halo}/M_{0})^{-\alpha} + (M_{\rm halo}/M_{0})^{-\beta} {\Bigr ]}^{-1}

3176: \label{eqn:hod.fit}

3177: \end{equation}

3178: with $(M_{0},\,M_{1},\,\alpha,\,\beta)=(3.16\times10^{11}\,h^{-1}\,\msun,\,

3179: 4.48\times10^{10}\,\msun,\,0.39,\,1.96)$ and a

3180: lognormal scatter with $\sigma=0.148$\,dex dispersion at each $M_{\rm halo}$

3181: \footnote{The ``turnover'' in $M_{\ast}(M_{\rm halo})$ in Equation~(\ref{eqn:hod.fit}) mainly

3182: corresponds to quenched, spheroid galaxies, and so one could argue

3183: should not be applied to a sample of strictly un-merged galaxies.

3184: We have experimented with not including this turnover, i.e.\

3185: adopting the low-mass slope of the HOD with $M_{\ast}\propto M_{\rm halo}^{1.96}$,

3186: with a maximum at a stellar mass equal to the halo mass times the

3187: universal baryon fraction. Because very few systems survive to such large

3188: masses without merging, however, this makes no difference to our predictions.}.

3189:

3190: We then assign each disk an effective

3191: radius $R_{e}(M_{\ast}\,|\,z)$

3192: according to the methodology in \S~\ref{sec:model:hod}, where

3193: $R_{e}(M_{\ast}\,|\,z=0)\approx3.4\,(M_{\rm disk}/10^{10}\,\msun)^{0.3}$\,kpc

3194: from \citet{shen:size.mass} (converted

3195: to our adopted cosmology and stellar IMF)

3196: and $R_{e}(M_{\ast}\,|\,z)=(1+z)^{-\beta_{d}}\,R_{e}(M_{\ast}\,|\,z=0)$

3197: as per Equation~(\ref{eqn:rdisk.evol}) (and we consider

3198: both $\beta_{d}=0$ -- no evolution -- and $\beta_{d}=0.4$ -- the

3199: evolution suggested by observations -- showing the

3200: difference between the two in the range of uncertainties in

3201: our model). The gas fractions are assigned in similar fashion,

3202: with the $z=0$ observed relation in Equation~(\ref{eqn:fgas.z0}) and

3203: appropriate redshift evolution in Equation~(\ref{eqn:fgas.z}) (where

3204: we again consider a range to allow for observational uncertainties

3205: as described in the text, specifically $\beta=0.5-2.0$ in

3206: Equation~\ref{eqn:fgas.z}).

3207:

3208: These assignments are re-initialized at each redshift, if

3209: the halo remains un-merged. For this reason (and because at

3210: any redshift, most galaxies of a given mass have assembled

3211: in the last redshift interval $\Delta z\sim1-2$), the large uncertainties

3212: at high redshift are quickly suppressed at any lower redshift.

3213: For example, even though the uncertainty in $R_{e}$ of a typical

3214: disk at high redshift (e.g.\ $z=4$) is large (say, $0.5\,$dex), and this

3215: will enter as such in the uncertainty in predicted size of an

3216: elliptical forming at this redshift (although the uncertainty is

3217: somewhat suppressed since the system is likely to be gas-rich; see

3218: the discussion in \S~\ref{sec:model}), by $z=0$,

3219: the original uncertainty will be suppressed by a typical

3220: factor $\sim16-32$ (depending on how much the galaxy has grown

3221: by low-redshift mergers), introducing only $\sim 0.02-0.03$\,dex

3222: uncertainty in the low-redshift prediction (in other words, at lower

3223: redshifts, the uncertainties are dominated by the uncertainties in

3224: the HOD around those redshifts, rather than by the propagation of

3225: uncertainties from higher redshifts).

3226:

3227: The critical step at each redshift is then determining whether or

3228: not the galaxy experiences a merger.

3229: Because we are interested in galaxy-galaxy mergers,

3230: rather than e.g.\ a subhalo merging into the parent halo

3231: (and the two can be very different, as many subhalos, especially

3232: those which are small relative to the parent halo,

3233: may not merge for a Hubble time), we do not wish to

3234: adopt an extended Press-Schechter or

3235: simulation-based merger tree, but opt for a more sophisticated

3236: approach. At each timestep, we consider the halo to have a subhalo

3237: population according to that fitted in simulations

3238: \citep{kravtsov:subhalo.mfs}

3239: \citep[alternatively, following the fits in][who adopt a semi-analytic

3240: methodology but reach similar conclusions]{vandenbosch:subhalo.mf}

3241: -- this subhalo mass function is, in units of

3242: $M_{\rm subhalo}/M_{\rm halo}$, only weakly halo mass

3243: or redshift-dependent, and it makes no difference whether

3244: we adopt the fits from different authors

3245: \citep[we have also compared the subhalo mass functions in][and reach similar

3246: conclusions]{springel:cluster.subhalos,

3247: tormen:cluster.subhalos,delucia:subhalos,gao:subhalo.mf,

3248: zentner:substructure.sam.hod,nurmi:subhalo.mf}.

3249: We then calculate the time for each such subhalo and

3250: its contained galaxy to merge with the central galaxy, according

3251: to one of several methodologies: either assuming a simple

3252: dynamical friction timescale, employing a more sophisticated

3253: timescale based on the cross section for resonant galaxy-galaxy

3254: interaction and orbital capture, or a

3255: timescale calculated based on a similar calculation in

3256: angular momentum space; the probability of a merger with

3257: each subsystem is then taken as the ratio of the timestep

3258: to the merger timescale. We compare these methodologies

3259: in detail in \citet{hopkins:groups.qso} and \citet{hopkins:groups.ell}

3260: and discuss the details of the derivation for each. For our purposes

3261: here, as demonstrated in those papers, they give similar results,

3262: and we note that the results are also similar to what is commonly

3263: adopted in many semi-analytic models based on N-body simulations,

3264: which follow subhalos until they can no longer be resolved and then

3265: approximate the remaining merger timescale with a dynamical

3266: friction timescale estimate.

3267:

3268: In \citet{hopkins:groups.qso} we show the detailed PDFs for

3269: the merger rate as a function of halo mass, galaxy mass,

3270: and redshift (see specifically Figures~2-4 therein),

3271: that arise from various combinations of these

3272: assumptions. We show there that

3273: the different approaches agree reasonably well, and agree

3274: with independent estimates such as sufficiently high resolution

3275: N-body experiments \citep[e.g.][]{maller:sph.merger.rates}

3276: and semi-analytic models with similar merger timescale

3277: calibrations from numerical tests \citep{somerville:new.sam}.

3278: For our purposes here, the important thing is that

3279: \citet{hopkins:groups.qso} show that the model predictions

3280: agree well with the observed mass function of

3281: galaxy mergers at all redshifts observed; as well as

3282: the observed galaxy merger rate as a function of

3283: environmental density, halo mass (estimated from

3284: clustering or group dynamics), and redshift;

3285: and the clustering (both large-scale and small-scale)

3286: and therefore halo occupation of mergers and

3287: recent merger remnants (see their Figures~6-12).

3288: This gives us some confidence that the calculation yields a

3289: reasonable approximation to the merger rate.

3290: For the details and summary of the adopted calculation,

3291: we refer to \S~2.1 of \citet{hopkins:groups.qso}.

3292: For our fiducial model in this paper, we employ their ``default''

3293: model: we construct the subhalo mass function for each halo

3294: adopting the analytic fits in \citet{kravtsov:subhalo.mfs}.

3295: We populate each subhalo as if it were a random member of

3296: our halo population (of the same mass) -- specifically, we

3297: draw a random halo of the same mass from our Monte Carlo

3298: sample and assign the subhalo the same properties\footnote{

3299: So long as the Monte Carlo sample is sufficiently large,

3300: this is identical to tracing the growth and merger history of the

3301: subhalo itself. As discussed in the text, we have explicitly chosen

3302: to ignore effects unique to satellites such as ram-pressure

3303: stripping. Although these are important for low-mass satellites

3304: which are long-lived in e.g.\ massive groups and clusters,

3305: they do not generally apply to the cases of interest: namely major

3306: mergers, for which the merger times are short (too short for

3307: long-timescale effects such as stripping and harassment) and the halos are of comparable

3308: mass (making e.g.\ ram-pressure relatively unimportant). Moreover,

3309: for the typical $\lesssim$ a few $L_{\ast}$ galaxies of interest here,

3310: the ``subhalo'' phase is very short in major mergers -- it represents a brief

3311: intermediate merging stage between field or small/loose group galaxies

3312: on their way to merging. For more massive cluster systems, however, the

3313: caveat should apply, along with the others discussed in \S~\ref{sec:track}.}.

3314: We then estimate the merger timescale using the ``group capture''

3315: gravitational cross-section timescales, specifically using

3316: the fitting formulae from \citet{krivitsky.kontorovich} calibrated to a

3317: large set of numerical simulations of different encounters in group environments

3318: \citep[see also][]{white:cross.section,makino:merger.cross.sections,mamon:groups.review}.

3319: This ratio of the timestep to this timescale gives the probability of each merger,

3320: and we determine whether each occurs in Monte Carlo fashion.

3321:

3322: When a merger occurs above the cutoff mass ratio threshold (a ``major'' merger),

3323: the properties of the remnant are determined as a function of the

3324: properties of both progenitors, as described in \S~\ref{sec:model:sims}.

3325: The final stellar mass is the sum of the two progenitor baryonic masses;

3326: the two constituent components, the dissipational and dissipationless

3327: stellar mass, of the remnant are the sum of the dissipational and

3328: dissipationless masses of the progenitors. For un-merged

3329: progenitors, the dissipational and dissipationless mass are the

3330: pre-merger gas and stellar mass, respectively. The effective radii of the

3331: remnants are calculated as described in the text, for the dissipationless

3332: component as a function of the mass ratio and scale radii of the progenitors

3333: (Equation~\ref{eqn:re.dry}),

3334: and then with the appropriate correction applied for the dissipational

3335: mass fraction (Equation~\ref{eqn:re.wet}). Other properties follow

3336: as described in \S~\ref{sec:model:sims}. Properties such as the dark matter

3337: fraction within a given radius (for dynamical masses, etc) are calculated

3338: assuming a \citet{hernquist:profile} profile for the dark matter halo, with a

3339: redshift-dependent concentration from \citet{bullock:concentrations}, and

3340: projecting the profile along with that of the galaxy.

3341:

3342: The stellar mass and stellar mass profile of major merger remnants

3343: are taken to be those calculated at the previous merger, although the

3344: halo grows continuously with time. In other words, major merger

3345: remnants are effectively ``quenched'' in some sense (although new gas can and

3346: does come into the galaxy through subsequent mergers with

3347: gas-rich galaxies) -- we ignore details of the re-growth of disks around

3348: early-forming ellipticals through direct cosmological accretion of cold gas.

3349: Although this may be important for especially the most high-redshift

3350: systems, a proper treatment requires a much more complete cosmological

3351: model (ideally highly resolved cosmological simulations that can form

3352: proper low-redshift disks, still a major theoretical challenge). Moreover,

3353: it has been argued that feedback from quasar and starburst activity in

3354: major mergers may actually be an important physical agent of

3355: ``quenching,'' \citep{scannapieco:sam,granato:sam,monaco:feedback,

3356: hopkins:red.galaxies,hopkins:transition.mass,hopkins:clustering,naab:dry.mergers,

3357: bundy:agn.lf.to.mf.evol},

3358: in which case this is a more accurate assumption than allowing for

3359: new cooling.

3360:

3361: In any event, we show in \citet{hopkins:groups.ell} that

3362: such a prescription, with a major merger

3363: mass ratio threshold set to the value used in this paper (1:3),

3364: yields very good agreement with the observed mass function and mass density of

3365: quenched galaxies as a function of redshift, as well as

3366: quenched central galaxy fraction as a bivariate function of

3367: stellar and halo mass, the clustering of red

3368: galaxies as a function of mass and redshift, the distribution of stellar

3369: population ages (and implicitly, ``quenching times'') as a function of

3370: elliptical stellar mass, and the distribution of elliptical structural properties

3371: related to cooling. This argues that the current prescription is at least a

3372: good approximation to the actual cooling histories of the galaxies

3373: of interest here. We have also experimented with alternative

3374: prescriptions such as those found in semi-analytic models

3375: \citep[with halo cooling suppressed by low-level AGN feedback, or

3376: truncated above a certain halo or stellar mass threshold,

3377: see e.g.][]{croton:sam,bower:sam,cattaneo:sam,delucia:sam,somerville:new.sam}, and find

3378: similar results \citep[this is because, it turns out, the halo mass threshold

3379: for the transition to ``hot mode'' accretion and effective quenching in these

3380: models, and empirically constrained stellar/halo mass ranges for

3381: quenching, correspond quite closely with the regime where most galaxies

3382: experience their first major merger; see e.g.][]{cattaneo:sam}.

3383:

3384: This process is repeated until the desired $z_{\rm obs}$ is reached.

3385: We then discard the systems that remain un-merged (as these will not,

3386: predominantly, be elliptical galaxies) and construct the mock observed

3387: sample of interest from our Monte Carlo population.

3388:

3389: \end{appendix}

3390:

3391:

3392: \end{document}

3393:

3394:

3395: