0706.1243/ms.tex
1: % !iTeXMac(typeset): simpdftex latex --keep-psfile ${iTMInput}
2: % !iTeXMac(compile): "./local Command"
3: \documentclass{emulateapj}
4: \usepackage{apjfonts}
5: \usepackage{rotating}
6: \bibliographystyle{apj}
7: %\usepackage{lscape}
8: %\usepackage{natbib}
9: %\usepackage{graphics}
10: %\documentclass[12pt,preprint]{aastex}
11: 
12: 
13: 
14: \newcommand{\beginsidefig}{\begin{sidewaysfigure*}}
15: \newcommand{\xendsidefig}{\end{sidewaysfigure*}}
16: \newcommand{\figexpand}{\epsscale{1.15}}
17: \newcommand{\plotter}{\plotone}
18: 
19: %\newcommand{\beginsidefig}{\begin{figure*}}
20: %\newcommand{\xendsidefig}{\end{figure*}}
21: %\newcommand{\figexpand}{}
22: %\newcommand{\plotter}{\includegraphics[scale=0.60]}
23: 
24: 
25: \newcommand{\etal}{et al.}
26: \newcommand{\mbh}{M_{\rm BH}}
27: \newcommand{\mstar}{M_{\ast}}
28: \newcommand{\lstar}{L_{\ast}}
29: \newcommand{\mdyn}{M_{\rm dyn}}
30: \newcommand{\re}{R_{e}}
31: \newcommand{\vvir}{V_{\rm vir}}
32: \newcommand{\fgas}{f_{\rm gas}}
33: \newcommand{\sersic}{n_{s}}
34: \newcommand{\msun}{M_{\sun}}
35: \newcommand{\tH}{t_{\rm H}}
36: \newcommand{\tmerger}{t_{\rm merger}}
37: \newcommand{\mdotstar}{\dot{M}_{\ast}}
38: \newcommand{\mhalo}{M_{\rm halo}}
39: \newcommand{\mgal}{M_{\rm gal}}
40: \newcommand{\mh}{\mhalo}
41: \newcommand{\mg}{\mgal}
42: \newcommand{\lbol}{L_{\rm bol}}
43: \newcommand{\mmerger}{M_{\rm merger}}
44: \newcommand{\paperone}{Paper \textrm{I}}
45: \newcommand{\papertwo}{Paper \textrm{II}}
46: 
47: 
48: 
49: \shorttitle{Co-Evolution of Quasars, Black Holes, and Galaxies \textrm{I}}
50: \shortauthors{Hopkins \etal}
51: \slugcomment{Submitted to ApJ, June 8, 2007}
52: \begin{document}
53: 
54: \title{A Cosmological Framework for the Co-Evolution of Quasars, Supermassive Black
55: Holes, and Elliptical Galaxies: \textrm{I}. Galaxy Mergers \& Quasar Activity}
56: \author{Philip F. Hopkins\altaffilmark{1}, 
57: Lars Hernquist\altaffilmark{1}, 
58: Thomas J. Cox\altaffilmark{1}, 
59: \&\ Du{\v s}an Kere{\v s}\altaffilmark{1}
60: }
61: \altaffiltext{1}{Harvard-Smithsonian Center for Astrophysics, 
62: 60 Garden Street, Cambridge, MA 02138}
63: 
64: \begin{abstract}
65: We develop a model for the cosmological role of mergers in the evolution of 
66: starbursts, quasars, and spheroidal galaxies. By combining theoretically well-constrained 
67: halo and subhalo mass functions as a function of redshift and 
68: environment with empirical halo occupation models, we can estimate where 
69: galaxies of given properties live at a particular epoch. This allows us to 
70: calculate, in an {\em a priori} cosmological manner, where major galaxy-galaxy 
71: mergers occur and what kinds of galaxies merge, at all redshifts. 
72: We compare this with the observed mass functions, clustering, fractions as a function 
73: of halo and galaxy mass, and small-scale environments of mergers, and show 
74: that this approach yields robust estimates in good agreement with
75: observations, and can be extended to predict detailed properties 
76: of mergers. Making the simple ansatz that 
77: major, gas-rich
78: mergers cause quasar activity (but not strictly assuming they are the only 
79: triggering mechanism), we demonstrate that this model naturally reproduces 
80: the observed rise and fall of the quasar luminosity density from $z=0-6$, as well as
81: quasar luminosity functions, fractions, host galaxy colors, and clustering 
82: as a function of redshift and luminosity.
83: The recent observed excess of quasar clustering on small scales at $z\sim0.2-2.5$ 
84: is a natural prediction of our model, as mergers will preferentially occur in regions 
85: with excess small-scale galaxy overdensities.
86: In fact, we demonstrate that quasar environments at all observed redshifts 
87: correspond closely to the empirically determined small group scale, where
88: major mergers of $\sim L_{\ast}$ gas-rich galaxies will be most efficient. 
89: We contrast this with a secular model in which quasar activity is driven by 
90: bars or other disk instabilities, and show that while these modes of fueling 
91: probably dominate the high-Eddington ratio population at Seyfert luminosities 
92: (significant at $z=0$), 
93: the constraints from quasar clustering, 
94: observed pseudobulge populations, and disk mass functions 
95: suggest that they are a small contributor to the $z\gtrsim1$ quasar luminosity density, 
96: which is dominated by massive BHs in predominantly classical 
97: spheroids formed in mergers. 
98: Similarly, low-luminosity Seyferts do not show a clustering excess on small scales, 
99: in agreement with the natural prediction of secular models, but bright quasars at all redshifts do so. 
100: We also compare recent observations of the colors of quasar host galaxies, and 
101: show that these correspond to the colors of recent 
102: merger remnants, in the transition region between the blue cloud and 
103: the red sequence, and are distinct from the colors of systems with observed 
104: bars or strong disk instabilities. Even the most extreme secular models, 
105: in which all bulge (and therefore BH) formation proceeds via disk instability, 
106: are forced to assume that this instability acts before the (dynamically inevitable) mergers, and 
107: therefore predict a history for the quasar luminosity density which is 
108: shifted to earlier 
109: times, in disagreement with observations. 
110: Our model provides a powerful means to predict the abundance and 
111: nature of mergers, and to contrast cosmologically motivated predictions of 
112: merger products such as starbursts and AGN. 
113: \end{abstract}
114: 
115: \keywords{quasars: general --- galaxies: active --- 
116: galaxies: evolution --- cosmology: theory}
117: 
118: \section{Introduction}
119: \label{sec:intro}
120: 
121: \subsection{Motivation}
122: \label{sec:intro:motives}
123: 
124: 
125: Over the past decade, observations have established that supermassive
126: black holes likely reside in the centers of all galaxies with
127: spheroids \citep[e.g.,][]{KormendyRichstone95,Richstone98,KormendyGebhardt01}, 
128: and that the properties of these black holes
129: and their hosts are correlated.  These correlations take various
130: forms, relating the black hole mass to e.g.\ the mass \citep{magorrian,
131: mclure.dunlop:magorrian,marconihunt,haringrix}, 
132: velocity dispersion \citep{FM00,Gebhardt00,tremaine:msigma}, 
133: and concentration or Sersic index \citep{graham:concentration,graham:sersic} 
134: of the spheroid.  Most recently,
135: \citet{hopkins:bhfp} have demonstrated that these relationships are
136: not independent and can be understood as various projections of a
137: black hole fundamental plane analogous to the fundamental plane
138: for elliptical galaxies \citep{dressler87:fp,dd87:fp}. 
139: The striking similarity between these two fundamental planes
140: indicates that galaxy spheroids and supermassive black holes are not
141: formed independently, but originate via a common physical process.
142: 
143: Furthermore, although there may be some relatively weak evolution in the 
144: correlation between BH mass and host mass or velocity dispersion 
145: owing to changes in spheroid structural properties and 
146: internal correlations with redshift 
147: \citep[e.g.,][]{peng:magorrian.evolution,
148: shields03:msigma.evolution,shields06:msigma.evolution,walter04:z6.msigma.evolution,
149: salviander:msigma.evolution,woo06:lowz.msigma.evolution,hopkins:msigma.limit}, 
150: the fundamental plane appears to be 
151: preserved \citep{hopkins:bhfp}, and in any case {\em some} correlation 
152: exists at all redshifts. There are not, at any redshifts, bulgeless 
153: systems with large black holes or bulges without correspondingly 
154: large black holes. This empirically demonstrates that whatever 
155: process builds up black hole mass {\em must} trace the formation of 
156: spheroids (albeit with potentially redshift-dependent efficiency). 
157: 
158: These connections extend to other phenomena associated with galaxies
159: that have sometimes been interpreted as being independent.  For
160: example, by estimating the total energy radiated by quasars, \citet{soltan82} 
161: showed that nearly all the mass in supermassive black holes
162: must have been accumulated during periods of bright quasar activity.
163: This analysis has since been revisited on a number of occasions
164: \citep{salucci:bhmf,yutremaine:bhmf,marconi:bhmf,shankar:bhmf,yulu:bhmf}, 
165: with various assumptions for
166: quasar obscuration and bolometric corrections. 
167: \citet{hopkins:bol.qlf} 
168: have reformulated the Soltan argument from the evolution of the {\it
169: bolometric} quasar luminosity function (LF).  In their analysis,
170: Hopkins et al. combined observations of the quasar LF in a variety of
171: wavebands with purely empirical determinations of the luminosity
172: dependence of quasar obscuration and spectral emission to
173: infer the bolometric quasar LF.  By integrating this over luminosity
174: and redshift, it is then possible to obtain a {\it model-independent}
175: estimate of the total energy density of radiation from quasars.  The
176: cosmic black hole mass density then follows if black holes in quasars
177: accrete with constant radiative efficiency $\epsilon_{r}$ \citep{shakurasunyaev73}, 
178: by integrating $L_{\rm bol} = \epsilon_{r}\,
179: \dot{M}_{\rm BH}\,c^{2}$.  This yields a $z=0$ black hole mass density of 
180: \begin{equation}
181: \rho_{\rm BH}(z=0) = {4.81}^{+1.24}_{-0.99}\,{\Bigl(}\frac{0.1}
182: {\epsilon_{r}}{\Bigr)}\,h_{70}^{2}\times 10^{5}\,M_{\sun}\,{\rm M
183: pc^{-3}},
184: \end{equation}
185: consistent with estimates of $\rho_{\rm BH}(z=0)$ obtained from
186: local bulge mass, luminosity, and velocity dispersion functions
187: \citep[e.g.,][]{marconi:bhmf,shankar:bhmf}.
188: 
189: Taken together, the black hole fundamental plane
190: and the Soltan argument imply that the common
191: physical process which produces galaxy spheroids and supermassive
192: black holes also must be responsible for triggering {\em most} bright
193: quasars.  Moreover, there is compelling evidence that quasar activity
194: is preceded by a period of intense star formation in galaxy centers so
195: that, for example, ultraluminous infrared galaxies (ULIRGs) and
196: distant submillimeter galaxies (SMGs) would eventually evolve into
197: quasars \citep{sanders88:quasars,sanders88:warm.ulirgs,sanders96:ulirgs.mergers,
198: dasyra:pg.qso.dynamics}. Essentially all sufficiently deep studies of 
199: the spectral energy distributions (SEDs) of
200: quasar host galaxies
201: reveal the presence of young stellar populations 
202: indicative of a recent starburst 
203: \citep{brotherton99:postsb.qso,canalizostockton01:postsb.qso.mergers,
204: kauffmann:qso.hosts,yip:qso.eigenspectra,
205: jahnke:qso.host.sf,jahnke:qso.host.uv,sanchez:qso.host.colors,
206: vandenberk:qso.spectral.decomposition,barthel:qso.host.sf,zakamska:qso.hosts}. 
207: There further appears to be a 
208: correlation in the sense that the most luminous quasars have the youngest 
209: host stellar populations \citep{jahnke:qso.host.sf,vandenberk:qso.spectral.decomposition} and 
210: the greatest prominence of post-merger tidal features and disturbances 
211: \citep{canalizostockton01:postsb.qso.mergers,kauffmann:qso.hosts,
212: hutchings:redqso.lowz,hutchings:highz,hutchings:redqso.midz,
213: zakamska:qso.hosts,letawe:qso.merger.ionization}.  
214: These observations indicate that intense starbursts
215: must result from the same process as 
216: most quasars and supermassive black holes.
217: 
218: In the simplest interpretation, we seek an explanation for the various
219: phenomena summarized above such that they result from the {\it same
220: event}.  There are general, theoretical requirements that any such
221: event must satisfy.  In particular, it must be fast and violent, blend
222: together gas and stellar dynamics appropriately, and involve a supply
223: of mass comparable to that in large galaxies.  Why should this be the
224: case?
225: 
226: The accepted picture for the growth of supermassive black holes
227: is that the mass is primarily assembled by gas accretion \citep{Lynden-Bell69}. 
228: From the Soltan argument, we know that this mass
229: must be gathered in a time comparable to the lifetimes of bright
230: quasars, which is similar to the \citet{salpeter64} time $\sim 10^{7.5}$
231: years, for black holes accreting at the Eddington rate. Independent 
232: limits \citep[][and references therein]{martini04} from quasar 
233: clustering, variability, luminosity function evolution, and other methods 
234: demand a {\em total} quasar lifetime (i.e.\ duration of major growth for 
235: a given BH) of $\lesssim\,10^{8.5}\,{\rm yr}$. 
236: In order to explain the existence of black holes with masses
237: $\sim 10^{9} M_\odot$, the amount of
238: gas required is likely comparable to that contained in entire large
239: galaxies. Thus, the process we seek must be able to deliver
240: a galaxy's worth of gas to the inner regions of a galaxy on a
241: relatively short timescale, $\ll10^{9}$ years.
242: 
243: If this event is to simultaneously build galaxy spheroids, it must
244: involve stellar dynamics acting on a supply of stars similar to that
245: in large galaxies because the stellar mass is $\sim 1000$ times larger 
246: than that of the
247: black hole and it is believed that spheroids are assembled mainly
248: (albeit not entirely) 
249: through dissipationless physics (i.e.\ the movement of stars from 
250: a circular disk to random spheroid orbits).  A plausible candidate process is
251: violent relaxation \citep[e.g.][]{Lynden-Bell67} which has been
252: demonstrated to yield phase space distributions akin to those of
253: elliptical galaxies through large, rapid fluctuations in the
254: gravitational potential.  Violent relaxation operates on a timescale
255: similar to the free-fall time for self-gravitating systems, again
256: $\ll 10^{9}$ years for the bulk of the mass.
257: 
258: Motivated by these considerations, \citet{hopkins:qso.all} developed a
259: model where starbursts, quasars, supermassive black hole growth, and
260: the formation of red, elliptical galaxies are connected through an
261: evolutionary sequence, caused by {\it mergers} between {\it gas-rich}
262: galaxies.  There is, in fact, considerable observational evidence
263: indicating that mergers are responsible for triggering ULIRGs, SMGs,
264: and quasars \citep[see references in Hopkins et al.\ 2006a; for
265: reviews see][]{barneshernquist92, schweizer98,jogee:review}. 
266: Furthermore, the long-standing ``merger
267: hypothesis,'' which proposes that most elliptical galaxies formed in
268: mergers \citep{toomre72,toomre77}, is supported by the
269: structure of known ongoing mergers \citep[e.g.,][]{schweizer92,
270: rothberg.joseph:kinematics,rothberg.joseph:rotation} and the
271: ubiquitous presence of fine structures such as shells, ripples,
272: tidal plumes, nuclear light excesses, and
273: kinematic subsystems in ellipticals \citep[e.g.][]{schweizerseitzer92,
274: schweizer96}, 
275: which are signatures of mergers 
276: \citep[e.g.][]{quinn.84,hernquist.quinn.87,hernquist.spergel.92,
277: hernquist:kinematic.subsystems,mihos:cusps}.
278: 
279: Numerical simulations performed during the past twenty years verify
280: that {\it major} mergers of {\it gas-rich} disk galaxies can plausibly
281: account for these phenomena and have elucidated the underlying physics.
282: Tidal torques excited during a merger lead to rapid inflows of gas
283: into the centers of galaxies \citep{hernquist.89,barnes.hernquist.91,
284: barneshernquist96}. 
285: The amount of gas involved can be a large fraction of
286: that in the progenitor galaxies and is accumulated on roughly a
287: dynamical time in the inner regions, $\ll 10^9$ years \citep{hernquist.89}.
288: The resulting high gas densities trigger starbursts \citep{mihos:starbursts.94,
289: mihos:starbursts.96}, and feed rapid black hole growth \citep{dimatteo:msigma}.
290: Gas consumption by the starburst and dispersal of residual
291: gas by supernova-driven winds and feedback from black hole growth 
292: \citep{springel:red.galaxies} terminate star formation so that the remnant
293: quickly evolves from a blue to a red galaxy.  The stellar component of
294: the progenitors provides the bulk of the material for producing the
295: remnant spheroid \citep{barnes:disk.halo.mergers,barnes:disk.disk.mergers,
296: hernquist:bulgeless.mergers,hernquist:bulge.mergers}
297: through violent relaxation.
298: 
299: The simulations also place significant constraints on the types of
300: mergers that can initiate this sequence of events.  First, a major
301: merger is generally required in order for the tidal forces to excite a
302: sufficiently strong response to set up nuclear inflows of gas.
303: Although simulations involving minor mergers with mass ratios $\sim
304: 10:1$ show that gas inflows can be excited under some circumstances
305: \citep[e.g.][]{hernquist.89,hernquist.mihos:minor.mergers,bournaud:minor.mergers}, a systematic study
306: indicates that such an outcome is limited to specific orbital
307: geometries \citep{younger:minor.mergers} and 
308: that the overall efficiency of triggering inflows declines rapidly 
309: with increasing mass ratio.  Thus, while the precise
310: definition of a major merger in this context is blurred by the
311: degeneracy between the mass ratio of the progenitors and the orbit of
312: the interaction, it appears that a mass ratio $\sim 3:1$ or smaller is
313: needed. 
314: This is further supported by
315: observational studies \citep{dasyra:mass.ratio.conditions,woods:tidal.triggering}, 
316: which find 
317: that strong gas inflows and nuclear starbursts are typically seen
318: only below these mass ratios, despite the much greater frequency of 
319: higher mass-ratio mergers. 
320: 
321: Second, the merging galaxies must contain a supply of {\it cold} gas,
322: which in this context refers to gas that is rotationally supported, in
323: order that the resonant response leading to nuclear inflows of gas in
324: a merger be excited.  Elliptical galaxies contain large quantities of
325: hot, thermally supported gas, but even major mergers between two such
326: objects will not drive the nuclear inflows of gas that fuel rapid
327: black hole growth.
328: 
329: It also must be emphasized that essentially all numerical studies 
330: of spheroid kinematics find that {\em only} mergers 
331: can reproduce the observed kinematic properties of elliptical 
332: galaxies and ``classical'' bulges \citep{hernquist.89,hernquist:bulgeless.mergers,
333: hernquist:bulge.mergers,barnes:disk.halo.mergers,barnes:disk.disk.mergers,
334: schweizer92,naab:boxy.disky.massratio,
335: naab:minor.mergers,naab:gas,naab:dry.mergers,naab:profiles,
336: bournaud:minor.mergers,jesseit:kinematics,cox:kinematics}. 
337: Disk instabilities and
338: secular evolution (e.g.\ bar instabilities, harassment, and other 
339: isolated modes) can indeed produce bulges, but these are invariably 
340: ``pseudobulges'' \citep{schwarz:disk-bar,athanassoula:bar.orbits,
341: pfenniger:bar.dynamics,combes:pseudobulges,
342: raha:bar.instabilities,kuijken:pseudobulges.obs,oniell:bar.obs,athanassoula:peanuts}, 
343: with clearly distinct shapes (e.g.\ flattened or 
344: ``peanut''-shaped isophotes), rotation properties (large $v/\sigma$), 
345: internal correlations (obeying different Kormendy and Faber-Jackson relations), 
346: light profiles (nearly exponential Sersic profiles), and colors and/or 
347: substructure from classical bulges 
348: \citep[for a review, see][]{kormendy.kennicutt:pseudobulge.review}. 
349: Observations indicate that 
350: pseudobulges constitute only a small fraction of the total mass density 
351: in spheroids \citep[$\lesssim10\%$; see][]{allen:bulge-disk,ball:bivariate.lfs,
352: driver:bulge.mfs}, becoming a large fraction of the bulge 
353: population only for small bulges in late-type hosts 
354: \citep[e.g.\ Sb/c, corresponding to typical $\mbh\lesssim10^{7}\,\msun$; see][and 
355: references therein]{carollo98, kormendy.kennicutt:pseudobulge.review}. 
356: Therefore, it is clear that although such processes may be important 
357: for the buildup of the smallest black hole and spheroid 
358: populations, secular evolution {\em cannot} be the agent 
359: responsible for the formation of most 
360: elliptical galaxies, or for the buildup of 
361: most black hole mass, or the triggering of bright quasar activity. 
362: 
363: We are thus led to suggest a generalization of the merger hypothesis
364: proposed by \citet{toomre77} whereby major mergers of {\it gas-rich} disk
365: galaxies represent the dominant process for producing the supermassive
366: black hole and spheroid populations in the Universe.  Then, by the
367: Soltan argument and the association of starbursts with quasars, it
368: follows that this must also be the primary mechanism for triggering
369: the most intense infrared luminous galaxies and the brightest quasars
370: and active galactic nuclei (AGN).  It is important to keep in mind
371: that this does not rule out other processes occurring at lower levels
372: and under other circumstances.  For example, we are not claiming that
373: all AGN result from mergers.  In fact, low levels of such activity, as in
374: Seyfert galaxies, often appear in undisturbed galaxies.  For these
375: objects, other modes of fueling are likely more significant, as in the
376: stochastic accretion scenario of \citet{hopkins:seyferts}.  The primary
377: requirement on our model is that the bulk of the supermassive black
378: hole mass density should have accumulated through gas-rich mergers,
379: consistent with the redshift evolution of the quasar population
380: \citep{hopkins:bol.qlf}.
381: Similarly,
382: spheroid evolution by gas-free (``dry'') mergers will go on, but does
383: not explain how stellar mass is initially moved onto the red sequence 
384: or how black hole mass is initially accreted.
385: 
386: \subsection{Outline}
387: \label{sec:intro:outline}
388: 
389: To test our hypothesis, we have developed methods for following the
390: growth of black holes in numerical simulations of galaxy mergers,
391: using a multiphase model for the star-forming gas that enables us to
392: consider progenitor disks with large gas fractions. Generically, we
393: find that major mergers of gas-rich galaxies evolve through distinct
394: phases that can plausibly be identified with the various observed
395: phenomena summarized above. 
396: 
397: Figure~\ref{fig:outline} presents a 
398: schematic outline of these phases. 
399: In this picture, galactic disks grow mainly
400: in quiescence, with 
401: the possibility of secular-driven bar or pseudobulge
402: formation, until the onset of a major merger. A significant, perhaps
403: even dominant fraction of Seyferts and 
404: low-luminosity quasars will almost certainly arise from this secular 
405: evolution, but the prevalence of pseudobulges only in the 
406: hosts of $\lesssim10^{7}\,\msun$
407: black holes suggests this is limited to luminosities $M_{B}\gtrsim-23$ 
408: (see the discussion in \S~\ref{sec:quasars:secular}). 
409: 
410: %\begin{landscape}
411: %\begin{sidewaysfigure*}
412: \beginsidefig
413: %\begin{figure*}
414:     \centering
415:     \figexpand
416:     %\plotone{outline.ps}
417:     \plotone{f1.ps}
418:     \caption{An schematic 
419:     outline of the phases of growth in a ``typical'' galaxy undergoing a 
420:     gas-rich major merger.
421:     {\em Image Credit:} (a) NOAO/AURA/NSF; (b) REU program/NOAO/AURA/NSF; 
422:     (c) NASA/STScI/ACS Science Team; (d) Optical (left): NASA/STScI/R.\ P.\ van 
423:     der Marel \&\ J.\ Gerssen; X-ray (right): NASA/CXC/MPE/S.\ Komossa et al.; (e) Left: 
424:     J.\ Bahcall/M.\ Disney/NASA; Right: Gemini Observatory/NSF/University of Hawaii 
425:     Institute for Astronomy; (f) J.\ Bahcall/M.\ Disney/NASA; (g) F.\ Schweizer (CIW/DTM); 
426:     (h) NOAO/AURA/NSF.
427:     \label{fig:outline}}
428: %\end{figure*}
429: %\end{sidewaysfigure*}
430: \xendsidefig
431: %\end{landscape}
432: 
433: During the early stages of the merger,
434: tidal torques excite some enhanced star formation 
435: and black hole accretion, but the effect is relatively weak, and the combination 
436: of large galactic dust columns and relatively small nuclear black holes 
437: means that only in rare circumstances (involving particular initial 
438: orbits and/or bulge-to-disk ratios) will the pair be identified as Seyferts or quasars. 
439: Most observationally identified mergers (and essentially all merging pairs) 
440: will be in this stage, and numerical simulations suggest it is the last stage 
441: at which the distinct nuclei enable automated morphological selection 
442: criteria to efficiently 
443: identify the system as a merger \citep{lotz:gini-m20,lotz:merger.selection}. 
444: Care must therefore be taken with conclusions regarding the prevalence of 
445: starbursts and AGN in these samples, as the small observed 
446: incidence of quasar activity \citep{dasyra:mass.ratio.conditions,
447: myers:clustering.smallscale,straughn:tadpoles,alonso:agn.in.pairs} is actually expected. 
448: 
449: During the final coalescence of the galaxies, massive inflows of gas trigger
450: starbursts with strengths similar to those inferred for ULIRGs and
451: SMGs, although the actual mass in stars formed in these bursts is 
452: generally small compared to the stellar mass contributed by the merging disks. 
453: The high gas densities feed rapid black hole growth, but the
454: black holes are obscured at optical wavelengths by gas and dust 
455: and are initially small compared to the newly forming spheroid. However, 
456: by the final stages, high accretion rate, heavily obscured 
457: (and in some cases nearly Compton-thick) 
458: BH growth in a ULIRG stage (often with merging binary BHs) appears ubiquitous 
459: \citep{komossa:ngc6240,alexander:xray.smgs,borys:xray.ulirgs,brand:xray.ir.contrib}, and 
460: by high redshifts ($z\sim2$) may dominate the obscured luminous quasar 
461: population \citep{alexander:bh.growth,stevens:xray.qso.hosts,
462: martinez:host.obscured.qsos,brand:ulirg.qsos}.
463: 
464: Most of the nuclear gas is consumed by the starburst and eventually
465: feedback from supernovae and the black hole begins to disperse the
466: residual gas. This brief transition or ``blowout'' phase will be 
467: particularly associated with highly dust-reddened (as opposed to more 
468: highly obscured Type II) and/or IR-luminous 
469: quasars. As a relatively short phase, such objects 
470: constitute only $\sim20-40\%$ of the quasar population, similar to 
471: that observed \citep{gregg:red.qsos,white:red.qsos,richards:red.qsos,richards:seds,
472: hopkins:dust}. In fact, observational studies find 
473: that red quasar populations are related to mergers, 
474: with $\gtrsim75\%$ (and as high as $100\%$) showing clear evidence of 
475: recent/ongoing merging \citep{hutchings:redqso.lowz,hutchings:redqso.midz,
476: kawakatu:type1.ulirgs,guyon:qso.hosts.ir,urrutia:qso.hosts}, with young post-starburst stellar 
477: populations \citep{guyon:qso.hosts.ir}, much of the dust arising on 
478: scales of the galaxy \citep[in turbulent motions, inflow, and outflow;][]{urrutia:qso.hosts}, 
479: and extremely high Eddington ratios indicative of a 
480: still active period - making them (as opposed to most fully 
481: obscured quasars) a substantial contributor to the most luminous quasars 
482: in the Universe \citep{white:red.qsos,hutchings:redqso.midz,zakamska:qso.hosts}. 
483: As the dust is removed, the black hole is 
484: then visible as a traditional optical quasar (although very small-scale 
485: ``torus'' obscuring structures may remain intact, allowing for 
486: some rare, bright Type II systems). 
487: 
488: Here, observations of the host morphology 
489: are more ambiguous \citep[see e.g.][]{bahcall:qso.hosts,canalizostockton01:postsb.qso.mergers,
490: floyd:qso.hosts,zakamska:qso.hosts,pierce:morphologies}, but this is expected, for two 
491: reasons. First, the point
492: spread function of the bright and unobscured optical quasar must be subtracted 
493: and host galaxy structure recovered, a difficult procedure. Second, 
494: by this time the merger is complete and the spheroid has formed, leaving only fading tidal 
495: tails as evidence for the recent merger. Mock observations constructed from the simulations 
496: \citep{krause:mock.qso.obs}
497: imply that, with the best presently attainable data, these features are difficult to 
498: observe even locally and (for now) nearly impossible to identify at the 
499: redshifts of greatest interest ($z\gtrsim1$). This appears to be borne out, as 
500: \citet{bennert:qso.hosts} have re-examined low-redshift quasars previously recognized from 
501: deep HST imaging as having relaxed spheroid hosts, and found (after 
502: considerably deeper integrations) that every such object shows clear evidence for 
503: a recent merger. These difficulties will lead us to consider a number of 
504: less direct, but more robust tests of the possible association between mergers and quasars. 
505: 
506: Finally, as the remnant relaxes, star formation and quasar activity decline as the
507: gas is consumed and dispersed, and the remaining galaxy resembles an
508: elliptical with a quiescent black hole satisfying observed correlations
509: between black hole and spheroid properties. During this intermediate $\sim$Gyr decay, 
510: depending on details of the 
511: merger and exact viewing time, the remnant may be classified as a low-luminosity 
512: (decaying) AGN in a massive (and relatively young) spheroid, or as a 
513: post-starburst (E+A/K+A) galaxy. Observationally, the link between 
514: K+A galaxies and mergers is well-established 
515: \citep[e.g.][and references therein]{yang:e+a.merger.ell,goto:e+a.merger.connection,hogg:e+a.env}, 
516: and there is a clear tendency for these galaxies to host low-luminosity 
517: AGN or LINERs \citep{yang:e+a.agn.connection,goto:e+a.agn.connection}. 
518: Again, for the reasons given
519: above, the situation is less clear for all low-luminosity AGN (and there 
520: will be, as noted above, many such sources driven by secular 
521: mechanisms in disks). But more importantly 
522: most objects seen in this stage are expected to have relaxed 
523: to resemble normal spheroids. 
524: The merger exhausts gas and star formation in an immediate sense very efficiently, 
525: so the remnant reddens rapidly onto the red sequence. If this is also associated 
526: with quenching of future star formation (see \papertwo), then the 
527: spheroid will evolve passively, growing largely by dry mergers. 
528: 
529: Individual simulations of mergers have enabled us to quantify the
530: duration of these stages of evolution and how this depends on
531: properties of the merging galaxies, such as their masses and gas
532: content and the mass ratio and orbit of the encounter.  In particular,
533: we used the results to suggest a physical interpretation of quasar
534: lifetimes \citep{hopkins:lifetimes.letter}, to examine how quasars 
535: \citep{hopkins:lifetimes.methods} 
536: and starbursts \citep{chakrabarti:SEDs} would evolve in
537: this scenario, and quantify structural properties of the remnant and
538: how they depend on e.g.\ the gas fractions of the merging galaxies 
539: \citep{cox:xray.gas,cox:kinematics,robertson:fp,robertson:msigma.evolution,
540: hopkins:bhfp}.
541: 
542: In addition to making predictions for individual systems, we would
543: also like to characterize how entire {\it populations} of objects
544: would evolve cosmologically in our picture to test the model against
545: the large body of observational data that exists from surveys of
546: galaxies, quasars, and starbursts.  Previously, we have adopted a 
547: semi-empirical approach to this problem, as follows.  In our
548: simulations, we can label the outcome by the final black hole mass in
549: the remnant, $M_{BH,f}$ or, equivalently, the peak bolometric
550: luminosity of the quasar, $L_{peak}$.  Our simulations predict a
551: regular behavior for the evolution of the different merger phases as a
552: function of $M_{BH,f}$ or $L_{peak}$ and also for the properties of
553: the remnant as a function of $M_{BH,f}$ or $L_{peak}$.  If we have an
554: estimate of the observed distribution of systems in one phase of the
555: evolution, we can then use our models to deconvolve the observations
556: to infer the implied birthrate of such objects as a function of
557: $M_{BH,f}$ or $L_{peak}$.  Given this, the time behavior of the
558: simulations provides a mapping between the different phases enabling
559: us to make independent predictions for other populations. 
560: For example, knowing the observed quasar luminosity function (QLF) 
561: at some redshift, 
562: our simulations allow us to predict how many quasar-producing mergers of a given 
563: mass must be occurring at the time, which can then be tested against the  
564: observed merger statistics. 
565: 
566: We exploited this approach to examine the relationship between the
567: abundance of quasars and other manifestations of quasar activity, and
568: showed that our model for quasar lifetimes and lightcurves yields 
569: a means to interpret the shape of the QLF
570: \citep{hopkins:lifetimes.interp}, 
571: provides a consistent explanation for observations of
572: the QLF at optical and X-ray frequencies \citep{hopkins:lifetimes.obscuration},
573: explains observed evolution in the faint-end slope of the QLF \citep{hopkins:faint.slope}, 
574: and can account for the spectral shape of the cosmic
575: X-ray background \citep{hopkins:qso.all,hopkins:bol.qlf}. 
576: Using this technique to map between different types of objects, we
577: demonstrated that the observed evolution and clustering of the quasar
578: population is consistent with observations of red galaxies 
579: \citep{hopkins:red.galaxies,hopkins:clustering,hopkins:old.age} and 
580: merging systems \citep{hopkins:transition.mass,hopkins:merger.lfs}, 
581: as well as the mass function of supermassive black holes
582: and its estimated evolution with redshift \citep{hopkins:qso.all,hopkins:bol.qlf}. 
583: In each case, we found
584: good agreement with observations provided that the mappings were based
585: on the lifetimes and lightcurves from our merger simulations and not
586: idealized ones that have typically been used in earlier theoretical
587: studies.  We further showed that our picture makes numerous
588: predictions \citep{hopkins:transition.mass,hopkins:qso.all} 
589: that can be used to test our
590: hypothesis, such as the luminosity dependence of quasar clustering
591: \citep{lidz:clustering}.  However, the cosmological context of our results
592: was not provided in an entirely theoretical manner because our
593: analysis relied on an empirical estimate of one of the connected
594: populations.
595: 
596: Obtaining a purely theoretical framework for our scenario is difficult
597: because cosmological simulations including gas dynamics currently lack
598: the resolution to describe the small-scale physics associated with
599: disk formation, galaxy mergers, star formation, and black hole growth.
600: Semi-analytic methods avoid
601: some of these limitations, but at the expense of parameterizing the
602: unresolved physics in a manner this is difficult to calibrate
603: independently of observational constraints.  For the time being,
604: neither approach is capable of making an entirely {\it ab initio}
605: prediction for how the various populations we are attempting to
606: model would evolve with time.
607: 
608: In this paper, we describe a strategy that enables us, for the first
609: time, to provide a purely theoretical framework for our picture.  Our
610: procedure is motivated by, but does not rely upon, observations
611: suggesting that there is a characteristic halo mass hosting bright
612: quasars.  This inference follows from measurements of the clustering
613: of quasars in the 2dF, SDSS, and other surveys
614: \citep{porciani2004,porciani:clustering,
615: wake:local.qso.clustering,croom:clustering,coil:agn.clustering,
616: myers:clustering,daangela:clustering,shen:clustering} and 
617: investigations of
618: the quasar proximity effect \citep{faucher:proximity,kim:proximity,guimaraes:proximity}.
619: By adopting simple models for the merger efficiency of galaxies as a
620: function of environment and mass ratio, we show that this
621: characteristic halo mass for quasars corresponds to the most favorable
622: environment for major mergers between gas-rich disks to occur, namely
623: the ``small group'' scale.  This finding argues for an intimate
624: link between such mergers and the triggering of quasar activity and
625: naturally leads to a method for determining the redshift evolution
626: of the quasar population from dark matter simulations of structure
627: formation in a $\Lambda{\rm CDM}$ Universe.
628: 
629: By combining previous estimates of the evolution of the halo mass
630: function with halo occupation models and our estimates for merger
631: timescales, we infer the statistics of mergers that excite quasar
632: activity.  We then graft onto this our modeling of quasar lightcurves
633: and lifetimes, obtained from our simulations of galaxy mergers that
634: include star formation and black hole growth to deduce, in an {\it ab
635: initio} manner, the redshift dependent
636: birthrate of quasars as a function of their peak
637: luminosities and the corresponding formation rate of black holes as a
638: function of mass.  Because our merger simulations relate starbursts,
639: quasars, and red galaxies as different phases of the same events, we
640: can then determine the cosmological formation rate of these various
641: populations and their evolution with redshift.  In particular, as we
642: demonstrate in what follows, the observed abundance of all these
643: objects is well-matched to our estimates, unlike for other theoretical
644: models, supporting our interpretation that mergers between gas-rich
645: galaxies represent the dominant production mechanism for quasars,
646: intense starbursts, supermassive black holes, and elliptical galaxies.
647: 
648: We investigate this in a pair of companion papers. Here (\paperone), 
649: we describe our model and use it to investigate the properties of 
650: mergers and merger-driven quasar activity. 
651: In the companion paper \citep[][henceforth \papertwo]{hopkins:groups.ell}, 
652: we extend our study to the properties of merger remnants and the 
653: formation of the early-type galaxy population.
654: Specifically, 
655: \S~\ref{sec:mergers} outlines our methodology, describing 
656: the physical criteria for and identification of major 
657: mergers (\S~\ref{sec:mergers:criteria}), the distribution of mergers 
658: across different scales and galaxy types (\S~\ref{sec:mergers:scales}), 
659: and the dependence of mergers on environmental properties 
660: (\S~\ref{sec:mergers:env}). We then examine the predicted merger 
661: mass functions, fractions, and clustering properties from this 
662: model, and compare with observations to verify that we are appropriately 
663: modeling the merger history of the Universe (\S~\ref{sec:mergers:populations}).
664: In \S~\ref{sec:quasars} we examine the consequences of 
665: a general model in which mergers trigger quasar activity. 
666: We present a number of robust predictions both independent of 
667: (\S~\ref{sec:quasars:mergers}) and including (\S~\ref{sec:quasars:qlf}) 
668: physical models for the quasar lightcurves and duty cycles in mergers. 
669: We contrast this with a ``secular'' model in which quasar activity 
670: is caused
671: by disk instabilities (\S~\ref{sec:quasars:secular}), and show 
672: that a variety of independent constraints suggest that such a mode cannot dominate 
673: the formation of bright, high redshift quasars. We discuss and summarize our 
674: conclusions in \S~\ref{sec:discussion}. 
675: 
676: Throughout, we adopt a WMAP3 
677: $(\Omega_{\rm M},\,\Omega_{\Lambda},\,h,\,\sigma_{8},\,n_{s})
678: =(0.268,\,0.732,\,0.704,\,0.776,\,0.947)$ cosmology 
679: \citep{spergel:wmap3}, and normalize all observations and models 
680: shown to these parameters.
681: Although the exact choice of 
682: cosmology may systematically 
683: shift the inferred bias and halo masses (primarily scaling with $\sigma_{8}$), 
684: our comparisons (i.e.\ relative biases) are for the most part unchanged, 
685: and repeating our calculations for 
686: a ``concordance'' $(0.3,\,0.7,\,0.7,\,0.9,\,1.0)$ cosmology or 
687: the WMAP1 $(0.27,\,0.73,\,0.71,\,0.84,\,0.96)$ results of \citet{spergel:wmap1}
688: has little effect on our conclusions. 
689: We also adopt a diet Salpeter IMF following \citet{bell:mfs}, and convert all stellar masses 
690: and mass-to-light ratios accordingly. Again, the choice of the IMF systematically 
691: shifts the normalization of stellar masses herein, but does not substantially change 
692: our comparisons. 
693: $UBV$ magnitudes are in the Vega system, and 
694: SDSS $ugriz$ magnitudes are AB.
695: 
696: 
697: 
698: \section{Mergers}
699: \label{sec:mergers}
700: 
701: \subsection{What Determines Whether Galaxies Merge}
702: \label{sec:mergers:criteria}
703: 
704: \subsubsection{Physical processes}
705: \label{sec:mergers:processes}
706: 
707: To begin, we postulate which 
708: mergers are relevant to our picture.
709: Minor mergers (mass ratios $\gg3:1$) will not trigger 
710: significant star formation or quasar activity
711: for most orbits, and consequently will neither exhaust a 
712: large fraction of the larger galaxy's gas supply nor be typically identified as mergers 
713: observationally. 
714: We are therefore specifically interested in major 
715: mergers, with mass ratios $\leq3:1$, but note that our conclusions are unchanged 
716: if, instead of this simple threshold, we include all mergers and adopt some 
717: mass-ratio dependent efficiency 
718: \citep[e.g.\ assuming the fractional BH/bulge growth scales with mass ratio $R$ in 
719: some power-law fashion, $\propto R^{-1}$, as suggested by numerical simulations;][]{younger:minor.mergers}. In this case, the decreasing efficiency of BH fueling in minor 
720: mergers leads (as expected) to the conclusion that they are only 
721: important at low masses/luminosities 
722: (similar to where secular activity may dominate quasar populations; see \S~\ref{sec:quasars:secular}), 
723: and our predictions for massive bulges and BHs are largely unaffected. 
724: If the timescale for two galaxies to merge 
725: is long compared to the Hubble time, they clearly will not have
726: merged in the actual Universe. However, the merger 
727: timescale must also be short compared to the time required to tidally strip or disrupt 
728: either of the galaxies -- if it is not, then by the time the galaxies finally
729: coalesce, the end result
730: will simply be tidal accretion of material at large radii. 
731: 
732: This defines two fundamental criteria for galaxy mergers to occur in the 
733: setting of a halo of mass $\mhalo$:
734: \begin{itemize}
735: \item The halo must host at least two galaxies of comparable mass $\sim\mgal$. Note that 
736: even for mergers of distinct host halos in the field, the halo-halo merger proceeds much 
737: faster than the merger of the galaxies, so there is some period where the two can 
738: be considered distinct substructures or distinct galaxies within a common host. 
739: \item The merger must be efficient -- i.e.\ occur in much less than a Hubble time. This requires 
740: that the mass of the galaxies and their associated (bound) dark matter subhalos 
741: be comparable to the mass of the parent halo (e.g.\ for the simplest dynamical 
742: friction arguments, requiring $\mhalo/\mgal \ll 30$).  
743: \end{itemize}
744: 
745: Together, these criteria naturally define a preferred 
746: mass scale for major mergers (host halo mass $\mhalo$) for 
747: galaxies of mass $\mgal$. A halo of mass $\langle\mhalo\rangle(\mgal)$ typically hosts a galaxy of mass 
748: $\mgal$. At smaller (relative) 
749: halo masses $\mhalo\ll\langle\mhalo\rangle$, the probability that the halo 
750: hosts a galaxy as large as $\mgal$ declines rapidly (and eventually must be zero or else violate 
751: limits from the cosmic baryon fraction). At larger $\mhalo\gg\langle\mhalo\rangle$, the 
752: probability that the halo will merge with or accrete another halo hosting a comparable $\sim\mgal$ 
753: galaxy increases, but the efficiency of the merger of these galaxies declines rapidly. Eventually the 
754: $\mgal$ galaxies are relatively small satellites in a large parent halo of mass 
755: $\mhalo\gg\langle\mhalo\rangle$, for which (satellite-satellite) mergers are extremely 
756: inefficient (given the high virial velocities of the host, and dynamical friction timescales 
757: $\gg \tH$). 
758: 
759: The preferred major-merger scale for galaxies of mass $\mgal$ is therefore only slightly 
760: larger (factor $\sim2$) than the average host halo mass for galaxies of this mass. 
761: We refer to this as the small group scale, and emphasize the term {\em small} in this name: 
762: the average halo of this mass still hosts only 1 galaxy of mass $\sim\mgal$, and 
763: the identifiable groups will only consist of $2-3$ members of similar mass 
764: (although there may of course be several much smaller systems in the group, 
765: which have little dynamical effect). This is very different from 
766: large group scales, easily identified observationally, which consist of $\gg3$ members.  
767: 
768: \begin{figure}
769:     \centering
770:     \figexpand
771:     %\plotone{merger.eff.vs.mhalo.detailed.ps}
772:     \plotter{f2.ps}
773:     \caption{Efficiency of major galaxy mergers (of a certain galaxy mass relative to the 
774:     characteristic local Schechter-function $M_{\ast}$) as a function of host halo mass 
775:     (at $z=0$, but the results are qualitatively similar at all redshifts).
776:     {\em Top:} Merger timescale relative to the Hubble time (assuming a pair of galaxies of mass 
777:     $\mgal$ are hosted in a halo of mass $\mhalo$) -- mergers occur rapidly ($\tmerger\ll\tH$) 
778:     when the halo mass is small relative to the galaxy mass (we temporarily ignore 
779:     the obvious requirement that $\mgal<f_{\rm baryon}\,\mhalo$).
780:     {\em Middle:} Same, but now multiplied by the probability that the halo actually hosts a pair of 
781:     galaxies of the given mass (technically, within a mass ratio $3:1$), given the empirical 
782:     halo occupation model from \citet{wang:sdss.hod}. 
783:     Although mergers are most rapid in the lowest-mass 
784:     halos, these halos do not host relatively massive galaxies. 
785:     {\em Bottom:} Same, but further multiplied by the abundance of halos of a given mass -- 
786:     the fact that the halo mass function and merger efficiency are decreasing functions 
787:     of $\mhalo$ (for fixed $\mgal$) means that the 
788:     contribution to galaxy mergers of a given $\mgal$ will be dominated by the lowest-mass halos 
789:     in which there is a significant probability to accrete/host a pair of $\mgal$ galaxies -- 
790:     the small group scale. 
791:     \label{fig:merger.eff.demo}}
792: \end{figure}
793: Figure~\ref{fig:merger.eff.demo} illustrates several of these points. We adopt the merger 
794: timescales derived below and use the halo occupation fits from \citet{wang:sdss.hod} to 
795: determine the probability of a halo hosting a pair of galaxies of a given mass: 
796: the details of the formalism are described below and used throughout, but we wish to illustrate 
797: the key qualitative points. The merger timescale for galaxies of a given mass is shortest 
798: when they are large relative to their host halo mass, as expected from dynamical friction 
799: considerations. However, the probability of a pair being hosted cuts off sharply at low 
800: halo masses. Moreover, the contribution to mergers of galaxies of mass $\mgal$ 
801: from larger halos is further suppressed by the simple fact that there are fewer halos of 
802: larger masses. 
803: 
804: Modern, high-resolution dark matter-only cosmological simulations \citep[e.g.][]{springel:millenium} 
805: have made it possible to track the merger histories of galaxy halos over large 
806: ranges in cosmic time and halo mass. For our purposes, the critical information 
807: is contained in the subhalo mass function, which has been quantified in great detail  
808: directly from such simulations \citep{kravtsov:subhalo.mfs,gao:subhalo.mf,nurmi:subhalo.mf}
809: and from extended Press-Schechter theory and semi-analytic approaches 
810: \citep{taylor:substructure.evolution,zentner:substructure.sam.hod,vandenbosch:subhalo.mf} 
811: calibrated against numerical simulations. 
812: 
813: When a halo (containing a galaxy and its own 
814: subhalo populations) is accreted, the accretion process is relatively rapid -- the 
815: accreted halo will always be identifiable for {\em some} period of time 
816: as a substructure in the larger halo. Although the new subhalo may lose mass to tidal stripping, 
817: there will still be some dark matter subhalo associated with the accreted galaxy, which 
818: will remain until the substructure merges with the central galaxy via dynamical friction 
819: or (much more rarely) another satellite substructure. Therefore, knowing the 
820: subhalo populations of all halos at a given instant, the calculation of the rate and distribution of 
821: {\em galaxy} mergers depends only on calculating the efficiency 
822: of the subhalo/galaxy mergers within these halos. This is a great advantage -- we 
823: do not need to calculate halo-halo merger rates, which are not well-defined 
824: (even when extracted directly from cosmological simulations) and depend 
825: sensitively on a number of definitions \citep[see, e.g.][]{gottlober:merger.rate.vs.env,
826: maller:sph.merger.rates}, but instead work 
827: from the robust (and well-defined) subhalo mass function 
828: \citep[see][and references therein]{gao:subhalo.mf}. 
829: 
830: This is similar to many 
831: of the most recent semi-analytic models, which adopt a hybrid approach to 
832: determine galaxy mergers, 
833: in which galaxies survive independently so long as their host halo remains a distinct 
834: substructure, after which point a dynamical friction ``clock'' is started and the galaxy merges 
835: with the central galaxy in its parent halo at the end of the dynamical friction time. 
836: Fortunately, for our purposes we are only interested in major mergers with mass 
837: ratios $\lesssim 3:1$. In these cases, dynamical friction acts quickly on the subhalos 
838: (infall time $\lesssim \tH /3$ at all redshifts), and the primary ambiguity will be 
839: the {\em galaxy} merger time in their merged or merging subhalos. 
840: 
841: To perform this calculation, we need to know the properties of the merging galaxies. 
842: For now, we only want to calculate where and when galaxies are merging, not 
843: how they evolved to their present state in the first place. This is our primary reason for 
844: not constructing a full semi-analytic model: rather than introduce a large number of 
845: uncertainties, theoretical prescriptions which we are not attempting to test here, 
846: and tunable parameters in order to predict that e.g.\ a $10^{11}\,\msun$ halo 
847: typically hosts a $\sim10^{10}\,\msun$ star-forming galaxy, we can adopt 
848: the established empirical fact that this is so. In detail, we populate subhalos according 
849: to an empirical halo occupation model \citep[e.g.,][]{tinker:hod,conroy:monotonic.hod,
850: valeostriker:monotonic.hod,vandenbosch:concordance.hod,wang:sdss.hod};
851: i.e.\ matching the observed statistics of 
852: where galaxies of a given type live (accounting for different occupations for 
853: different galaxy types/colors, and the scatter in galaxies hosted in 
854: halos of a given mass). 
855: 
856: This is sufficient for most of our predictions. We do not necessarily need to know 
857: exactly how long it will take for these mergers to occur, only that they are 
858: occurring at a given redshift -- i.e.\ that the objects will merge and that the 
859: merger time is shorter than the Hubble time (which for the mass ratios of interest 
860: is essentially guaranteed). For example, predicting the clustering of galaxy mergers 
861: does not require knowledge of how rapidly they occur, only {\em where} they occur. 
862: Even predicting the observed merger mass function does not rely 
863: sensitively on this information, 
864: since the duration over which the merger is visible will be comparable (albeit 
865: not exactly equal)
866: to the duration over which the merger occurs (such that a fixed fraction $\sim1$ of 
867: all merging systems are observable). 
868: 
869: However, for the cases where it is necessary, 
870: we estimate the timescales for the galaxies to merge and 
871: to be identified as mergers. This is the most uncertain element in our model. 
872: Part of this uncertainty owes to the large parameter space of mergers (e.g.\ differences 
873: in orbital parameters, relative inclinations, etc.). 
874: These uncertainties are fundamental, but can at least be controlled by 
875: comparison to large suites of hydrodynamic simulations which sample these 
876: parameter spaces \citep{robertson:fp} and allow us to quantify the 
877: expected range of merger properties owing to these (essentially random) differences. 
878: The more difficult question is how appropriate any analytic merger timescale or 
879: cross section can be. To address this, we will throughout this paper consider 
880: a few representative models: 
881: 
882: {\em Dynamical Friction:} The simplest approximation is that the 
883: galaxies are point masses, and (once their subhalos merge) they fall 
884: together on the 
885: dynamical friction timescale. This is what is adopted in most semi-analytic 
886: models. In fact, this is only an appropriate description when the galaxies are small 
887: relative to the enclosed halo mass, and are both 
888: moving to the center of the potential well -- which is often not the case at these 
889: late stages. While unlikely to be incorrect by orders of magnitude, 
890: this approximation begins to break down when the galaxies are relatively 
891: large compared to their halos (common in $\lesssim10^{12}\,\msun$ halos) 
892: and when the galaxies are very close (and could e.g.\ enter a stable orbit). What 
893: finally causes galaxies to merge is not, in fact, 
894: simple dynamical friction, but dissipation of angular momentum via a resonance 
895: between the internal and orbital frequencies.
896: 
897: {\em Group Capture (Collisional):} On small scales, 
898: in satellite-satellite mergers, or in the merger 
899: of two small field halos, it is more appropriate to consider galaxy mergers 
900: as a collisional process in which there is some effective gravitational cross section. 
901: In other words, galaxy mergers proceed once the galaxies pass at sufficiently 
902: small distances with sufficiently low relative velocity. There have been a number of 
903: theoretical estimates of these cross sections -- we adopt here the fitting 
904: formulae from \citet{krivitsky.kontorovich}, who calibrate the appropriate 
905: cross-sections from a set of numerical simulations of different encounters and 
906: group environments. This compares well with other calculations \citep[][and 
907: references therein]{white:cross.section,makino:merger.cross.sections,mamon:groups.review}, 
908: and we find little difference using these alternative estimations. For large 
909: mass ratios and separations, 
910: the expressions appropriately reduce to the dynamical friction case.
911: 
912: {\em Angular Momentum:} \citet{binneytremaine} consider this problem 
913: from the perspective of the angular momentum-space in which 
914: galaxy mergers are allowed. This approach is similar to the capture estimates 
915: above, but accounting for capture into orbits as well. Whether or not such 
916: orbits will merge is, of course, somewhat ambiguous -- it is likely that 
917: some significant fraction are stable, and will not merge, while others 
918: decay rapidly owing to resonance between the disk circular frequencies and 
919: the orbital frequency. Nevertheless, this serves to bracket the range of 
920: likely merger configurations. 
921:  
922: 
923: \subsubsection{Synopsis of model and uncertainties}
924: \label{sec:mergers:synopsis}
925: 
926: Thus, to summarize our approach: at a given redshift, we calculate the 
927: halo mass function $n(\mhalo)$ for our adopted cosmology following 
928: \citet{shethtormen}. For each halo, we calculate the 
929: (weakly mass and redshift dependent) subhalo mass function (or distribution of 
930: subhalos, $P[N_{\rm subhalo}\, | \, M_{\rm subhalo},\ \mhalo]$)
931: following \citet{zentner:substructure.sam.hod} 
932: and \citet{kravtsov:subhalo.mfs}. Alternatively, we 
933: have adopted it directly from \citet{gao:subhalo.mf,nurmi:subhalo.mf} or 
934: calculated it following \citet{vandenbosch:subhalo.mf,valeostriker:monotonic.hod}, and 
935: obtain similar results. Note that the subhalo masses are 
936: defined as the masses upon accretion by the parent halo, which 
937: makes them a good proxy for the hosted galaxy mass \citep{conroy:monotonic.hod} 
938: and removes the uncertainties owing to tidal mass stripping. 
939: 
940: Mergers are identified by the basic criteria described above. 
941: We populate these halos and subhalos 
942: with galaxies following the empirical halo occupation models 
943: of \citet{conroy:monotonic.hod} \citep[see also][]{valeostriker:monotonic.hod} normalized directly 
944: with group observations following \citet{wang:sdss.hod} at $z=0$ 
945: \citep[considering instead the occupation fits in][makes little difference]{yang:clf,
946: cooray:highz,cooray:hod.clf,zheng:hod,vandenbosch:concordance.hod}. 
947: This determines both the mean stellar mass and dispersion in stellar masses of 
948: galaxies hosted by a given halo/subhalo mass $P(\mgal\,|\,M_{\rm subhalo})$, 
949: which (optionally) can be broken down 
950: separately for blue and red galaxy types. 
951:  
952: Figure~\ref{fig:merger.eff.mean} shows the mean galaxy mass as a function of 
953: halo mass from this model at $z=0$. Since the halo occupation models 
954: consider stellar mass or luminosity, we use the baryonic and stellar mass 
955: Tully-Fisher relations calibrated by \citet{belldejong:tf} to convert between the two. 
956: (We have also compared the global baryonic mass function estimated in this manner with 
957: that observationally inferred in \citet{bell:baryonic.mf} and find good agreement). 
958: If necessary, we calculate the galaxy-galaxy merger efficiency/timescale 
959: using the different estimators described above. Figure~\ref{fig:merger.eff.mean} 
960: also shows the expected merger efficiency as a function of halo mass 
961: for these mean values (i.e.\ probability of hosting a subhalo within the appropriate 
962: mass range convolved with the calculated merger timescale). The qualitative 
963: features are as expected from Figure~\ref{fig:merger.eff.demo}. 
964: The different merger timescale estimators agree well at large halo masses, 
965: with the dynamical friction treatment yielding a somewhat longer 
966: (factor $\lesssim$ a few) timescale at intermediate masses (but this is near the regime 
967: of low $\mhalo/\mgal$ where the dynamical friction approximation is 
968: least accurate). 
969: 
970: \begin{figure}
971:     \centering
972:     \figexpand
973:     %\plotone{merger.eff.vs.m.allbaryons.ps}
974:     \plotter{f3.ps}
975:     \caption{Illustration of basic elements of importance to where 
976:     galaxy-galaxy mergers occur. {\em Top:} Average central galaxy 
977:     stellar (dotted) and baryonic (solid) mass as a function of host 
978:     halo mass, in our typically adopted halo occupation 
979:     model \citep[][black]{conroy:monotonic.hod,valeostriker:monotonic.hod}, 
980:     and the alternate halo occupation model from
981:     \citet[][green; only baryonic mass shown]{yang:clf}
982:     {\em Middle:} Corresponding halo-to-galaxy mass ratio. 
983:     {\em Bottom:} Average major merger timescale/efficiency (calculated as 
984:     in the middle panel of Figure~\ref{fig:merger.eff.demo}, but for the 
985:     appropriate mean $\mgal(\mhalo)$). Timescales are determined
986:     as described in the text, from dynamical friction (dot-dashed), 
987:     group capture (solid), or angular momentum (long dashed) considerations. 
988:     \label{fig:merger.eff.mean}}
989: \end{figure}
990: 
991: The main elements and their uncertainties in our model are: 
992: 
993: {\bf 1.\ Halo Mass Function:} We begin by computing the overall halo mass function. 
994: There is very little ambiguity in this calculation at all redshifts and masses 
995: of interest \citep[$z\lesssim6$; see e.g.][]{reed:halo.mfs}, and 
996: we do not consider it a significant source of 
997: uncertainty. 
998: 
999: {\bf 2.\ Subhalo Mass Function:} The subhalo mass function of each halo is 
1000: then calculated. Although numerical simulations and semi-analytic 
1001: calculations generally give 
1002: very similar results \citep[especially for the major-merger mass ratios of interest 
1003: in this paper, as opposed to very small subhalo populations; see][]{vandenbosch:subhalo.mf}, 
1004: there is still some (typical factor $<2$) disagreement between different estimates. 
1005: We therefore repeat most of our calculations adopting both 
1006: our ``default'' subhalo mass function calculation 
1007: \citep{zentner:substructure.sam.hod,kravtsov:subhalo.mfs} and an alternative 
1008: subhalo mass function calculation \citep{vandenbosch:subhalo.mf} 
1009: \citep[normalized to match cosmological simulations 
1010: as in][]{shaw:cluster.subhalo.statistics}, which bracket the range 
1011: of a number of different estimates \citep[e.g.,][]{springel:cluster.subhalos,
1012: tormen:cluster.subhalos,delucia:subhalos,gao:subhalo.mf,nurmi:subhalo.mf} 
1013: and demonstrate the uncertainty 
1014: owing to this choice. The difference is ultimately negligible 
1015: at $\mgal\gtrsim10^{10}\,\msun$ at all redshifts, and rises to only a factor $\sim2$ at 
1016: $\mgal\lesssim10^{10}\,\msun$ (probably owing to differences in the 
1017: numerical resolution of different estimates at low halo masses). 
1018: 
1019: {\bf 3.\ Halo Occupation Model:} We then populate the 
1020: central galaxies and ``major'' subhalos with an empirical halo occupation model. 
1021: Although such models are constrained, by definition, to reproduce the mean 
1022: properties of the halos occupied by galaxies of a given mass/luminosity, there 
1023: are known degeneracies between parameterizations that give rise to 
1024: (typical factor $\sim2$) differences between models. We therefore again 
1025: repeat all our calculations for our ``default'' model 
1026: \citep{conroy:monotonic.hod} \citep[see also][]{valeostriker:monotonic.hod} and 
1027: an alternate halo occupation model \citep{yang:clf} \citep[see also][]{yan:clf.evolution,zheng:hod}, which 
1028: bracket the range of a number of calculations 
1029: \citep[e.g.,][]{cooray:highz,cooray:hod.clf,zheng:hod,vandenbosch:concordance.hod}. 
1030: Again, we find this
1031: yields negligible differences 
1032: at $\mgal\gtrsim10^{10}\,\msun$ (as the clustering and abundances 
1033: of massive galaxies are reasonably well-constrained, and most of these 
1034: galaxies are central halo galaxies), and even at low masses the 
1035: typical discrepancy rises to only $\sim0.2\,$dex. 
1036: 
1037: We note that we have also considered a variety of prescriptions for the 
1038: redshift evolution of the halo occupation model: including that 
1039: directly prescribed by the quoted models, a complete re-derivation 
1040: of the HOD models of \citet{conroy:monotonic.hod} and 
1041: \citet{valeostriker:monotonic.hod} 
1042: at different redshifts from the observed mass functions of 
1043: \citet{fontana:highz.mfs,bundy:mfs,borch:mfs,blanton:lfs} (see \S~\ref{sec:quasars:mergers}), 
1044: or simply assuming no evolution (in terms of galaxy mass
1045: distributions at fixed halo mass; for either all galaxies or 
1046: star-forming galaxies). We find that the resulting differences are 
1047: small (at least at $z\lesssim3$), comparable to 
1048: those inherent in the choice of halo occupation model. 
1049: This is not surprising, as a number of recent 
1050: studies suggest that there is very little evolution in halo occupation 
1051: parameters (in terms of mass, or relative to $L_{\ast}$) with 
1052: redshift \citep{yan:clf.evolution,cooray:highz,
1053: conroy:monotonic.hod}, or equivalently that the masses of galaxies hosted in a 
1054: halo of a given mass are primarily a function of that halo mass, not 
1055: of redshift \citep{heymans:mhalo-mgal.evol,
1056: conroy:mhalo-mgal.evol}. This appears to be especially true for 
1057: star-forming and $\sim L_{\ast}$ galaxies \citep[of greatest importance for 
1058: our conclusions;][]{conroy:mhalo-mgal.evol}, unsurprising 
1059: given that ``quenching'' is not strongly operating in those systems to change 
1060: their mass-to-light ratios. 
1061: 
1062: {\bf 4.\ Merger Timescale:} Having populated a given halo and its subhalos 
1063: with galaxies, we then calculate the timescale for mergers between major galaxy 
1064: pairs. This is ultimately the largest source of uncertainty in our calculations, 
1065: at all redshifts and masses. 
1066: Again, we emphasize that some of our calculations are completely 
1067: independent of these timescales. However, where adopted, we illustrate  
1068: this uncertainty by presenting all of our predictions for three estimates of 
1069: the merger timescale: a simple dynamical friction formula, a 
1070: group capture or collisional cross section estimate, and an angular 
1071: momentum (orbital cross section) capture estimate, all
1072: as described above. At large masses 
1073: and redshifts $z\lesssim2.5$, this is a surprisingly weak source of 
1074: uncertainty, but the estimated merger rates/timescales 
1075: can be very different at low masses $\mgal\lesssim 10^{10}\,\msun$ 
1076: and the highest redshifts $z\sim3-6$. 
1077: 
1078: At low masses, this owes 
1079: to a variety of effects, including the substantial difference 
1080: between infall or merger timescales and the timescale for 
1081: morphological disturbances to be excited (different in e.g.\ an 
1082: impact approximation as opposed to the circular orbit decay 
1083: assumed by dynamical friction). 
1084: 
1085: The difference in redshift 
1086: evolution is easily understood: at fixed mass ratio, the 
1087: dynamical friction timescale scales as 
1088: $t_{\rm df}\propto \tH\propto \rho^{-1/2}$, 
1089: but a ``capture'' timescale will scale with fixed cross section as 
1090: $t\propto 1/(n\,\langle\sigma\,v \rangle)\propto \rho^{-1}$, 
1091: so that (while the details of the cross-sections and dependence 
1092: of halo concentration on redshift make the 
1093: difference not quite as extreme as this simple scaling) the very large
1094: densities at 
1095: high redshift make collisional merging increase rapidly in efficiency. 
1096: The true solution is probably some effective 
1097: combination of these two estimates, and the 
1098: ``more appropriate'' approximation 
1099: depends largely on the initial orbital parameters of the subhalos. 
1100: At present, we therefore must recognize this as an inherent 
1101: uncertainty, but one that serves to bracket the likely range of 
1102: possibilities at high redshifts. 
1103: 
1104: 
1105: 
1106: 
1107: \subsection{Where Mergers Occur}
1108: \label{sec:mergers:scales}
1109: 
1110: We are now in a position to predict the statistics of mergers. First, we illustrate some 
1111: important qualitative features. Figure~\ref{fig:merger.eff.centralsat} shows the 
1112: merger efficiency (as in Figure~\ref{fig:merger.eff.demo}) for different classes of 
1113: mergers: major mergers with the central galaxy in a halo, minor mergers with the 
1114: central galaxy, and major mergers of two satellite galaxies in the halo. We show 
1115: the results for our ``default'' model, adopting the dynamical friction merger 
1116: timescale, but the qualitative results are independent of these choices.
1117: The key features 
1118: are expected: major mergers are efficient at small group scales (halo 
1119: masses) comparable to or just larger than the average host halo mass for a given 
1120: $\mgal$. At larger $\mhalo$, major mergers become more rare for the reasons in 
1121: \S~\ref{sec:mergers:criteria}. 
1122: However, although dynamical friction times increase, the rapidly increasing 
1123: number of satellite systems in massive halos means that minor merger accretion onto 
1124: the central galaxy proceeds with a relatively constant efficiency. This will not 
1125: trigger substantial quasar or starburst activity or morphological transformation, but 
1126: may be important for overall mass growth in large cD galaxies, although 
1127: recent cosmological simulations \citep{maller:sph.merger.rates} suggest that 
1128: major mergers dominate minor mergers in the assembly of massive galaxies 
1129: (although their simulation does not extend to the largest cD galaxies). 
1130: 
1131: Satellite-satellite 
1132: minor mergers are a small effect at all masses, as expected (by the time a halo is sufficiently massive 
1133: to host a large number of satellites of a given $\mgal$, the orbital velocity of the 
1134: galaxies about the halo is much larger than their individual internal velocities). 
1135: In what follows, we will generally ignore satellite-satellite mergers. Including them 
1136: is a very small correction (generally $\ll10\%$), and their dynamics are 
1137: uncertain. Moreover, their 
1138: colors and star formation histories are probably affected by processes 
1139: such as tidal stripping, harassment, and ram-pressure stripping, which we 
1140: are neither attempting to model nor test. We have however checked that there 
1141: are no significant or qualitative changes to our predictions if we 
1142: (naively) include the satellite-satellite term.
1143: 
1144: \begin{figure}
1145:     \centering
1146:     \figexpand
1147:     %\plotone{merger.eff.central.satellite.ps}
1148:     \plotter{f4.ps}
1149:     \caption{Merger efficiency (arbitrary units; defined in the same manner as the lower panel 
1150:     of Figure~\ref{fig:merger.eff.demo}, with different linestyles in the same style for various mass 
1151:     galaxies) for different classes of mergers. Using the subhalo mass functions and halo 
1152:     occupation models, we can separate major mergers onto the 
1153:     central galaxy in a halo ({\em top}), 
1154:     minor (mass ratio $>3:1$ but $<10:1$) mergers onto the central galaxy ({\em middle}), 
1155:     and satellite-satellite mergers ({\em bottom}). Major mergers occur efficiently in central galaxies 
1156:     near the small group scale for each $\mgal$. When galaxies live in very massive halos, they 
1157:     experience a large number of minor mergers from the satellite population. Satellite-satellite 
1158:     mergers are a relatively small effect at all galaxy and halo masses. 
1159:     \label{fig:merger.eff.centralsat}}
1160: \end{figure}
1161: 
1162: Although the consequences of the merger will be very different, 
1163: the efficiency with which
1164: two galaxies merge does not depend strongly on whether they 
1165: are star-forming or red/passive (all else being equal). It is therefore a consequence that, 
1166: at low redshifts, gas-rich mergers are generally relegated to low stellar masses and field 
1167: environments where such galaxies are common. Figure~\ref{fig:merger.redblue} 
1168: illustrates this. We plot the mean efficiency of major, central galaxy mergers 
1169: (as in Figure~\ref{fig:merger.eff.centralsat}, but for the mean $\mgal$ at each $\mhalo$) 
1170: as a function of halo mass at each of three redshifts. At each redshift, we divide this into  
1171: the observed fraction of red and blue galaxies at the given galaxy/halo mass, 
1172: using the appropriate observed, type-separated galaxy mass functions. The efficiency of 
1173: mergers at a given halo and galaxy mass 
1174: does not evolve (note that this is {\em not} a statement that the overall 
1175: merger rates will not change, but rather a statement that the same galaxies in 
1176: the same halos will merge at the same rate). However, at low redshifts, red galaxies 
1177: dominate the mass budget, whereas at high redshifts, most galaxies are 
1178: still blue (star-forming) in all but the most massive halos. We will discuss 
1179: the possibility that mergers themselves drive this change in the 
1180: blue and red fractions in \papertwo, but for now illustrate that 
1181: the locations of gas-rich and dry mergers reflect where 
1182: gas-rich and gas-poor galaxies dominate the population, respectively, 
1183: which is empirically determined at the redshifts of interest here. We note 
1184: that our halo occupation models do not explicitly model a dependence of 
1185: halo populations on central galaxy properties; i.e.\ the tentative 
1186: observational suggestion that, at fixed halo and galaxy mass, 
1187: red central galaxies are preferentially 
1188: surrounded by red (as opposed to blue) satellites \citep{weinmann:obs.hod}. If real, 
1189: the effect of such a trend is to make the transition plotted in 
1190: Figure~\ref{fig:merger.eff.centralsat} somewhat sharper -- this has 
1191: little effect on our conclusions, but does somewhat lower 
1192: the predicted gas-rich merger rates (and corresponding predicted 
1193: quasar luminosity density) at $z\lesssim0.5$ (since a red central 
1194: galaxy would have a lower probability of an infalling, gas-rich system). 
1195: 
1196: \begin{figure}
1197:     \centering
1198:     \figexpand
1199:     %\plotone{merger.eff.red.blue.ps}
1200:     \plotter{f5.ps}
1201:     \caption{Merger efficiency (arbitrary units; 
1202:     calculated as in Figure~\ref{fig:merger.eff.demo}) 
1203:     as a function of halo mass (adopting the mean $\mgal(\mhalo)$ from 
1204:     Figure~\ref{fig:merger.eff.mean}). Using the type-separated 
1205:     galaxy mass functions from 
1206:     \citet{bell:mfs,borch:mfs,fontana:mfs} at $z=0,\,1,\,2$, respectively, 
1207:     we show the fraction of galaxies 
1208:     at each mass expected to be gas-rich and gas-poor, at each of 
1209:     three redshifts. At high redshifts, all but the most massive merging galaxies 
1210:     will be gas-rich, whereas at low masses the gas-poor population dominates 
1211:     at most masses where mergers are efficient. 
1212:     \label{fig:merger.redblue}}
1213: \end{figure}
1214: 
1215: Integrating over the appropriate galaxy 
1216: populations, Figure~\ref{fig:merger.fraction.mhalo} compares the predicted $z=0$ 
1217: merger fraction as a function of 
1218: halo mass from this model with that observed. The agreement is good over a wide 
1219: dynamic range. Although there is a significant (factor $\sim2$) systematic difference 
1220: based on how this fraction is calculated, this is within the range of present 
1221: observational uncertainty. It is also important to distinguish the merger fraction of 
1222: parent halos (i.e.\ fraction of groups which contain a merger) and that of 
1223: galaxies (i.e.\ fraction of all galaxies at a given $\mgal$ or $\mhalo$ which 
1224: are merging), as at large halo masses the rate of mergers onto the central galaxy 
1225: could remain constant (giving a constant merger rate per halo), but the inefficient 
1226: merging of the increasingly large number of satellites will cause the 
1227: galaxy merger fraction to fall rapidly. 
1228: 
1229: We also show the distribution of mergers (interacting pairs) and all galaxies 
1230: in environmental density (local projected surface density 
1231: $\Sigma_{5}=5/(\pi\,d_{5}^{2})$, where $d_{5}$ is the distance to the 
1232: fifth nearest-neighbor) from the local group catalogues of \citet{alonso:groups} 
1233: -- we compare this data set directly to our prediction by converting 
1234: $\Sigma_{5}$ to $\mhalo$ using the mean relation from \citet{croton:sam}, 
1235: as in \citet[][]{baldry06:redfrac.vs.m.env} (although as they note, the relation has considerable scatter).
1236: Similarly, we show the post-starburst (generally merger remnant) 
1237: fraction from \citet{hogg:e+a.env} and \citet{goto:e+a.merger.connection}, as a function of 
1238: surface density on large scales. 
1239: 
1240: Our predictions and the observations 
1241: emphasize that galaxy mergers occur on all scales (in halos of all masses), 
1242: and in all environments. In a global sense, there is no preferred merger scale. 
1243: That is not to say that mergers of galaxies of a particular mass do not 
1244: have a preferred scale (indeed, in our modeling, this is explicitly the 
1245: small group scale), but rather because this scale is a function of galaxy mass, 
1246: mergers of {\em some} mass occur in all halo masses and environments. 
1247: It is clear that it is a mistake to think that mergers would not occur in field 
1248: (or even void) environments, a fact which is very important to the formation of 
1249: spheroids and quasars in these locations.
1250: 
1251: 
1252: \begin{figure*}
1253:     \centering
1254:     \figexpand
1255:     %\plotone{merger.fraction.mhalo.ps}
1256:     \plotone{f6.ps}
1257:     \caption{{\em Top:} Merger fraction as a function of host halo mass. The 
1258:     fraction of all halos (groups) predicted to host at least one major merger of 
1259:     galaxy mass $\gtrsim10^{10}\,\msun$ is plotted ({\em left}), 
1260:     as is the fraction of all galaxies in halos of a given $\mhalo$ which are 
1261:     merging ({\em right}). We show the predictions for several variations of 
1262:     our standard model (described in the text) used to identify all merging systems
1263:     (black lines, as labeled), 
1264:     and adding a more detailed calculation of the actual 
1265:     timescale for the physical galaxy mergers (blue lines, as labeled) and 
1266:     ability to morphologically identify them. 
1267:     Both are compared with observed merger fractions 
1268:     (points) from \citet[purple circles][]{alonso:groups} \citep[we convert 
1269:     their measured intermediate-scale densities to average halo masses 
1270:     following][shown as open and filled points, 
1271:     respectively]{baldry06:redfrac.vs.m.env,kauffmann:sf.vs.env}. 
1272:     {\em Bottom:} The observed distributions (fraction of objects per logarithmic interval in 
1273:     galaxy surface density) of merger and normal galaxy 
1274:     environments, from the group catalogues of \citet{alonso:groups} ({\em left}), 
1275:     and the fraction of recent merger remnant (post-starburst, K+A) galaxies 
1276:     as a function of galaxy surface density averaged 
1277:     on intermediate ($1.5\,{\rm Mpc}$) 
1278:     and large ($8$\,Mpc) scales ({\em right}). Mergers occur on all scales 
1279:     and in halos of all masses, without a strong feature at a particular scale.
1280:     \label{fig:merger.fraction.mhalo}}
1281: \end{figure*}
1282: 
1283: 
1284: \subsection{How Mergers Are Influenced By Environment}
1285: \label{sec:mergers:env}
1286: 
1287: Figure~\ref{fig:merger.fraction.mhalo} demonstrates that, 
1288: all else being equal, mergers do not depend on the large scale 
1289: environment. This is conventional wisdom, of course, because 
1290: mergers are an essentially {\em local} process. However, there 
1291: is one sense in which the merger rate should depend on environment. 
1292: If the local density of galaxies (supply of systems for major mergers) 
1293: is enhanced by some factor $1+\delta$, then the probability (or rate) 
1294: of major mergers should be enhanced by the same factor. 
1295: 
1296: In detail, our adopted model for the merger/capture cross section 
1297: of galaxies (\S~\ref{sec:mergers:criteria})
1298: allows us to calculate the differential probability that 
1299: some halo/subhalo or galaxy population at a given distance $r$ 
1300: will merge with the central galaxy in a time $<\tH$. Given the observed 
1301: galaxy-galaxy correlation function as a function of 
1302: stellar mass \citep{li:clustering}, we can trivially calculate the mean number density of 
1303: galaxies (possible fuel for major mergers) in a shell $dr$ at $r$, 
1304: and combining this with the merger rate/cross section calculation 
1305: determines the differential contribution to the total merger 
1306: rate of galaxies of that mass, from pairs at the separation $dr$. 
1307: This can be thought of as either a capture process from 
1308: halo/subhalo orbits, or a global inflow rate from 
1309: dynamical friction and gravitational motions; the results are 
1310: the same, modulo the absolute merger rate normalization 
1311: \citep{binneytremaine,masjedi:merger.rates}. 
1312: Next, assume that the density of these companions is 
1313: multiplied, at this radius, by a factor $1+\delta_{r}$ (relative to the 
1314: mean $\langle(1+\delta_{r})\rangle$ expected 
1315: at that $r$ for the given central halo mass). Integrating over all 
1316: radii, we obtain the total merger rate/probability, with the 
1317: appropriate enhancement. 
1318: 
1319: Figure~\ref{fig:merger.density.dept} illustrates this, 
1320: calculated in several 
1321: radial shells using our gravitational capture cross sections 
1322: to estimate the enhancement (the other cross sections yield 
1323: similar results). The absolute value of the 
1324: probability shown will be a function of galaxy mass, halo mass,
1325: and redshift, but the qualitative behavior is similar. Unsurprisingly, 
1326: density enhancements on small scales ($r\lesssim100\,$kpc, where 
1327: most systems will merge) linearly increase the merger rate 
1328: accordingly. Note that density decrements decrease the merger 
1329: rate only to a point -- this is because even for a galaxy with no companions 
1330: within a $100\,$kpc radius, there is of course some non-zero probability that 
1331: companions will be accreted or captured from initially larger radii and 
1332: merge in $t\ll\tH$. 
1333: 
1334: At larger radii, the enhancement is less pronounced. 
1335: A galaxy in the center of a 
1336: halo of a given mass in a $\sim3\,$Mpc overdensity is not substantially 
1337: more likely to experience a major merger, because there is little contribution 
1338: to its merger rate from those large radii (at least on short timescales; of course, 
1339: over $t\sim\tH$ subhalos may be accreted from these radii, but by then the 
1340: density structure will change and the merger rate will reflect that). 
1341: Naturally, an overdensity at the $\sim3\,$Mpc scale implies an enhanced 
1342: density within that scale. However, we are considering this for 
1343: galaxies and halos of a specific mass, for which the virial radii are generally much smaller 
1344: than these scales, so the increased density in this annulus does not necessarily 
1345: imply an enhanced galaxy density within the halos themselves 
1346: (for that $\mhalo$), although it may affect the overall abundance of the halos. As a 
1347: general rule, merger rates will scale with environmental density on scales less than the 
1348: virial radii of the masses of interest, and be independent of density on larger scales. 
1349: 
1350: \begin{figure}
1351:     \centering
1352:     \figexpand
1353:     %\plotone{density.dept.vs.scale.ps}
1354:     \plotone{f7.ps}
1355:     \caption{Dependence of the merger rate/probability on environmental density 
1356:     decrement/enhancement
1357:     within a given radius $r$; i.e.\ galaxy 
1358:     overdensity $(1+\delta_{r})/\langle(1+\delta_{r})\rangle$ 
1359:     at a fixed galaxy and 
1360:     host halo mass (absolute units are arbitrary here, and depend on these quantities). 
1361:     On scales less than the typical virial radii of interest, the merger rate 
1362:     increases with overdensity (linearly at $\delta_{r}\gg1$), but it is independent 
1363:     (for a fixed halo mass) of large-scale environment. 
1364:     \label{fig:merger.density.dept}}
1365: \end{figure}
1366: 
1367: If the merger rate increases in regions with small-scale overdensities, then 
1368: mergers themselves should be biased to such regions. To the extent 
1369: that the small-scale galaxy overdensity around a merger traces this overdensity 
1370: (which we caution is not {\em necessarily} true, as one of the initial galaxies 
1371: in this overdensity is, by definition, consumed in the merger), 
1372: this implies that mergers and merger remnants should preferentially exhibit 
1373: small-scale density excesses. The magnitude of this excess is straightforward 
1374: to determine: for a given galaxy/halo mass, the distribution of 
1375: environments (densities ($1+\delta_{r}$) on a given scale $r$) is 
1376: known. Then, for each scale $r$, the calculation in Figure~\ref{fig:merger.density.dept} 
1377: gives the relative probability of a merger as a function of overdensity. 
1378: Convolving the probability of any object being in given overdensity with the probability of a 
1379: merger in that overdensity gives the mean overdensity of mergers at that scale, i.e.\ 
1380: \begin{equation} 
1381: %\frac{\langle(1+\delta_{r})_{\rm merger}\rangle}{\langle(1+\delta_{r})_{\rm field}\rangle}
1382: %=\frac{\int{(1+\delta_{r})\,P_{\rm merger}(1+\delta_{r})\,P(1+\delta_{r}\,|\,\mhalo)\,
1383: %{\rm d}(1+\delta_{r})}}{\int{(1+\delta_{r})\,P(1+\delta_{r}\,|\,\mhalo)\,
1384: %{\rm d}(1+\delta_{r})}}, 
1385: \frac{\langle x_{\rm merger}\rangle}{\langle x_{\rm all}\rangle}
1386: =\frac{\int{x\,P_{\rm merger}(x)\,P(x\,|\,\mhalo)\,
1387: {\rm d}x}}{\int{x\,P(x\,|\,\mhalo)\,
1388: {\rm d}x}}, 
1389: \end{equation}
1390: where $x\equiv(1+\delta_{r})$.
1391: 
1392: It is straightforward in extended Press-Schechter theory to calculate of 
1393: the probability of forming a halo of a given mass in a given overdensity 
1394: on a particular scale \citep{mowhite:bias}. However, 
1395: since we are calculating a galaxy overdensity in radii about the 
1396: merger candidate, Poisson noise is 
1397: dominant on small scales where the average number of companions is 
1398: $\lesssim1$ -- nevertheless it is again straightforward to calculate the probability of 
1399: a given overdensity. In any case we account for both effects, and show the 
1400: results in Figure~\ref{fig:excess.clustering.mergers}. 
1401: Specifically, we show the average number of companions within a radius of a 
1402: given $r$ about a merger, for all field galaxies. We then 
1403: multiply the field curve by the calculated 
1404: overdensity of mergers as a function of $r$. The exercise can then be trivially repeated 
1405: for the correlation function $\xi(r)$. We compare with observed post-starburst 
1406: populations (E+A/K+A) galaxies, and find that they display a similar excess on small scales. 
1407: As before, the difference on large scales is negligible -- 
1408: unsurprisingly, the density excess becomes important at $r\lesssim r_{\rm vir}$ for 
1409: the typical galaxies of interest. 
1410: 
1411: Finally, we stress that the excess of companions 
1412: on small scales does {\em not}, in this model, stem from those galaxies themselves having 
1413: any interaction with the central merger (remnant), but reflects a genuine small-scale 
1414: overdensity (as in small groups), in which mergers will be more likely. 
1415: 
1416: \begin{figure}
1417:     \centering
1418:     \figexpand
1419:     %\plotone{excess.companions.ps}
1420:     \plotter{f8.ps}
1421:     \caption{Excess galaxy overdensity on small scales predicted for 
1422:     mergers from our model. Because mergers are more likely when there is a 
1423:     galaxy overdensity on small scales (Figure~\ref{fig:merger.density.dept}), 
1424:     mergers will, on average, occur in regions with slightly enhanced small-scale densities. 
1425:     We show the real-space correlation function ({\em bottom}; technically 
1426:     the merger-galaxy cross correlation function) and corresponding 
1427:     number of companions within a given radius ({\em top}) of all field galaxies
1428:     \citep{goto:e+a.merger.connection}, 
1429:     and then this multiplied by the predicted excess on small scales 
1430:     for mergers (essentially integrating over the probability bias to large overdensity 
1431:     on small scales in Figure~\ref{fig:merger.density.dept}). Dashed blue lines 
1432:     indicate the errors in our estimate from the combination of uncertainties in the field 
1433:     galaxy correlation function, the range of galaxy masses considered (which 
1434:     slightly shifts the physical scale on which the effect is important), and 
1435:     the inclusion/exclusion of Poisson noise in the distribution of overdensities for a 
1436:     given halo mass. The observed number of companions and clustering 
1437:     of post-starburst (likely merger remnant) galaxies is shown for 
1438:     comparison, from \citet[][red circles]{goto:e+a.merger.connection} 
1439:     and \citet[][purple diamonds]{hogg:e+a.env}.
1440:     \label{fig:excess.clustering.mergers}}
1441: \end{figure}
1442: 
1443: 
1444: \subsection{Integrated Merger Populations Over Time}
1445: \label{sec:mergers:populations}
1446: 
1447: At a given redshift, we use our model to predict the mass function of mergers. 
1448: For clarity, we take the mass of a merger to be the total stellar mass of the 
1449: remnant galaxy (roughly the total baryonic mass of the merger 
1450: progenitors). This avoids ambiguity in merger mass ratios, tends to be 
1451: observationally representative (since mergers are generally labeled 
1452: by total luminosity/stellar mass), and has been shown in simulations to 
1453: be a better proxy for the merger behavior than the initial mass of 
1454: either progenitor \citep[as long as it is still a major merger;][]{hopkins:qso.all}. 
1455: 
1456: Figure~\ref{fig:merger.mfs} shows the mass functions of ongoing 
1457: mergers at each of several redshifts. We first consider 
1458: the mass function of ``all'' objects which will merge efficiently -- i.e.\ the mass function of 
1459: merging pairs. This requires no knowledge of the 
1460: actual timescale of the merger or e.g.\ lifetime of tidal disturbances. 
1461: The results agree well with the mass functions and merger fractions 
1462: estimated at all $z\lesssim1.5$, suggesting that our model does 
1463: indeed reasonably describe the true nature of galaxy mergers. For comparison, we 
1464: show the results obtained using 
1465: a different halo occupation model to associate galaxies and 
1466: halos, or using a different set of simulations/models to estimate the subhalo 
1467: mass functions. As noted in \S~\ref{sec:mergers:criteria}, these choices make very little difference 
1468: (considerably smaller than e.g.\ the systematics in the observations). 
1469: 
1470: \begin{figure}
1471:     \centering
1472:     \figexpand
1473:     %\plotone{merger.mfs.ps}
1474:     \plotone{f9.ps}
1475:     \caption{Mass functions (in terms of the remnant stellar mass) of 
1476:     ongoing mergers at each of several redshifts (labeled). Observed mass functions  
1477:     (solid red points) are shown from \citet[][stars]{xu:merger.mf} and \citet[][circles]{bundy:mfs}
1478:     \citep[for a detailed analysis of the mass functions, see][]{hopkins:transition.mass}.
1479:     Error bars do {\em not} include cosmic variance. Observed merger 
1480:     fractions (open orange points), converted to a mass function estimate 
1481:     over the mass range sampled (horizontal errors) are shown 
1482:     from \citet[][cross]{bell:merger.fraction} 
1483:     and \citet[][squares]{lotz:merger.fraction}, with errors including cosmic variance. 
1484:     We compare the prediction of our default model (thick solid black line), for the abundance of 
1485:     mergers and merging pairs. Dotted line employs a different halo occupation 
1486:     model, and dashed line adopts a different 
1487:     fit to the subhalo mass functions (see Figure~\ref{fig:merger.fraction.mhalo} 
1488:     and \S~\ref{sec:mergers:synopsis}).
1489:     We also show the predictions 
1490:     for morphologically identified mergers (thin blue lines), which requires estimating the merger 
1491:     timescale/capture efficiency and duration of morphological disturbances
1492:     (see \S~\ref{sec:mergers:criteria}). 
1493:     We estimate these using a group capture/collisional model (solid), 
1494:     angular momentum capture cross-sections (long dashed), and simple dynamical friction 
1495:     considerations (dotted), calibrating the duration of disturbances from numerical 
1496:     simulations \citep{lotz:merger.selection}. At masses $\gtrsim10^{10}\,\msun$, there is 
1497:     little difference owing to methodology. At very low masses, simulations suggest that the 
1498:     merger timescale (i.e.\ orbital or crossing time after first passage) is considerably longer 
1499:     than the time period over which strong disturbances are excited; however, this is below the 
1500:     mass scales of interest for most of our predictions.     
1501:     \label{fig:merger.mfs}}
1502: \end{figure}
1503: 
1504: It is not always clear, however, that observations capture all merging pairs 
1505: (or that our definition of ``all'' is appropriate as, for some mergers, 
1506: $t_{\rm merger}\rightarrow\tH$). Often, 
1507: systems are identified as mergers on the basis of tidal disturbances and other 
1508: clear morphological signatures. We therefore calculate the mass function of systems 
1509: observed in this manner. This requires that we adopt one of the 
1510: models in \S~\ref{sec:mergers:criteria} 
1511: for the merger timescale, which tells us how long it will characteristically take for a given 
1512: merger to reach the interaction cross section where tidal disturbances will be excited. 
1513: Then, using numerical simulations to estimate the typical duration of those features 
1514: \citep[in which they will be identified by typical morphological classification schemes, see][]
1515: {lotz:merger.selection}, we obtain the observed ``disturbed morphology'' mass functions. 
1516: We perform this calculation using each of the methods for calculating the merger 
1517: timescale described in \S~\ref{sec:mergers:criteria}. Note that the number of systems 
1518: according to this convention
1519: can exceed that in our ``all pairs'' definition if the timescale on which 
1520: disturbances are visible is longer than the ``infall'' timescale or timescale on 
1521: which the subhalo survives (the case for very efficient infall/capture). 
1522: 
1523: At high masses, the difference between samples of merging pairs and 
1524: those of disturbed systems is small, as is the difference between our choice of 
1525: methodology in calculating the merger abundances and/or timescales. This is 
1526: because high-mass systems merge more quickly, excite morphological 
1527: disturbances more easily on first passage, and are brighter (making 
1528: faint morphological features easier to identify). At very low masses 
1529: $\mgal\lesssim10^{10}\,\msun$, our predictions do diverge -- this is because the 
1530: overall infall or merger timescale can become substantially longer than 
1531: the timescale over which morphological disturbances are excited (in these cases, 
1532: this occurs closer to the final coalescence). Although this conclusion 
1533: merits more detailed numerical 
1534: investigation in future work, it has little effect on any 
1535: of our predictions -- for example, the total merger fraction (especially at high redshift) 
1536: is restricted to larger-mass $\mgal\gtrsim\mstar$ systems, where the predictions 
1537: agree well, and the overall merger mass density is nearly identical regardless of 
1538: the methodology. Furthermore, quasar and galaxy formation processes are 
1539: probably influenced (or even dominated) by other mechanisms 
1540: (such as secular disk instabilities and quenching via infall as a 
1541: satellite galaxy) at these low masses, which we do not attempt to model.
1542: 
1543: We next integrate the mass functions in Figure~\ref{fig:merger.mfs} above a given 
1544: mass limit to predict the merger fraction as a function of redshift, shown in 
1545: Figure~\ref{fig:merger.fraction}. The fraction is determined relative to the mass 
1546: functions in \citet{fontana:highz.mfs}, who provide a continuous fit over the range of 
1547: interest. But we note that since this is an integrated quantity, the difference 
1548: adopting other mass function estimates \citep[e.g.][]{borch:mfs} is small
1549: (at least at $z\lesssim1.5$). Comparing 
1550: this to a range of observations, the agreement is good, especially 
1551: for the deeper mass limit. For high mass 
1552: mergers ($\mgal\gtrsim10^{11}\,\msun$) there is greater scatter in the observations, 
1553: which most likely owes to cosmic variance (especially at $z\lesssim0.2$).  In both 
1554: cases, however, the merger fraction is not an especially steep function of 
1555: redshift. In fact, between $z= 0.3-1.5$, the fraction increases by 
1556: only a factor $\sim3-4$, 
1557: consistent with most observations finding a relatively flat merger fraction in this 
1558: range \citep[e.g.][]{lin:merger.fraction,lotz:merger.fraction} and 
1559: recent cosmological simulations \citep{maller:sph.merger.rates}. 
1560: Further, although halos may be merging more frequently at high redshift, they 
1561: are also merging more rapidly, meaning that the fraction merging at any instant 
1562: can be relatively flat. 
1563: 
1564: \begin{figure*}
1565:     \centering
1566:     \figexpand
1567:     %\plotone{merger.fraction.ps}
1568:     \plotone{f10.ps}
1569:     \caption{Predicted merger fraction as a function of redshift (lines, 
1570:     same style as Figure~\ref{fig:merger.mfs}), above two approximate mass 
1571:     limits. Observations (points) are shown from 
1572:     \citet[][filled inverted triangles]{patton:merger.fraction}, \citet[][filled circles]{conselice:merger.fraction}, 
1573:     \citet[][filled triangles]{bundy:merger.fraction}, 
1574:     \citet[][open diamonds]{lin:merger.fraction}, \citet[][open stars]{xu:merger.mf}, 
1575:     \citet[][open circles]{depropris:merger.fraction}, \citet[][filled diamonds]{cassata:merger.fraction}, 
1576:     \citet[][filled stars]{wolf:merger.mf}, \citet[][open triangles]{bundy:mfs},     
1577:     \citet[][open inverted triangles]{lotz:morphology.evol}, \citet[][open squares]{lotz:merger.fraction},
1578:     \citet[][filled squares]{bell:merger.fraction}, and 
1579:     \citet[][$\times$'s]{bridge:merger.fractions}.
1580:     Note that the mass limit 
1581:     is only approximate in several of these cases, as they are selected by optical luminosity. 
1582:     The predicted merger fractions agree well, especially for the deeper case which 
1583:     resolves $\mstar$ galaxies. 
1584:     \label{fig:merger.fraction}}
1585: \end{figure*}
1586: 
1587: Finally, given our model for the halos hosting mergers, it is straightforward to calculate 
1588: the predicted clustering properties of those mergers. Specifically, we have 
1589: already predicted a number density of mergers as a function of halo mass, galaxy mass, and 
1590: redshift; i.e.\ some $n_{\rm merger}(\mgal\,|\,\mhalo,\,z)$. 
1591: Knowing the clustering amplitude or bias of each host halo $b(\mhalo\,|\,z)$, it is straightforward 
1592: to predict the clustering of the merging galaxies, in the same manner by which halo 
1593: occupation models construct the clustering of a given population: 
1594: \begin{equation}
1595: b(\mgal) = \frac{\int{b(\mhalo)\,n_{\rm merger}(\mgal\,|\,\mhalo)\,{\rm d}{\mhalo}}}
1596: {\int{n_{\rm merger}(\mgal\,|\,\mhalo)\,{\rm d}{\mhalo}}}. 
1597: \end{equation}
1598: We calculate $b(\mhalo)$ following \citet{mowhite:bias} as updated 
1599: by \citet{shethtormen} to agree with the results of numerical simulations. 
1600: 
1601: Figure~\ref{fig:merger.clustering} shows this as a function of redshift. Since 
1602: observations generally sample near $\mgal\sim\mstar$, we plot this for 
1603: $\mgal=\mstar(z=0)\approx10^{11}\,\msun$. We compare with available 
1604: clustering measurements for 
1605: likely major-merger populations. At low redshifts, \citet{blake:e+a.clustering} have 
1606: measured the clustering of a large, uniformly selected 
1607: sample of post-starburst (E+A/K+A) galaxies in the 2dF. 
1608: \citet{infante:pair.clustering} have also measured the 
1609: large-scale clustering of close galaxy pairs selected from the SDSS at 
1610: low redshift. At high redshift, no such samples exist, but \citet{blain:smg.clustering} 
1611: have estimated the clustering of a moderately large sample of 
1612: spectroscopically identified sub-millimeter galaxies at $z\sim2-3$, 
1613: which as discussed in \S~\ref{sec:intro} are believed to originate in major mergers. 
1614: Our prediction is consistent with these constraints -- 
1615: however, given the very limited nature of the data and the lack of 
1616: a uniform selection criteria for ongoing or recent mergers at different 
1617: redshifts, we cannot draw any strong conclusions. 
1618: 
1619: \begin{figure}
1620:     \centering
1621:     \figexpand
1622:     %\plotone{merger.bias.vs.z.ps}
1623:     \plotone{f11.ps}
1624:     \caption{Comparing our predicted clustering of $\sim\mstar$ major mergers (lines; 
1625:     style as in Figure~\ref{fig:merger.mfs}) 
1626:     as a function of redshift to that various populations usually associated with 
1627:     galaxy mergers (points): post-starburst (E+A/K+A) galaxies 
1628:     \citep[][star]{blake:e+a.clustering}, 
1629:     close galaxy pairs \citep[][diamond]{infante:pair.clustering}, and 
1630:     sub-millimeter galaxies \citep[][square]{blain:smg.clustering}. 
1631:     \label{fig:merger.clustering}}
1632: \end{figure}
1633: 
1634: One caution should be added: 
1635: recent higher-resolution simulations suggest that the approximation here 
1636: (and in many -- but not all -- halo occupation models), that bias is a function only of 
1637: halo mass at a given redshift, may not be accurate
1638: \citep[e.g.,][]{gao:assembly.bias,harker:marked.correlation.function,wechsler:assembly.bias}. 
1639: In particular, because mergers 
1640: have particularly recent halo assembly times for their post-merger masses, 
1641: they may represent especially biased regions of the density distribution. 
1642: Unfortunately, it is not clear how to treat this in detail, as there remains considerable 
1643: disagreement in the literature as to whether or not a significant ``merger bias'' exists 
1644: \citep[see, e.g.][]{kauffmann:qso.clustering,percival:merger.bias,furlanetto:merger.bias,
1645: lidz:merger.bias}. 
1646: Furthermore the distinction between galaxy-galaxy and 
1647: halo-halo mergers (with the considerably longer timescale for most galaxy mergers) 
1648: means that it is not even clear whether or not, after the galaxy merger, there would be a 
1649: significant age bias. 
1650: 
1651: In any case, most studies suggest the effect is quite small: using 
1652: the fitting formulae from \citet{wechsler:concentration,wechsler:assembly.bias}, 
1653: we find that even in extreme cases 
1654: (e.g.\ a $\mhalo\gg\mstar$ halo merging at $z=0$ as opposed to an average 
1655: assembly redshift $z_{f}\approx6$) the result is that the standard EPS formalism 
1656: underestimates the bias by $\approx30\%$. For the estimated 
1657: characteristic quasar host halo masses 
1658: and redshifts of interest here, the maximal effect is $\lesssim 10\%$ at all $z=0-3$, 
1659: much smaller than other systematic effects we have considered (and 
1660: generally within the range of 
1661: our plotted variant calculations in Figure~\ref{fig:merger.clustering}). 
1662: This is consistent with \citet{gao:assembly.bias} and \citet{croton:assembly.bias} 
1663: who find that assembly bias is only important 
1664: (beyond the $10\%$ level) for the most extreme halos or galaxies in their simulations, 
1665: where for example the clustering 
1666: of small halos which are destined to be 
1667: accreted as substructure in clusters ($\gtrsim 10^{15}\,h^{-1}\,M_{\sun}$) will be 
1668: very different from the clustering of similar-mass halos in field or void environments. 
1669: Indeed, our own calculation in Figure~\ref{fig:excess.clustering.mergers} suggests 
1670: that merger bias applies only on small scales, and that mergers show no preference 
1671: for excess densities on the large scales for which the linear bias description is 
1672: meaningful. 
1673: The effect may grow with redshift, however, so care should be taken in extrapolating 
1674: the predictions in Figure~\ref{fig:merger.clustering} to higher redshifts. 
1675: For further discussion of the effects on the data and predictions shown here, 
1676: we refer to \citet{hopkins:clustering}. 
1677: 
1678: \begin{figure}
1679:     \centering
1680:     \figexpand
1681:     %\plotone{merger.highz.pred.ps}
1682:     \plotter{f12.ps}
1683:     \caption{{\em Top:} As Figure~\ref{fig:merger.fraction}, but 
1684:     extending our predicted merger fractions to high redshift.
1685:     {\em Middle:} Mass flux through mergers (i.e.\ total rate of stellar mass 
1686:     merging). Black points are observed merger fractions converted to an 
1687:     estimated mass flux rate following \citet{hopkins:transition.mass}. 
1688:     Green, red, and blue circle show the observationally inferred 
1689:     mass flux through the ``green valley'' (i.e.\ from blue cloud to red sequence), 
1690:     rate of growth of the red sequence, and rate of mass loss off the 
1691:     blue cloud (respectively), from $z\sim0-1$ \citep{martin:mass.flux} 
1692:     (see \papertwo\ for a more detailed comparison).
1693:     {\em Bottom:} As Figure~\ref{fig:merger.clustering}, but 
1694:     extended to higher redshift. Blue and red lines show the clustering of 
1695:     mergers above the given mass thresholds. 
1696:     \label{fig:merger.highz}}
1697: \end{figure}
1698: 
1699: For the sake of future comparison, we show in Figure~\ref{fig:merger.highz} 
1700: our predictions for the merger fractions and clustering of 
1701: mergers (Figure~\ref{fig:merger.fraction} \&\ \ref{fig:merger.clustering}, 
1702: respectively) at all redshifts $z=0-6$. We note the caveat that 
1703: our merger fraction is defined relative to the mass functions in 
1704: \citet{fontana:highz.mfs}, which become uncertain at high redshifts, 
1705: although this uncertainty is comparable to the differences between 
1706: the methods of calculating the merger timescale (as discussed in 
1707: \S~\ref{sec:mergers:synopsis}). It is also less clear 
1708: what the observable consequences of mergers at 
1709: the highest redshifts may be -- if merger 
1710: rates are sufficiently high, there may be a large number of multiple 
1711: mergers (as in \citet{li:z6.quasar}),
1712: or systems may effectively be so gas rich that merging 
1713: preserves disks and operates as a means of ``clumpy accretion'' 
1714: \citep[e.g.][]{robertson:disk.formation}.
1715: 
1716: Although the estimates differ at the highest redshifts, we stress that their 
1717: integrated consequences at low redshifts $z\lesssim3$ are
1718: similar, as this is where most merging activity and spheroid/BH mass 
1719: buildup occurs. 
1720: We also note that high-redshift mergers are likely to be the most 
1721: massive $\mgal\gg M_{\ast}$ systems, so we show 
1722: our predictions for the clustering of mergers assuming different 
1723: mass limits (as opposed to strictly at $\mgal=M_{\ast}$). We 
1724: also plot the mass flux in mergers, i.e.\ the 
1725: integrated rate at which galaxy baryonic/stellar mass is merged, 
1726: $\int \mgal\,\dot{n}(\mgal)\,{\rm d}\log{\mgal}$. This compares favorably 
1727: with the observationally inferred rates at which mass is moved 
1728: off the blue cloud, through the ``green valley,'' and onto 
1729: the red sequence \citep[from the evolution in galaxy mass functions 
1730: and color-magnitude relations; see][]{martin:mass.flux}, as expected 
1731: in a model where mergers drive such a transition (for details, see 
1732: \papertwo). Future observations of these quantities at high redshift 
1733: will improve the constraints on our halo occupation and 
1734: merger timescale estimates, allowing for more accurate calculations
1735: of e.g.\ quasar triggering and spheroid formation rates at these
1736: epochs.
1737: 
1738: 
1739: \section{Quasars}
1740: \label{sec:quasars}
1741: 
1742: \subsection{Consequences of Merger-Driven Fueling: 
1743: What Determines Where and When Quasars Live}
1744: \label{sec:quasars:mergers}
1745: 
1746: Having developed in \S~\ref{sec:mergers} 
1747: a physically-motivated model of merger rates as a function of 
1748: galaxy and halo mass, environment, and redshift (and tested that this 
1749: model is consistent 
1750: with the existing body of merger observations), we can now extend 
1751: our application. As discussed in \S~\ref{sec:intro}, the argument for an 
1752: association between mergers and quasars has a long history. We therefore 
1753: make the simple ansatz: {\em Every major merger of star-forming/gas-rich galaxies 
1754: triggers a quasar}. 
1755: 
1756: \begin{figure*}
1757:     \centering
1758:     \figexpand
1759:     %\plotone{lum.density.ps}
1760:     \plotone{f13.ps}
1761:     \caption{Predicted quasar luminosity density, if quasars are triggered in mergers, 
1762:     as a function of redshift. {\em Left:} Prediction from a simplified toy model 
1763:     in which all halos hosting $\sim\lstar$ galaxies undergo major mergers near their 
1764:     characteristic small group mass scale, and build a BH which obeys the appropriate 
1765:     $\mbh-\mhalo$ relation for that redshift
1766:     \citep[estimated $\mbh-\mhalo$ as a function of redshift from][corresponding to 
1767:     solid, long dashed, 
1768:     and dot-dashed lines, respectively]{hopkins:clustering,fine:mbh-mhalo.clustering,
1769:     hopkins:bhfp}. 
1770:     Points show observational estimates from 
1771:     the measured QLFs of \citet[][red circles]{ueda03:qlf}, \citet[][blue triangles]{hasinger05:qlf}, 
1772:     \citet[][green diamonds]{richards05:2slaq.qlf}, and the large compilation of 
1773:     multiwavelength QLF data in \citet[][black stars]{hopkins:bol.qlf}. The observations 
1774:     from specific bands are converted to a bolometric luminosity density using the 
1775:     bolometric corrections calibrated in \citet{hopkins:bol.qlf}. 
1776:     {\em Right:} Same, but the predicted luminosity density is calculated properly accounting for all 
1777:     galaxy and halo masses from the merger rate functions determined in \S~\ref{sec:mergers}, and 
1778:     adopting the observed ratio of BH to host galaxy spheroid mass as a function of redshift 
1779:     \citep[e.g.][]{peng:magorrian.evolution}. Linestyles correspond to different 
1780:     means of estimating the exact merger rates, as in Figure~\ref{fig:merger.mfs}. Red lines 
1781:     assume all mergers will trigger quasars, black (lower) lines assume only gas-rich (``wet'') mergers 
1782:     can trigger bright quasar activity (adopting the observed fraction of 
1783:     gas-rich/star-forming/blue galaxies as a function of $\mgal$ and 
1784:     $\mhalo$ as in Figure~\ref{fig:merger.redblue}). 
1785:     A merger-driven model naturally predicts both the rise and fall of the global quasar luminosity density 
1786:     to high precision.
1787:     \label{fig:lum.density}}
1788: \end{figure*}
1789: From this statement, we can make a number of robust predictions. In \S~\ref{sec:mergers} 
1790: we derived the characteristic host halo mass for mergers of $\sim\mstar$ galaxies. 
1791: To the extent that these are gas-rich systems, this should therefore also 
1792: represent the characteristic host halo mass of quasars, and (since the mass density of 
1793: the Universe is dominated by systems near $\sim\mstar$) dominate the buildup of black 
1794: hole mass. 
1795: 
1796: From the \citet{soltan82} argument, the black hole mass density of the Universe 
1797: must be dominated by growth in typical, bright quasar phases with canonical radiative 
1798: efficiency $\epsilon_{r}\sim0.1$. Let us construct the simplest possible model: 
1799: mergers (of $\mstar$ galaxies) characteristically occur at a host halo mass $\sim \mmerger$. 
1800: From the halo mass function, it is straightforward to calculate the rate at which halo mass 
1801: crosses this mass threshold, 
1802: \begin{equation}
1803: \dot{\rho}_{\rm halo} = \bar{\rho}\,\frac{{\rm d}F(>\mhalo)}{{\rm d}t}, 
1804: \end{equation}
1805: where $F(>\mhalo,\,z)$ is the fraction of mass in halos of mass greater than 
1806: $\mhalo$, determined from the Press-Schechter formalism revised following 
1807: \citet{shethtormen}. 
1808: Assume that every such halo undergoes a merger approximately 
1809: upon crossing this mass threshold, which transforms its galaxy from disk to spheroid. The 
1810: hosted BH mass therefore grows from some arbitrarily small amount to the expected mass 
1811: given the BH-host mass relations, which we can write as $\mbh=\nu(z)\,\mhalo$ 
1812: (we distinguish this from $\mbh=\mu(z)\,\mgal$). 
1813: The ratio $\nu(z)$ is determined to $z\sim3$ from the clustering of active BHs 
1814: of a given mass at each redshift 
1815: \citep[see e.g.,][]{daangela:clustering,fine:mbh-mhalo.clustering,
1816: hopkins:clustering,hopkins:bhfp}, and indirectly from determinations of the 
1817: BH host galaxy masses \citep{peng:magorrian.evolution}. 
1818: The total rate at which BH mass is built up is then
1819: \begin{equation}
1820: \dot{\rho}_{\rm BH} = \nu(z)\,\dot{\rho}_{\rm halo} = \nu(z)\,\bar{\rho}\,\frac{{\rm d}F(>\mhalo)}{{\rm d}t}, 
1821: \end{equation}
1822: and the bolometric luminosity density is $j_{\rm bol}=\epsilon_{r}\,\dot{\rho}_{\rm BH}\,c^{2}$. 
1823: Figure~\ref{fig:lum.density} compares this simple estimate with the observed bolometric 
1824: quasar luminosity density as a function of redshift. 
1825: 
1826: The agreement is striking, which suggests that this toy model, such that
1827: the bulk of the 
1828: assembly of BH mass occurs near the transition halo mass, is reasonable. This also 
1829: naturally explains the rise and fall of the quasar luminosity density with time. However, 
1830: this is ultimately just a simple approximation -- we can consider this in greater detail adopting 
1831: our previous estimate of the merger rate as a function of stellar mass and redshift, $\dot{n}(\mgal\,|\,z)$, from \S~\ref{sec:mergers}. Each major merger transforms disks to spheroids, building a BH of 
1832: average mass $\mbh = \mu(z)\,\mgal$. We should properly only consider mergers of 
1833: gas-rich or star-forming systems, as dry mergers will, by definition, not be able to trigger 
1834: quasar activity and form new BH mass. Therefore, we empirically adopt the fraction of 
1835: red and blue galaxies at each $\mgal,\,\mhalo$ (as in \S~\ref{sec:mergers}) to restrict 
1836: only to mergers of blue galaxies. 
1837: Again, $\mu(z)$ has been directly determined from 
1838: observations \citep{peng:magorrian.evolution}, and estimated from theoretical arguments 
1839: \citep{hopkins:bhfp}. For convenience, we adopt the numerical best-fit estimate of 
1840: $\mu(z)$ from \citet{hopkins:bhfp}. A good approximation to this 
1841: numerical function is 
1842: \begin{equation}
1843: \mu(z) \approx 0.0012\,{\Bigl(}\frac{1+z^{5/2}}{1+(z/1.775)^{5/2}}{\Bigr)}, 
1844: \end{equation}
1845: which matches the asymptotic observed values at low and high redshift 
1846: \citep{haringrix,walter04:z6.msigma.evolution}, and captures the observed weak evolution 
1847: to $z\sim1$ and rapid evolution between $z=1-3$ \citep{shields03:msigma.evolution,
1848: peng:magorrian.evolution,salviander:msigma.evolution}. 
1849: Given the merger rate $\dot{n}(\mgal\,|\,z)$, we can then convert this to a cosmic 
1850: rate of formation or build-up of BHs in merger-driven quasars, 
1851: \begin{equation}
1852: \dot{n}(\mbh\,|\,z) = \int{P(\mbh\,|\,\mgal)\,\dot{n}(\mgal\,|\,z)\,{\rm d}\log{\mgal}}. 
1853: \end{equation}
1854: The intrinsic dispersion about the mean BH-host mass relation appears, at all redshifts, to be 
1855: roughly lognormal with width $\approx0.27\,$dex, so we model $P(\mbh\,|\,\mgal)$ as such.  
1856: Once the total rate of formation of BH mass is calculated, the same conversion above 
1857: yields the quasar luminosity density. 
1858: 
1859: Figure~\ref{fig:lum.density} shows the results of this 
1860: more detailed calculation. They are similar to the results from our extremely simplified model -- 
1861: which reflects the fact that most of the mass/luminosity density is contained near 
1862: $\mstar$ or $\lstar$. Note that 
1863: considering all mergers (i.e.\ including dry mergers) overpredicts the quasar luminosity 
1864: density at low redshifts. This demonstrates that the decrease in the quasar luminosity density 
1865: at low redshifts is, in part, driven by the fact that an increasing fraction of massive systems have 
1866: already been transformed to ``red and dead'' systems at late times, and are no longer available to fuel 
1867: quasars, even if they undergo subsequent dry mergers. By $z\sim0$, for example, 
1868: a large fraction ($\sim50\%$) of the mass density in $>M_{\ast}$ systems has already 
1869: been gas-exhausted (discussed in detail in \papertwo), 
1870: and therefore such mergers are no longer a viable fuel supply 
1871: for quasar activity. As discussed in \S~\ref{sec:mergers:scales}, the predicted gas-rich merger 
1872: mass density (and corresponding quasar luminosity density) at $z\lesssim0.5$ will be slightly 
1873: lower if these gas-exhausted systems are preferentially surrounding by 
1874: gas-exhausted satellites (compared to gas-rich central galaxies of the same mass in 
1875: similar halos), but it is clear in Figure~\ref{fig:lum.density} that this is completely 
1876: consistent with the observations (especially if secular processes contribute significantly 
1877: to the quasar luminosity density at low redshifts and luminosities, as we expect from 
1878: our comparisons in \S~\ref{sec:quasars:secular}). 
1879: 
1880: 
1881: 
1882: \begin{figure}
1883:     \centering
1884:     \figexpand
1885:     %\plotone{bhmf.ps}
1886:     \plotone{f14.ps}
1887:     \caption{Predicted BH mass function (BHMF) from gas-rich merger-driven quasar/BH formation 
1888:     (Figure~\ref{fig:lum.density}, right). Results are shown at $z=0$ (black lines; linestyles 
1889:     correspond to different calculations of the merger rates, as in Figure~\ref{fig:merger.mfs}), 
1890:     and $z=1,\,2,\,3$ (blue, green, and red, respectively; for clarity, only our fiducial calculation 
1891:     -- solid line -- is shown, but relative evolution with redshift for each calculation is similar). 
1892:     Yellow (shaded) range shows the $z=0$ observational estimate of the BHMF 
1893:     in \citet{marconi:bhmf}. Integrating forward the 
1894:     merger mass functions as a function of redshift yields a good match to the local BHMF. 
1895:     The effect of dry mergers is included, but is small. 
1896:     \label{fig:bhmf}}
1897: \end{figure}
1898: Having calculated the rate of BH formation as a function of the remnant BH mass, 
1899: $\dot{n}(\mbh\,|\,z)$, it is trivial to integrate this forward and predict the BH mass 
1900: function (BHMF) at any time. Figure~\ref{fig:bhmf} shows the result of this calculation at 
1901: $z=0$, compared to the observationally estimated BHMF. The two agree well at all masses, 
1902: even at very large $\mbh\sim10^{10}\,\msun$. We also show the BHMF at several other redshifts. 
1903: Interestingly, there is a downsizing behavior, where a large 
1904: fraction of the most massive 
1905: BHs are in place by $z=2$, while less massive BHs form later \citep[essentially required by the 
1906: fact that few $\sim10^{9}\,\msun$ BHs are active at low redshift, while a very high fraction are 
1907: active at $z\sim2$, see][]{mclure.dunlop:mbh,kollmeier:eddington.ratios,fine:mbh-mhalo.clustering}. 
1908: If we were to ignore dry mergers at low redshifts, this effect would be even more pronounced, 
1909: but at $z\lesssim1$ their effect is to move some of the BH mass density from lower-mass systems 
1910: into higher mass $\gtrsim10^{9}\,\msun$ systems (at higher redshifts, the effects are negligible). 
1911: It is not obvious, however, that this translates to 
1912: downsizing in galaxy mass assembly, since the ratio of BH to galaxy mass $\mu(z)$ evolves 
1913: with redshift. We will return to this question in \papertwo. 
1914: 
1915: \begin{figure}
1916:     \centering
1917:     \figexpand
1918:     %\plotone{quasar.bias.vs.z.ps}
1919:     \plotone{f15.ps}
1920:     \caption{Predicted quasar clustering as a function of redshift, assuming merger-triggering 
1921:     (black lines, as in Figure~\ref{fig:merger.mfs}), corresponding to the small group scale of 
1922:     $\sim\mstar$ galaxies. Red (upper) shaded range show the prediction if quasars were associated 
1923:     with large group scales, blue (lower) range show the prediction from a secular model in 
1924:     which quasar clustering traces that of star-forming galaxies observed at each redshift (lines show 
1925:     $\pm1\,\sigma$ range estimated from the 
1926:     compiled observations in \citet{hopkins:clustering}, from \citet{shepherd:clustering.by.type,
1927:     giavalisco:lbg.clustering,norberg:clustering.by.lum.type,coil:prelim.clustering,zehavi:local.clustering,
1928:     adelberger:lbg.clustering,allen:lum.dep.lbg.clustering,
1929:     phleps:midz.clustering,meneux:clustering.vs.z,lee:lbg.clustering}). Points show 
1930:     quasar clustering measurements from \citet[][red squares]{croom:clustering}, 
1931:     \citet[][green diamonds]{porciani:clustering}, 
1932:     \citet[][cyan and blue circles]{myers:clustering,myers:clustering.old}, 
1933:     and \citet[][violet stars]{daangela:clustering}. 
1934:     Large black stars 
1935:     show the observed clustering of $z\sim1$ small groups (of $\sim\lstar$ galaxies) 
1936:     from \citet{coil:clustering.groups}, corresponding to the most efficient scales for 
1937:     major $\sim\lstar$ galaxy mergers. Quasar clustering measurements are consistent 
1938:     with the small group scale in which mergers proceed efficiently. 
1939:     \label{fig:quasar.bias}}
1940: \end{figure}
1941: Since we begin our calculation with the halos hosting quasars, we should 
1942: be able to predict the bias of quasars as a function of redshift. As in Figure~\ref{fig:merger.clustering}, 
1943: we use the known clustering of the halos hosting mergers to calculate the 
1944: clustering of those mergers as a function of redshift. Assuming each merger produces a 
1945: quasar of the appropriate mass, this yields the expected clustering of quasars 
1946: as a function of redshift. Figure~\ref{fig:quasar.bias} compares this prediction 
1947: to observed quasar clustering as a function of redshift. Technically, we adopt 
1948: the quasar lightcurve models from \S~\ref{sec:quasars:qlf} below to determine the clustering 
1949: specifically of $\lstar$ quasars (i.e.\ determining the relative contribution to $\lstar$ from 
1950: different host masses and their clustering as in Figure~\ref{fig:merger.clustering}), 
1951: but the result is nearly identical 
1952: to assuming that $\lstar$ quasars trace $\mstar$ mergers (Figure~\ref{fig:merger.clustering}). 
1953: This should 
1954: be true in any model, as long as the quasar lifetime is a smooth function of luminosity or 
1955: host mass. We also compare with the directly observed clustering of 
1956: small groups similar to our definition. 
1957: 
1958: The agreement is quite good at all 
1959: $z\lesssim2$. At higher redshifts, the observations show considerably larger scatter, perhaps 
1960: owing to their no longer being complete near the QLF $\lstar$ -- future observations, sufficiently 
1961: deep to clearly resolve $\lstar$,
1962: are needed to test this in greater detail. We also consider 
1963: the predicted clustering if $\lstar$ quasars were associated with the large group scale of 
1964: $\mstar$ galaxies (for simplicity we take this to be halo masses $\gtrsim5-10$ times larger than 
1965: the small group scale, where 
1966: our halo occupation model predicts of order $\gtrsim3$ satellite $\sim M_{\ast}$ galaxies), 
1967: and the expectation from a secular model, in which quasar clustering 
1968: traces the observed clustering of star-forming galaxies \citep[taken from the observations 
1969: collected in][]{hopkins:clustering}
1970: -- neither agrees with the observations. Note that these estimates may not be 
1971: applicable to the highest-redshift quasar clustering measurements, where flux limits 
1972: allow only the most massive $L\gg L_{\ast}$ systems to be observed 
1973: (but see Figure~\ref{fig:merger.highz} for how the clustering amplitude varies with 
1974: merger masses). 
1975: 
1976: \begin{figure}
1977:     \centering
1978:     \figexpand
1979:     %\plotone{qsos.small.group.scale.ps}
1980:     \plotter{f16.ps}
1981:     \caption{{\em Top:} Characteristic halo mass implied by quasar clustering measurements. 
1982:     Points show the $1\sigma$ allowed range in host halo mass $\mhalo$ corresponding to the quasar
1983:     bias measurements in Figure~\ref{fig:quasar.bias} (in the same style). Shaded magenta 
1984:     regions show the range of halo masses for the corresponding redshift bins in the SDSS 
1985:     \citep{shen:clustering}. The solid line shows 
1986:     the best-fit $\mhalo(z)$ to all observations, with the $1\sigma$ ($2\sigma$) allowed range 
1987:     shaded orange (cyan). {\em Middle:} Shaded range again 
1988:     shows the characteristic host halo mass 
1989:     implied by quasar clustering. Points show the halo mass scale 
1990:     implied by direct measurements of 
1991:     observationally identified small groups (velocity dispersions $\lesssim200\,{\rm km\,s^{-1}}$), 
1992:     from \citet{brough:group.dynamics} at $z\approx0$ (squares), 
1993:     and from from clustering 
1994:     measurements of groups from \citet[][triangles]{eke:groups} 
1995:     and \citet[][stars]{coil:clustering.groups}. 
1996:     {\em Bottom:} Same, but showing the small group halo mass 
1997:     estimated indirectly from the empirically determined halo occupation distribution (HOD). 
1998:     Black inverted triangles adopt the best-fit HOD from \citet{conroy:monotonic.hod} (our 
1999:     default model), other points adopt the 
2000:     methodology of \citet{valeostriker:monotonic.hod} to construct the 
2001:     HOD from various measured 
2002:     galaxy stellar mass functions in 
2003:     \citet[][blue stars]{fontana:highz.mfs}, \citet[][purple squares]{borch:mfs}, 
2004:     \citet[][red circles]{bundy:mfs,bundy:mtrans}, and 
2005:     \citet[][orange triangles]{blanton:lfs}. The characteristic 
2006:     scale of $\sim\lstar$ quasar hosts appears to robustly trace the characteristic small group scale of 
2007:     $\sim\lstar$ galaxies; i.e.\ the mass scale at which galaxy mergers are most efficient. 
2008:     \label{fig:quasar.small.groups}}
2009: \end{figure}
2010: We can invert this, and compare the empirically determined scales of quasar host 
2011: systems with the small group scale which should dominate gas-rich $\sim\lstar$ galaxy mergers. 
2012: Figure~\ref{fig:quasar.small.groups} shows the mean host mass $\mhalo$ which corresponds to 
2013: various quasar clustering measurements (i.e.\ range of $\mhalo$ for which the expected 
2014: quasar bias agrees with the observed $\pm1\,\sigma$ range). We compare this with direct measurements of the halo masses corresponding to small groups of $\sim\mstar$ galaxies, 
2015: determined from both clustering measurements and velocity dispersion measurements of 
2016: observationally identified groups with dispersions $\sigma\lesssim200\,{\rm km\,s^{-1}}$. 
2017: We can also estimate the appropriate small group scale from the halo occupation 
2018: formalism. 
2019: 
2020: Specifically, following the formalism of \citet{conroy:monotonic.hod}, if galaxy luminosity/mass 
2021: is monotonic with subhalo mass (at the time of subhalo accretion), then we can take any 
2022: galaxy mass function, monotonically rank it and match to our halo+subhalo mass functions, 
2023: and obtain a new halo occupation model which predicts a small group scale -- i.e.\ the range 
2024: of halo masses at which satellites of mass $\sim\lstar$ first appear. As discussed in 
2025: \S~\ref{sec:mergers:criteria}, 
2026: the choice of mass functions and how the HOD is constructed makes little difference 
2027: (factor $<2$) to our predictions, so (unsurprisingly) these all yield a similar estimate of the 
2028: small group scale to our default model predictions.
2029: 
2030: At all observed redshifts, 
2031: the scale of $\sim\lstar$ quasars appears to trace the small group scale -- i.e.\ whatever 
2032: mechanism triggers $\sim\lstar$ quasars operates preferentially at 
2033: the characteristic small group scale for $\sim\lstar$ galaxies, where mergers are expected 
2034: to be most efficient. 
2035: 
2036: %\subsection{Dependence on Environment}
2037: %\label{sec:quasars:smallscale.clustering}
2038: 
2039: \begin{figure*}
2040:     \centering
2041:     \figexpand
2042:     %\plotone{qso.small.scale.excess.ps}
2043:     \plotone{f17.ps}
2044:     \caption{Excess small-scale clustering of quasars expected if they are triggered in 
2045:     mergers, as in Figure~\ref{fig:excess.clustering.mergers}. {\em Left:} Observed 
2046:     correlation functions from \citet[][blue circles]{myers:clustering.smallscale} and 
2047:     \citet[][green diamonds]{hennawi:excess.clustering}, measured for $\sim\lstar$ quasars over the 
2048:     redshift ranges $z\sim1-2$. Dashed black line shows 
2049:     the expected correlation function (nonlinear dark matter clustering from 
2050:     \citet{smith:correlation.function}, multiplied by the appropriate constant large-scale bias factor) 
2051:     without a small-scale excess. 
2052:     Red lines multiply this by the predicted additional bias as a function of scale 
2053:     from \S~\ref{sec:mergers:env}, namely the fact that small-scale overdensities increase 
2054:     the probability of mergers. Solid line shows our mean prediction, dashed 
2055:     the approximate $\sim1\sigma$ range, as in Figure~\ref{fig:excess.clustering.mergers}. 
2056:     {\em Center:} Same, but dividing out the best-fit large-scale correlation 
2057:     function (i.e.\ bias as a function of scale). Black squares in upper panel show the 
2058:     measurement for true optical quasars ($-23.3 > M_{i} > -24.2$) from 
2059:     \citet{serber:qso.small.scale.env} at $z\sim0.1-0.5$. {\em Right:} Ratio of the mean 
2060:     bias at small radii ($r < 100\,h^{-1}\,{\rm kpc}$) to that at large radii (the asymptotic values in 
2061:     the center panel), at all redshifts where this has been observed. 
2062:     Lines show the predicted excess from 
2063:     the previous panels (lower line averages down to a minimum radius 
2064:     $r=50\,h^{-1}\,{\rm kpc}$, upper line to a -- potentially unphysical -- minimum 
2065:     $r=10\,h^{-1}\,{\rm kpc}$). 
2066:     \label{fig:excess.clustering.qso}}
2067: \end{figure*}
2068: In \S~\ref{sec:mergers:env}, we demonstrated that the increased probability of mergers 
2069: in regions with 
2070: excess {\em small scale} overdensities means that the typical merger is more likely to 
2071: exhibit an excess of clustering on small scales, relative to average systems of the same halo mass.  
2072: If quasars are triggered in mergers, this should be true as well. We therefore apply 
2073: the identical methodology from Figure~\ref{fig:excess.clustering.mergers} to calculate the 
2074: excess clustering signal expected in active quasars. Figure~\ref{fig:excess.clustering.qso} shows 
2075: the results of this exercise. We adopt the large-scale mean clustering expected from 
2076: \citet{myers:clustering.smallscale}, specifically using the formulae of \citet{smith:correlation.function} 
2077: to model the expected nonlinear correlation function in the absence of any bias, then apply the 
2078: formalism from Figure~\ref{fig:excess.clustering.mergers} to estimate the additional bias as a 
2079: function of scale. Comparing this to observations, the measurements clearly favor an excess 
2080: bias on small scales \citep[$r\lesssim100-200\,h^{-1}\,{\rm kpc}$;][]{hennawi:excess.clustering}, 
2081: similar to our prediction, 
2082: over a constant bias at all scales. This appears to be true at all observed redshifts; the excess 
2083: relative bias we predict at small scales is simply a consequence of how the probability of 
2084: a merger scales with local density, so it does not vary substantially as a function of redshift. 
2085: 
2086: It should be noted that the excess of quasar clustering on small scales might 
2087: also reflect an excess of merging binary quasars, i.e.\ merging systems in which 
2088: the interaction has triggered quasars in each merging counterpart. For the 
2089: reasons given in \S~\ref{sec:intro}, this 
2090: situation is expected to be relatively rare (even if all quasars 
2091: are initially triggered by galaxy mergers), but \citet{myers:clustering.smallscale} 
2092: note that only a small fraction of merging pairs need to excite quasar activity in 
2093: both members in order to explain the observed clustering excess. 
2094: Figure~\ref{fig:excess.clustering.qso} demonstrates that a similar excess 
2095: is observed in both the quasar-quasar autocorrelation function 
2096: \citep{hennawi:excess.clustering,myers:clustering.smallscale} 
2097: and the quasar-galaxy cross-correlation function 
2098: \citep{serber:qso.small.scale.env}, 
2099: arguing that it primarily reflects a genuine preference for 
2100: quasar activity in small-scale overdensities. In any case, 
2101: however, the excess on small scales is a general feature of a merger-driven 
2102: model for quasar activity. Indeed, the predicted excess is also seen in 
2103: high-resolution cosmological simulations \citep{thacker:qso.turnover.small.scale.excess}, 
2104: if quasars are specifically identified with (``attached to'') major mergers.
2105: Secular (bar or disk instability) fueling mechanisms, on 
2106: the other hand, should (by definition) show no clustering excess relative 
2107: to median disk galaxies of the same mass and properties, in contrast to what is 
2108: observed (although in agreement with what is seen for low-luminosity Seyfert galaxies, 
2109: see \S~\ref{sec:quasars:secular}).
2110: 
2111: \subsection{Model-Dependent Predictions: Additional Consequences of Quasar Light Curves}
2112: \label{sec:quasars:qlf}
2113: 
2114: To proceed further, we must 
2115: adopt some estimates for quasar lightcurves and/or lifetimes.  Following
2116: the methodology developed by \citet{springel:multiphase} and 
2117: \citet{springel:models},
2118: \citet{hopkins:qso.all,hopkins:faint.slope} use a large set of several hundred 
2119: hydrodynamical simulations \citep[see][]{robertson:fp} of galaxy mergers, 
2120: varying the relevant physics, galaxy properties, orbits, and system masses, 
2121: to quantify the quasar lifetime (and related statistics) as a
2122: function of the quasar luminosity. They define the quantity $t_{Q}(L\,|\,M_{\rm BH})$, 
2123: i.e.\ the time a quasar of a given BH mass $\mbh$ (equivalently, peak quasar 
2124: luminosity $L_{\rm peak}$) will be observed at a given luminosity $L$. They 
2125: further demonstrate that this quantity is robust across the wide range of varied 
2126: physics and merger properties; for example, to the extent that the final 
2127: BH mass is the same, any major 
2128: merger of sufficient mass ratio (less than $\sim3:1$) will produce an identical effect. 
2129: We adopt these estimates in what follows, and note that while there is still 
2130: considerable uncertainty in a purely empirical determination of the quasar lifetime, 
2131: the model lightcurves are consistent with the present observational constraints 
2132: from variability studies \citep[][and references therein]{martini04}, clustering 
2133: \citep{croom:clustering,adelbergersteidel:lifetimes,
2134: porciani:clustering,myers:clustering,daangela:clustering,shen:clustering}, 
2135: Eddington ratio measurements \citep{mclure.dunlop:mbh,kollmeier:eddington.ratios}, 
2136: active BH mass functions \citep{vestergaard:mbh,fine:mbh-mhalo.clustering,greene:active.mf}, 
2137: and cosmic background measurements \citep{volonteri:xray.counts,hopkins:bol.qlf}. 
2138: 
2139: \begin{figure*}
2140:     \centering
2141:     \figexpand
2142:     %\plotone{qlfs.model.ps}
2143:     \plotone{f18.ps}
2144:     \caption{Predicted quasar luminosity functions, convolving 
2145:     our predicted merger rate functions (Figure~\ref{fig:merger.mfs}; same line styles) with 
2146:     quasar lightcurves from simulations \citep{hopkins:qso.all}. Red lines allow 
2147:     dry mergers to trigger quasar activity as well (leading to an overestimate 
2148:     at low redshifts, as in Figure~\ref{fig:lum.density}). Points show observed 
2149:     bolometric luminosity functions at each redshift, from the compilation of 
2150:     observations in \citet{hopkins:bol.qlf}. QLF measurements derived from 
2151:     observations in the optical, soft X-ray, hard X-ray, mid-IR, and 
2152:     narrow emission lines are shown as green, blue, red, cyan, and orange points, 
2153:     respectively. The merger-driven model naturally 
2154:     predicts the observed shape and evolution of the QLF at all redshifts. 
2155:     \label{fig:qlf}}
2156: \end{figure*}
2157: The quasar luminosity function $\phi(L)$ is given by the convolution over the merger rate (rate of 
2158: formation of BHs of final mass $\mbh$ in mergers) and quasar lifetime (differential 
2159: time spent at luminosity $L$ by a BH of final mass $\mbh$):
2160: \begin{equation}
2161: \label{eqn:qlf.convolution}
2162: \phi(L) = \int t_{Q}(L\,|\,\mbh)\,\dot{n}(\mbh\,|\,z) \,{\rm d}\log{\mbh}. 
2163: \label{eqn:qlf}
2164: \end{equation}
2165: Note this technically assumes $t_{Q}\ll \tH$, but this is true for all 
2166: luminosities and redshifts of interest here. Figure~\ref{fig:qlf} shows this 
2167: prediction at a number of redshifts, compared to the large compilation of 
2168: QLF measurements from \citet{hopkins:bol.qlf}. The agreement 
2169: is surprisingly good at all redshifts. At the most extreme luminosities 
2170: $L_{\rm bol} > 3\times10^{14}\,L_{\sun}$ at each redshift, our predictions 
2171: may begin to fall short of the observed QLF, but this somewhat expected, as these 
2172: luminosities naively imply $>10^{10}\,\msun$ BHs accreting at the 
2173: Eddington limit. It is therefore likely that a full resolution at the most extreme 
2174: luminosities involves either revising the estimate of these bolometric 
2175: luminosities (i.e.\ the bolometric corrections adopted may not be appropriate 
2176: for the most extreme objects, or there may be beaming effects) or 
2177: including processes beyond the scope of our current investigation (e.g.\ 
2178: super-Eddington accretion or multiple mergers in massive BCGs). Nevertheless, 
2179: our simple merger-driven scenario appears to accurately predict the distribution 
2180: and evolution of most quasar activity. 
2181: 
2182: \begin{figure}
2183:     \centering
2184:     \figexpand
2185:     %\plotone{qso.frac.vs.mhalo.ps}
2186:     \plotter{f19.ps}
2187:     \caption{Predicted AGN fraction as a function of host properties. 
2188:     {\em Top:} Low-redshift 
2189:     quasar fraction (defined here by Eddington ratios $\dot{m}>0.1$) 
2190:     as a function of galaxy mass. Black lines show 
2191:     the prediction of our merger-driven model, in the style of 
2192:     Figure~\ref{fig:merger.mfs}. Observed fractions are shown down to 
2193:     (roughly) their completeness limit, from \citet{kauffmann:qso.hosts}.
2194:     {\em Bottom:} Same, but at $z\approx2$, with the AGN fraction determined observationally in 
2195:     LBG \citep{erb:lbg.masses} and $K$-selected \citep{kriek:qso.frac} samples. 
2196:     Some caution should be applied at $\mgal\lesssim10^{10}\,\msun$, as 
2197:     the AGN luminosities become sufficiently low that even moderate star formation 
2198:     will dominate the observed luminosity and systems may not be classified as AGN. 
2199:     \label{fig:active.fraction}}
2200: \end{figure}
2201: Integrating the QLF over the appropriate range, we trivially obtain the active fraction, and 
2202: can calculate this separately for each host mass $\mgal$ or BH mass $\mbh$ in 
2203: Equation~(\ref{eqn:qlf}).
2204: Figure~\ref{fig:active.fraction} compares this to 
2205: observations at both low and high redshift, for 
2206: systems with $\dot{m}\equiv L/L_{\rm Edd} > 0.1$, 
2207: representative of typical Seyfert and quasar populations 
2208: \citep[e.g.][]{mclure.dunlop:mbh}. Note that the quasar lifetime 
2209: integrated above this threshold is close to a constant value
2210: $\lesssim10^{8}$\,yr, similar to observational estimates \citep{martini04}.
2211: At very low masses/levels of activity, other fueling 
2212: mechanisms may be dominant -- for comparison
2213: with e.g.\ the active fractions in \citet{hao:local.lf} of typical $\lesssim10^{7}\,\msun$ BHs 
2214: ($\lesssim10^{10}\,\msun$ hosts), we refer to secular and/or ``stochastic'' accretion 
2215: models in disks \citep[e.g.][]{hopkins:seyferts} and old ellipticals \citep{martini:ell.center.dust}.
2216: Furthermore, at the lowest masses plotted, the typical AGN luminosities 
2217: become extremely faint (typical $M_{B}\gtrsim-18$ in $\mgal\lesssim10^{10}\,\msun$ hosts), 
2218: and so such systems may be more often classified as non-AGN or typical star-forming systems
2219: \citep[e.g.][]{rodighiero:obscured.agn}. 
2220: At high levels of accretion, however, the merger-driven prediction agrees well with 
2221: observations at low and high redshift, and predicts a downsizing trend similar to that 
2222: seen -- namely that from $z=2$ to $z=0$, quasar activity has been particularly suppressed 
2223: in the most massive systems (although it has been suppressed to some extent at all host 
2224: masses), presumably owing to the conversion of these systems to ``red and dead'' spheroids 
2225: without cold gas supplies (see \papertwo).
2226: 
2227: 
2228: \begin{figure*}
2229:     \centering
2230:     \figexpand
2231:     %\plotone{bias.vs.l.ps}
2232:     \plotone{f20.ps}
2233:     \caption{{\em Left:} Predicted bias as a function of quasar luminosity 
2234:     from our merger-driven model (black lines, style as in Figure~\ref{fig:merger.mfs}). 
2235:     To contrast, the expected bias $b(L)$ from the semi-analytic models 
2236:     of \citet[][cyan]{wyitheloeb:sam} and \citet[][orange with diamonds]{kh00} are 
2237:     plotted (dot-dashed lines); these adopt simplified (constant or exponential 
2238:     ``on/off'') quasar lightcurves. Points are measurements from 
2239:     \citet[][red squares]{croom:clustering}, \citet[][orange crosses]{adelbergersteidel:lifetimes}, 
2240:     \citet[][purple diamonds]{porciani:clustering},  
2241:     \citet[][blue circles]{myers:clustering},  
2242:     \citet[][magenta stars]{daangela:clustering}, and 
2243:     \citet[][black open circles]{coil:agn.clustering}. For ease of comparison, all luminosities are 
2244:     converted to bolometric luminosities using the corrections from \citet{hopkins:bol.qlf}. 
2245:     Vertical blue dotted lines show $\lstar$ in 
2246:     the QLF at each redshift, from \citet{hopkins:bol.qlf}.
2247:     {\em Right:} 
2248:     The best-fit slope of the dependence of bias on luminosity at the QLF $\lstar$, i.e.\ 
2249:     ${\rm d}(b/b_{\ast}) / {\rm d}\log{(L/L_{\ast})}$, where $b_{\ast}\equiv b(L_{\ast})$. 
2250:     Points are determined from the observations at left, with the observations 
2251:     from \citet[][cyan circles]{myers:clustering} and 
2252:     \citet[][black open diamond]{grazian:local.qso.clustering,wake:local.qso.clustering}
2253:     added. Lines are in the style of the left panel, with the red dashed line showing 
2254:     no dependence of bias on luminosity. 
2255:     Adopting an a priori model for merger-triggered quasar activity reproduces the 
2256:     empirical prediction from \citet{lidz:clustering}, that quasar bias should depend 
2257:     weakly on quasar luminosity. 
2258:     \label{fig:bias.vs.l}}
2259: \end{figure*}
2260: We next follow \citet{lidz:clustering}, and extend 
2261: Equation~(\ref{eqn:qlf.convolution}) to convolve over the expected 
2262: bias of the active systems at each quasar luminosity $L$, 
2263: \begin{equation}
2264: b(L) = \frac{1}{\phi(L)}\,\int b(\mbh)\,t_{Q}(L\,|\,\mbh)\,\dot{n}(\mbh\,|\,z) \,{\rm d}\log{\mbh},
2265: \end{equation}
2266: where $b(\mbh)$ is determined just as $b(\mgal)$ in 
2267: \S~\ref{sec:mergers:populations}, by convolving 
2268: over the contributions to each merging range in $\mbh$ from all $\mhalo$. 
2269: Figure~\ref{fig:bias.vs.l} plots the expected bias as a function of luminosity 
2270: at each of several redshifts. As originally demonstrated in \citet{lidz:clustering}, 
2271: our model for quasar lightcurves and the underlying triggering rate of quasars 
2272: predicts a relatively weak dependence of clustering on quasar luminosity. 
2273: Here, we essentially re-derive this result with an {\em a priori} prediction of 
2274: these triggering rates, as opposed to the purely empirical (fitted to the QLF) rates 
2275: from \citet{lidz:clustering}, and find that the conclusion is robust. However, this 
2276: prediction is not necessarily a consequence of merger-driven models 
2277: (nor is it unique to them) -- 
2278: we show the predictions from the semi-analytic models of \citet{wyitheloeb:sam} and 
2279: \citet{kh00}, who adopt simplified ``lightbulb''-like quasar lightcurves 
2280: \citep[for a detailed discussion of these differences, see][]{hopkins:clustering}. 
2281: 
2282: The reason for the weak dependence of quasar clustering on luminosity in 
2283: Figure~\ref{fig:bias.vs.l} is, in fact, the nature of the quasar lightcurve. Quasars grow 
2284: rapidly in mergers to a peak quasar phase at the final stages of the merger, 
2285: which exhausts and expels the remaining gas, after which the quasar 
2286: decays to lower luminosities. This decay moves objects of the same host 
2287: properties to fainter luminosities in the QLF, making the clustering properties 
2288: flat as a function of luminosity. Thus, while an important test of our modeling 
2289: (that the correct halos and galaxies host quasars of the appropriate luminosities), 
2290: this is not a unique prediction of merger-driven models. 
2291: 
2292: 
2293: We can also use our model to estimate the infrared luminosity 
2294: functions of various populations versus redshift. By construction, our 
2295: assumed halo occupation model reproduces the observed star-forming (blue) 
2296: galaxy mass function at each redshift. Using the corresponding fitted star-formation 
2297: histories as a function of baryonic mass from \citet{noeske:sfh} (which fit the observations locally 
2298: and their evolution at least to $z\sim1.5$), we immediately obtain an 
2299: estimate of the star formation rate function in ``quiescent'' (non-merging) galaxies 
2300: at each redshift. We include a scatter of $\sim0.25\,$dex in SFR at fixed 
2301: stellar mass, comparable to that observed (in blue galaxies), but this makes relatively 
2302: little difference, as the most extreme SFR populations are dominated by mergers. 
2303: We then adopt the standard conversion from \citet{kennicutt98} to 
2304: transform this to an infrared luminosity function (where we refer to the total IR 
2305: $8-1000\,\mu{\rm m}$ luminosity). 
2306: 
2307: Our model also yields the mass function of 
2308: gas-rich mergers, for which we can estimate their distribution of star formation rates. 
2309: In \citet{hopkins:merger.lfs}, we quantify the distribution of star formation rates as a 
2310: function of galaxy properties from the same large suite of simulations 
2311: used to estimate the quasar lifetime. Essentially, this quantifies the ``lifetime'' above a 
2312: given SFR in a merger, which can be reasonably approximated as a simple function of 
2313: galaxy mass and (pre-merger) gas fraction, 
2314: \begin{equation}
2315: t(>\dot{M}_{\ast}) = t_{\ast}\,\Gamma{\Bigl(}0,\,\frac{\dot{M}_{\ast}}{M_{f}\,\fgas\,/t_{\ast}}{\Bigr)}, 
2316: \label{eqn:t.sfr}
2317: \end{equation}
2318: where $M_{f}$ is the post-merger galaxy mass (i.e.\ our $\mgal$) and $t_{\ast}\approx0.3\,$Gyr 
2319: is a fitted characteristic time. This functional form simply amounts to the statement that there is a 
2320: mean characteristic timescale $t_{\ast}$ in which most of the gas mass of the merger 
2321: ($M_{f}\,\fgas$) is converted into stars, which we find is (unsurprisingly) similar to the 
2322: dynamical time of the merger and to observational estimates of the 
2323: characteristic star formation timescale in starbursts and ULIRGs \citep{kennicutt98}. 
2324: Since the fitted star-formation histories of \citet{noeske:sfh} implicitly define a 
2325: gas fraction as a function of time (or can be used in combination with the 
2326: Schmidt-Kennicutt star formation law to infer the gas fraction), we simply adopt these 
2327: for the pre-merger galaxies \citep[but we have checked 
2328: that they correctly reproduce observed gas fractions as a function of mass 
2329: at $z=0,\,1,\,2$; see][]{hopkins:bhfp}. It is worth noting that, with this estimate, 
2330: the explicit dependence on $\fgas$ 
2331: can be completely factored out in Equation~(\ref{eqn:t.sfr}), and we can write it as 
2332: an estimate of the amount of time a system spends above a given enhancement in 
2333: SFR (basically a merger enhances the $\tau$-model SFR by $\sim \tau/t_{\ast}$), 
2334: relative to the pre-merger SFR.
2335: %For a typical $\mgal\sim10^{11}\,\msun$ galaxy, for example, this implies the system 
2336: %spends $\sim170\,$Myr at SFRs $>10$ times the quiescent rate, and just a few 
2337: %Myr at rates $>50$ times the quiescent rate (basically enhancing a $\tau$-model 
2338: %SFR by $\sim\tau/t_{\ast}$). 
2339: Using the same SFR to $L_{\rm IR}$ conversion, we obtain a rough estimate of the 
2340: IR luminosity function of mergers. 
2341: 
2342: Finally, adopting the empirically calculated obscured fraction as a function of 
2343: quasar luminosity from \citet{gilli:obscured.fractions}, and assuming that 
2344: the obscured bolometric luminosity is re-radiated in the IR, we convert 
2345: our predicted bolometric QLF to an IR QLF of obscured quasars. 
2346: Technically, not all of the luminosity will be obscured, of course, 
2347: but we find that e.g.\ using the full distribution of column densities as a function 
2348: of quasar luminosity from \citet{ueda03:qlf} to attenuate a template AGN SED yields 
2349: a very similar answer \citep[see also][]{franceschini:faint.xr.qsos}, as does using a mean 
2350: X-ray to IR bolometric correction of obscured AGN \citep{elvis:atlas,
2351: zakamska:multiwavelength.type.2.quasars,polletta:obscured.qsos}. 
2352: Including the IR contribution from un-obscured quasars 
2353: is a negligible correction.   
2354: 
2355: \begin{figure*}
2356:     \centering
2357:     \figexpand
2358:     %\plotone{ir.lfs.pred.ps}
2359:     \plotone{f21.ps}
2360:     \caption{{\em Left:} Predicted total IR ($8-1000\,\mu{\rm m}$) luminosity functions at different 
2361:     redshifts (as labeled). Green, blue, and red lines shows the estimated contribution from 
2362:     non-merging systems, star formation in mergers, and obscured AGN in mergers, respectively. 
2363:     Linestyles are as in Figure~\ref{fig:merger.mfs}, for the variants of the merger calculations.    
2364:     %given our halo occupation model and the 
2365:     %star-formation histories as a function of mass fitted in \citet{noeske:sfh}. 
2366:     %Blue lines (style as in Figure~\ref{fig:merger.mfs}) show the 
2367:     %contribution from star formation in mergers, using the distribution of 
2368:     %star formation rates as a function of merger properties from 
2369:     %\citet{hopkins:merger.lfs}. Red lines show the contribution from 
2370:     %obscured AGN, adopting the obscured fraction as a function of luminosity 
2371:     %(and bolometric correction for obscured systems) from \citet{gilli:obscured.fractions}. 
2372:     %Black dashed line is the combined luminosity function. 
2373:     Points show observational estimates from 
2374:     \citet[][magenta stars]{saunders:ir.lfs}, 
2375:     \citet[][blue triangles]{soifer:60m.lfs}, \citet[][black circles]{yun:60m.lfs}, 
2376:     \citet[][black diamonds]{lefloch:ir.lfs}, \citet[][black inverted triangles]{chapman:submm.lfs}, 
2377:     \citet[][black squares]{babbedge:swire.lfs}, 
2378:     and \citet[][black $\times$'s]{caputi:ir.lfs}. 
2379:     {\em Right:} Integrated IR luminosity density. Solid lines show the contributions from 
2380:     non-merging systems (green), star formation in mergers (blue), and obscured 
2381:     quasars in mergers (red). Blue dotted shows the total (star formation+AGN) 
2382:     merger contribution, black dashed shows the total from all sources. Orange points 
2383:     show observational estimates of $\rho_{\rm IR}$ from the compilation of 
2384:     \citet[][circles; only the direct IR observations therein are plotted here]{hopkins:sfh}, 
2385:     as well as \citet[][diamonds]{lefloch:ir.lfs}, \citet[][]{perezgonzalez:ir.lfs}, 
2386:     and \citet[][$\times$'s]{caputi:ir.lfs}.
2387:     Red stars show the bolometric quasar luminosity density from Figure~\ref{fig:lum.density}, 
2388:     rescaled by a constant (mean) obscured-to-unobscured ratio of $\sim2:1$. 
2389:     The agreement in all cases is good -- our model reproduces the star formation 
2390:     history of the Universe and distribution of star formation rates and bolometric luminosities. 
2391:     \label{fig:ir.lfs}}
2392: \end{figure*}
2393: 
2394: Figure~\ref{fig:ir.lfs} compares the resulting predicted IR luminosity functions to 
2395: observations at $z=0-2$, and to the observed IR luminosity density from $z\sim0-5$. 
2396: At all redshifts, the agreement is good, which suggests that our model accurately 
2397: describes the star-formation history of the Universe. This should be guaranteed, since 
2398: at all redshifts the quiescent population dominates the $\sim L_{\ast}$ optical 
2399: and IR luminosity functions (hence also the star formation rate and IR luminosity 
2400: densities) -- at this level, we simply confirm that our halo occupation model is a good 
2401: approximation. However, at high luminosities, typical of ULIRGs, the populations 
2402: are generally dominated by mergers and (at the highest luminosities) obscured 
2403: AGN. 
2404: 
2405: We explicitly quantify the transition point as a function of redshift in Figure~\ref{fig:ir.dom} 
2406: (we show the comparison there just for our ``default'' model, but as is clear in Figure~\ref{fig:ir.lfs}, 
2407: the transition between different populations dominating the LF is similar regardless of the 
2408: exact version of our model adopted). Our comparisons generally affirm 
2409: the conventional wisdom: at low redshift, mergers dominate the ULIRG and 
2410: much of the LIRG populations, above a luminosity $\sim10^{11.4}\,L_{\sun}$, 
2411: with heavily obscured (potentially Compton-thick) 
2412: AGN (in starburst nuclei) becoming a substantial contributor to IR luminous populations 
2413: in the most extreme $\gtrsim{\rm a\ few\ }\times10^{12}\,L_{\sun}$ systems 
2414: (nearing hyper-LIRG $>10^{13}\,L_{\sun}$ luminosities which are common bolometric 
2415: luminosities for $>10^{8}\,\msun$ BHs near Eddington, but would imply 
2416: potentially unphysical $\gtrsim1000\,\msun\,{\rm yr^{-1}}$ SFRs). 
2417: At higher redshifts, disks are more gas-rich, and thus have characteristically 
2418: larger star formation rates, dominating the IR LFs at higher luminosities. By 
2419: $z\sim1$, most LIRGs are quiescent systems, and by $z\sim2$, only extreme 
2420: systems $\gtrsim{\rm a\ few\ }\times10^{12}\,L_{\sun}$ are predominantly 
2421: mergers/AGN.
2422: 
2423: This appears to agree well with recent 
2424: estimates of the transition between AGN and passive star formation 
2425: dominating the bolometric luminosities of high-redshift systems. 
2426: Interestingly, 
2427: this shift occurs even while increasing merger rates (and higher 
2428: gas fractions in typical mergers) lead to a larger overall contribution of 
2429: mergers to the star formation rate and IR luminosity densities. At $z\sim0$, 
2430: mergers contribute negligibly to the total IR luminosity density, but 
2431: by $z\sim2$, they may contribute $\sim20-50\%$ of the IR output of the Universe, 
2432: with that contribution owing comparably to both star formation in mergers and 
2433: obscured BH growth \citep[which should be true, given the $\mbh-M_{\rm host}$ 
2434: correlations and typical $\epsilon_{r}\sim0.1$ radiative efficiencies;
2435: see, e.g.][]{lidz:proximity}. 
2436: 
2437: The integrated contribution of mergers to the star formation rate and IR luminosity 
2438: densities agrees well with observational estimates 
2439: \citep[available at $z\lesssim2$; see][]{bell:morphology.vs.sfr,menantau:morphology.vs.sfr}, 
2440: and the constraint from stellar population models that only a small fraction of the 
2441: $z=0$ stellar mass in typical early-type galaxies was formed in the 
2442: spheroid-forming merger itself \citep[as opposed to more extended star formation in 
2443: the pre-merger disks; e.g.][]{noeske:sfh}. For a more detailed comparison and analysis of the 
2444: merger-induced contribution to the star formation rate density of the Universe, 
2445: we refer to \citet{hopkins:merger.lfs}. 
2446: 
2447: \begin{figure}
2448:     \centering
2449:     \figexpand
2450:     %\plotone{ir.dom.pred.ps}
2451:     \plotone{f22.ps}
2452:     \caption{{\em Left:} Total IR luminosity, as a function of redshift, above which 
2453:     mergers (star formation+AGN) dominate the total IR luminosity functions 
2454:     (solid line, from Figure~\ref{fig:ir.lfs}; dashed lines show the range above which 
2455:     $25/75\%$ of systems on the luminosity function are mergers). Point shows 
2456:     the corresponding transition point (and range) observed in low-redshift systems 
2457:     \citep{sanders:review}. {\em Right:} Same, but for the transition between star formation 
2458:     (in non-merging+merging systems) and (obscured) AGN dominating the IR luminosity 
2459:     functions (generally a factor $\sim{\rm a\ few}$ larger luminosity than the 
2460:     quiescent system-merger transition). 
2461:     Points show the observed estimates from comparison of PAH feature strengths in 
2462:     \citet[][low redshift]{lutz:pah.qso.vs.sf.local} and \citet[][high redshift]{sajina:pah.qso.vs.sf}. 
2463:     A similar estimate is obtained (at low redshift)
2464:     from comparison of emission line strengths 
2465:     \citep{sanders96:ulirgs.mergers,kewley:in.prep}, full SED template fitting 
2466:     \citep{farrah:qso.vs.sf.sed.fitting}, or indirect comparison with Type 2 AGN luminosity 
2467:     functions \citep{chary.elbaz:ir.lfs}.
2468:     The model predicts the local transitions, and that by $z\gtrsim1$, the LIRG population 
2469:     is dominated by quiescent star formation in gas-rich systems (even as the 
2470:     total and fractional luminosity density in mergers increases rapidly). 
2471:     \label{fig:ir.dom}}
2472: \end{figure}
2473: 
2474: We caution that the above comparisons are approximate, and intended as a broad 
2475: check that our models are consistent with the observed abundance of 
2476: IR luminous galaxies as a function of redshift. We have ignored a number of 
2477: potentially important effects: for example, obscuration is a strong function of time 
2478: in a merger, and may affect various luminosities and morphological stages 
2479: differently. Moreover, our simple linear addition of the star formation contribution 
2480: of mergers to the IR LF and the AGN contribution is only technically correct 
2481: if one or the other dominates the IR luminosity at a given time in the merger; however, 
2482: there are clearly times during the final merger stages when the contributions 
2483: are comparable. Resolving these issues requires detailed, time-dependent 
2484: radiative transfer solutions through high-resolution simulations that properly 
2485: sample the merger and quiescent galaxy parameter space at each redshift, 
2486: and is outside the scope of this work \citep[although an important subject for future, 
2487: more detailed study; see, e.g.][]{li:radiative.transfer}.
2488: It would be a mistake, therefore, to read too much into 
2489: e.g.\ the detailed predictions for sub-millimeter galaxies or other extreme 
2490: populations based on Figures~\ref{fig:ir.lfs} \&\ \ref{fig:ir.dom}. However, most of our 
2491: predicted qualitative trends, including the evolution of the luminosity density 
2492: (and approximate relative contribution of mergers) and the shift in where 
2493: quiescent or merger-driven populations dominate the bright IR LF, should 
2494: be robust. Critically, a model in which merger-driven quasar activity dominates 
2495: the QLF predicts an abundance of IR-luminous galaxies consistent with 
2496: the observations as a function of both luminosity and redshift. 
2497: 
2498: 
2499: 
2500: 
2501: \subsection{When Merger-Triggering Loses to Secular Processes}
2502: \label{sec:quasars:secular}
2503: 
2504: Despite these arguments for a merger-driven origin for bright, high-redshift 
2505: quasars, there are good reasons to believe that most local, high-Eddington ratio 
2506: objects are {\em not} related to mergers. Most active local systems 
2507: typically involve relatively low-mass 
2508: BHs \citep[$\mbh\sim10^{7}\,\msun$;][]{heckman:local.mbh}, 
2509: in Sa/b-type host galaxies, 
2510: without significant evidence for recent major interactions 
2511: \citep{kauffmann:qso.hosts,pierce:morphologies}, and 
2512: have relatively low Seyfert-level luminosities 
2513: \citep[$-21\gtrsim M_{B} \gtrsim -23$;][]{hao:local.lf}, below 
2514: the traditional $M_{B}=-23$ Seyfert-quasar divide. Given this, it is natural to ask 
2515: whether there are additional reasons to believe that bright quasars have 
2516: distinct origins, and if so, when (or at what luminosities) these non-merger 
2517: driven fueling mechanisms begin to dominate AGN populations. 
2518: 
2519: \begin{figure}
2520:     \centering
2521:     \figexpand
2522:     %\plotone{qso.seyfert.smallscale.ps}
2523:     \plotone{f23.ps}
2524:     \caption{As Figure~\ref{fig:excess.clustering.qso} (upper center panel), but comparing 
2525:     the clustering (quasar-galaxy cross-correlation) 
2526:     as a function of scale measured by \citet{serber:qso.small.scale.env} 
2527:     for bright optical quasars and dimmer Seyfert galaxies. Quasar clustering is consistent with our 
2528:     predicted excess on small scales, indicating a merger-driven origin, but low-luminosity 
2529:     systems show no such dependence, suggesting that processes independent of the 
2530:     local, small-scale density (e.g.\ secular processes) may dominate at these luminosities. 
2531:     \label{fig:excess.clustering.seyferts}}
2532: \end{figure}
2533: In addition to the arguments in \S~\ref{sec:quasars:mergers} \& \ref{sec:quasars:qlf}, 
2534: there are a number of qualitative differences between bright, high-redshift quasars 
2535: and local Seyferts. Quasars have significantly different clustering amplitudes 
2536: \citep{hopkins:clustering} and host stellar mass distributions \citep{hopkins:transition.mass} 
2537: from star-forming galaxies at $z\gtrsim1$, and typically have hosts 
2538: with elliptical or merger remnant morphologies \citep{floyd:qso.hosts,falomo:qso.hosts,
2539: zakamska:qso.hosts,letawe:qso.merger.ionization}, frequently exhibiting 
2540: evidence of tidal disturbances \citep{bahcall:qso.hosts,
2541: canalizostockton01:postsb.qso.mergers,
2542: hutchings:redqso.lowz,hutchings:redqso.midz,
2543: urrutia:qso.hosts,bennert:qso.hosts}. Figure~\ref{fig:excess.clustering.seyferts} 
2544: compares the clustering as a function of scale measured in 
2545: \citet{serber:qso.small.scale.env} for both bright quasars and Seyfert galaxies -- 
2546: quasars exhibit the strong trend of excess clustering on small scales indicative of 
2547: a triggering process which prefers small-scale overdensities, but Seyferts 
2548: show no significant preference for local overdensities. 
2549: 
2550: \begin{figure}
2551:     \centering
2552:     \figexpand
2553:     %\plotone{qso.cmr.ps}
2554:     \plotter{f24.ps}
2555:     \caption{Location of quasars in the color-magnitude diagram, expected 
2556:     from different models. {\em Top:} Red and blue dotted regions roughly outline the 
2557:     red sequence and blue cloud, respectively, with the dashed line dividing the 
2558:     bimodality \citep[from][]{bell:combo17.lfs}. Arrows show the preferred location of 
2559:     quasar hosts in a merger driven model. At the end of a merger, a bright 
2560:     quasar is triggered in a spheroid/merger remnant at the top of the blue cloud 
2561:     (owing to the young stellar populations from pre-merger and merger-induced 
2562:     star formation), and subsequently the quasar luminosity decays while the remnant 
2563:     rapidly reddens, leaving a relatively low accretion rate remnant on the red sequence. 
2564:     {\em Middle:} Same, but for a secular triggering scenario in which quasar 
2565:     activity (which must still require cold gas) is uncorrelated with quenching or itself 
2566:     exhausts the gas supply. In this case, quasars should live in the blue cloud, with 
2567:     gas-rich systems, and their abundance rapidly drops approaching the ``green valley'' 
2568:     as gas supplies are exhausted. {\em Bottom:} We compare to 
2569:     observations of quasar host galaxy 
2570:     colors at $z\sim0.7-1.1$ from \citet[][blue circles]{sanchez:qso.host.colors}. X-ray identified 
2571:     AGN and quasar 
2572:     hosts from \citet[][orange diamonds]{nandra:qso.host.colors} are also shown 
2573:     (the numbers plotted should not be taken literally, as we have rescaled the authors
2574:     $U-B$ vs.\ $M_{B}$ color-magnitude relation to that shown here for the 
2575:     sake of direct comparison, but the result is qualitatively identical to that shown). 
2576:     Arrows reproduce the merger expectation from the top panel. Quasars appear to 
2577:     live in the region of color-magnitude space expected if they are triggered at the 
2578:     {\em termination} of star formation, and subsequently decay in luminosity, as 
2579:     expected in merger-driven scenarios. 
2580:     \label{fig:qso.cmd}}
2581: \end{figure}
2582: 
2583: Because galaxy mergers are also associated 
2584: with the termination of star formation in the remnant (even if only 
2585: temporarily), i.e.\ a rapid post-starburst phase and transition to the 
2586: red sequence (discussed in detail in \papertwo), 
2587: the decay of the quasar lightcurve should be associated with the 
2588: reddening of the remnant, in a merger-driven model. 
2589: This implies a particular preferred track for 
2590: quasar hosts in the color-magnitude diagram, illustrated in Figure~\ref{fig:qso.cmd}. 
2591: In this scenario, quasars should be associated with the crossing of the 
2592: ``green valley'' -- i.e.\ the triggering of a quasar occurs at the end of the merger, 
2593: when young stellar populations imply a bluer-than-average host spheroid, and 
2594: the quasar decays to lower luminosities as the remnant reddens onto the red sequence. 
2595: 
2596: Alternatively, if quasars were triggered in a purely secular manner, or otherwise independent 
2597: of whatever quenching mechanism terminates the galactic supply of cold gas, 
2598: then their natural preferred location is in the blue cloud -- i.e.\ blueward of the 
2599: ``green valley.'' Systems in this regime still have cold gas supplies and have not yet 
2600: quenched. Because the quenching is uncorrelated with quasar triggering 
2601: in such a model, and 
2602: the lack of galaxies in the ``green valley'' implies that
2603: this transition is rapid, very few quasars 
2604: would be expected to be triggered just as the quenching occurs, and therefore 
2605: few quasars should be present in the ``green valley.'' 
2606: 
2607: \begin{figure*}
2608:     \centering
2609:     %\plotone{qso.cmd.distrib.ps}
2610:     \plotone{f25.ps}
2611:     \caption{Distribution of quasar host galaxy colors from Figure~\ref{fig:qso.cmd}
2612:     (histograms; from \citet{sanchez:qso.host.colors} and \citet{nandra:qso.host.colors} 
2613:     in dark blue and orange, respectively). We compare with fitted (Gaussian) color 
2614:     distributions of blue cloud and red sequence galaxies from \citet{strateva:color.bimodality}, 
2615:     with the distribution of colors of barred galaxies in the SDSS from
2616:     \citet{barazza:bar.colors} (the expected quasar hosts in a secular or instability-driven 
2617:     quasar fueling model), and with the fitted (Gaussian) distribution of post-starburst 
2618:     (generally merger remnant) E+A/K+A galaxies in \citet{goto:e+a.merger.connection}. 
2619:     Quasar host colors follow the ``transition'' between blue cloud and red sequence 
2620:     observed and expected in merger remnants, in contrast to the preferentially most 
2621:     gas-rich, blue hosts of observed strong bars. 
2622:     \label{fig:qso.cmd.distrib}}
2623: \end{figure*}
2624: 
2625: Comparing these qualitative scenarios with observations appears to favor the 
2626: former, merger-driven case. Quasars tend to live redwards of the ``top'' of the 
2627: blue cloud, with the brightest/highest accretion rate 
2628: quasars preferentially in bluer-than-average spheroids in 
2629: the ``green valley'' \citep{kauffmann:qso.hosts,sanchez:qso.host.colors,nandra:qso.host.colors}. 
2630: 
2631: Figure~\ref{fig:qso.cmd.distrib} shows this quantitatively -- we plot the 
2632: distribution of colors of quasar hosts, compared with that fitted to 
2633: the blue cloud and red sequence, or systems with observed bars and/or 
2634: disk instabilities (the expected quasar hosts in a secular model, regardless of 
2635: quasar duty cycles during a bar phase), and post-starburst (E+A/K+A) 
2636: systems, largely identified as merger remnants and ``blue spheroids'' (see 
2637: the discussion in \S~\ref{sec:mergers:env}). The quasar hosts clearly 
2638: lie preferentially between the blue cloud and red sequence, 
2639: with a color distribution very similar to observed post-starburst galaxies. 
2640: 
2641: The distribution is quite distinct, however, from observed barred systems, 
2642: which lie overwhelmingly on the blue sequence with, if anything, a bias 
2643: towards the bluest systems (which is expected, as these are the most gas-rich 
2644: and therefore most unstable systems). Even if one assumes that, in the most 
2645: extreme bar instabilities, dust reddening might move the system into the 
2646: ``green valley'' as a reddened disk, this appears to contradict the observations 
2647: above which 
2648: find quasars to be in preferentially blue spheroids (even X-ray observations, 
2649: which suffer less severe bias against dust-reddened systems). 
2650: A more rigorous quantitative comparison of the tracks through 
2651: color-magnitude space and the relative abundances in this transition region 
2652: will be the topic of future work \citep[][in preparation]{wuyts:prep}, 
2653: and we stress that these are all relatively low-redshift samples, but studying 
2654: how the mean quasar luminosity and accretion rates scale/decay with the degree of 
2655: reddening or aging of their host stellar populations can provide a powerful 
2656: discriminant between these models. 
2657: 
2658: 
2659: \begin{figure}
2660:     \centering
2661:     %\figexpand
2662:     \epsscale{1.07}
2663:     %\plotone{seyferts.win.ps}
2664:     \plotter{f26.ps}
2665:     \caption{Fraction of the 
2666:     integrated quasar luminosity density owing to 
2667:     non-merger driven secular mechanisms. 
2668:     {\em Top:} Upper limit to the contribution from 
2669:     BHs in disk galaxy hosts at each $z$ (see text). Limits 
2670:     are derived from the observed type-separated mass functions 
2671:     in Figure~\ref{fig:quasar.small.groups} (same style) and \citet[][cyan stars]{franceschini:mfs}.
2672:     Solid line assumes the disk mass function does not evolve with $z$.   
2673:     {\em Second from Top:} Fractional 
2674:     contribution from systems in pseudobulges at $z=0$. 
2675:     Local distribution of pseudobulge masses is 
2676:     estimated from the observed pseudobulge fraction versus 
2677:     galaxy type \citep[][red dashed line, with $\sim1\sigma$
2678:     shaded range]{noordermeer:bulge-disk}, or assuming 
2679:     all bulges with Sersic index $n<2$ are pseudobulges 
2680:     \citep[with the distribution of $n$ versus bulge mass from][black 
2681:     solid line and shading]{balcells:bulge.scaling}, 
2682:     or from directly measured pseudobulge mass functions 
2683:     \citep[][blue long-dashed line and shading]{driver:bulge.mfs}. 
2684:     {\em Second from Bottom:} Probability (from $\chi^{2}$) 
2685:     that observed clustering of quasars (data in Figure~\ref{fig:quasar.bias})
2686:     and star-forming galaxies reflect the same hosts. 
2687:     Solid line is derived from the best-fit to the compilation of \citet{hopkins:clustering} 
2688:     points from the individual measurements included (see Figure~\ref{fig:quasar.bias}).
2689:     {\em Bottom:} Predicted fraction of the luminosity density from the 
2690:     the model for secular fueling from \citet{hopkins:seyferts}, when combined with 
2691:     the merger-driven model herein.
2692:     \label{fig:seyferts.win}}
2693: \end{figure}
2694: 
2695: There are a number of additional constraints we can place on the contribution to the 
2696: QLF from secular fueling in non-merging disks. Figure~\ref{fig:seyferts.win} 
2697: considers several of these. 
2698: First, we place a limit on secular activity by asking: at a given $z$, what are the 
2699: brightest QSOs possible in disk/star-forming galaxies?  For that redshift, we take the 
2700: observed mass function of star forming galaxies, and convolve with 
2701: $P(\mbh\,|\,\mgal)$ to obtain the hosted BH mass function (assuming 
2702: the most massive disks are Sa/b-type galaxies). Then, assume that every such 
2703: BH is at its Eddington luminosity. At some point (corresponding to 
2704: $\gtrsim2-4\,\mstar$ in the disk mass function) the number density of these mock 
2705: quasars falls below the QLF (which declines much less 
2706: rapidly) at that luminosity and redshift. In other words, at high luminosities, the required BH masses 
2707: from the Eddington limit are too large to live in late-type galaxies. 
2708: To be optimistic, we assume {\em all} the quasar luminosity density below this limit 
2709: is contributed by secular activity in disks. This then gives an upper limit to the 
2710: fraction of the luminosity density from disks. We repeat this procedure for a 
2711: number of different mass functions at different redshifts. In all cases, even this 
2712: limit falls to a fraction $\ll1$ by $z\gtrsim1$, as the QLF $\lstar$ reaches large 
2713: luminosities corresponding to $\mbh\gtrsim10^{8}\,\msun$ BHs at the Eddington limit. Given 
2714: the BH-host spheroid mass relations, this requires a very massive spheroid, easily formed 
2715: in a merger, but not present in even the most early-type disks. 
2716: 
2717: Second (alternatively), we assume all BHs in pseudobulges were formed via secular 
2718: mechanisms. As discussed in \S~\ref{sec:intro}, there is good reason to believe that this is 
2719: the case, whereas classical bulges must be formed in mergers. For a given 
2720: $z=0$ BH population, we infer an accretion history in the standard fashion from matching the 
2721: BH mass function and continuity equations \citep[e.g.][]{salucci:bhmf,yutremaine:bhmf}. 
2722: We then calculate the fraction of the QLF luminosity density at a given redshift 
2723: from systems which, at $z=0$, live in pseudobulges. We consider this for several 
2724: different observational estimates of the pseudobulge fraction as a function of e.g.\ 
2725: host galaxy morphological type or bulge Sersic index 
2726: \citep{kormendy.kennicutt:pseudobulge.review,balcells:bulge.scaling,allen:bulge-disk,
2727: noordermeer:bulge-disk}, and the directly estimated 
2728: pseudobulge mass functions in \citet{driver:bulge.mfs}. Although the details are sensitive to 
2729: how we define pseudobulges, we find a similar result -- massive BHs which dominate 
2730: the luminosity density at $z\gtrsim1$ live in the most massive bulges/ellipticals, which are 
2731: overwhelmingly classical bulges. 
2732: 
2733: Third, we calculate the probability that the observed clustering of quasars 
2734: is consistent with that of star forming/disk galaxies (see Figure~\ref{fig:quasar.bias}). 
2735: This is subject to some important caveats -- although quasar clustering depends 
2736: only weakly on luminosity (see Figure~\ref{fig:bias.vs.l}), 
2737: galaxy clustering has been shown to depend quite strongly 
2738: on galaxy luminosity/stellar mass \citep{norberg:clustering.by.lum.type}. 
2739: We use the compilation of 
2740: clustering data from \citet{hopkins:clustering}, as in Figure~\ref{fig:quasar.bias}. At 
2741: $z\lesssim1.5$, we specifically compare the clustering of $\sim\lstar$ quasars 
2742: with that of $\sim\lstar$ blue/star-forming galaxies. For {\em any} model in which quasars 
2743: are driven by secular activity and the statistics of quasar light curves/triggering 
2744: are continuous as a function of host mass/luminosity (i.e.\ there is not 
2745: a second feature in the luminosity function introduced by the 
2746: statistics of the light curves themselves), these should roughly correspond. 
2747: At higher redshift, galaxy clustering as a function of type and luminosity/mass 
2748: at $\sim\lstar$ is not clearly resolved so
2749: we can only plot combined clustering of observed 
2750: star-forming populations (generally selected as Lyman-break galaxies); 
2751: again caution is warranted given the known dependence of 
2752: clustering on galaxy mass/luminosity \citep[for LBGs, see][]{allen:lum.dep.lbg.clustering}. 
2753: Fortunately, the range of particular interest here is $z\lesssim1$, where 
2754: we again find a similar trend -- quasar clustering is consistent with 
2755: secular fueling at $z\sim0$, but by $z\sim1$ this is no longer true. 
2756: As discussed in \citet{hopkins:clustering}, this 
2757: appears to be contrary to some previous claims 
2758: \citep[e.g.,][]{adelbergersteidel:lifetimes}; however, in most cases where 
2759: quasars have been seen to cluster similarly to blue galaxies, either 
2760: {\em faint} AGN populations (not $\sim L_{\ast}$ quasars) or 
2761: bright ($\gg L_{\ast}$) blue galaxies were considered. Indeed, quasars 
2762: do cluster in a manner similar to the {\em brightest} blue galaxies 
2763: observed at several redshifts \citep[e.g.,][at $z\sim1$ and 
2764: $z\gtrsim2$, respectively]{coil:agn.clustering,allen:lum.dep.lbg.clustering}. 
2765: This should not be surprising; 
2766: since quasars require some cold gas supply for their fueling, they cannot be significantly 
2767: more clustered than the most highly clustered (most luminous) population of 
2768: galaxies with that cold gas. 
2769: 
2770: Finally, we compare these with a simple model expectation. We combine our 
2771: prediction of the merger-driven QLF with the model from \citet{hopkins:seyferts} 
2772: for the QLF driven by secular fueling mechanisms in star-forming 
2773: galaxies. This prediction is based on a simple model of feedback-driven 
2774: self-regulation, calculating the rate of triggering in non-merging disks from 
2775: the observed statistics of gas properties in the central regions of star-forming 
2776: galaxies of different types. The result is similar to the empirical constraints. 
2777: 
2778: All of these comparisons have important caveats. For example, 
2779: secular mechanisms could act so quickly as to completely 
2780: transform disks to bulges, rapidly making very large BHs (although this 
2781: conflicts with the pseudobulge constraints) from disk hosts. 
2782: Pseudobulges could form in more systems than we 
2783: estimated, but be subsequently transformed to 
2784: classical bulges via major mergers. Clustering could 
2785: be affected by a number of 
2786: systematic uncertainties inherent in e.g.\ the mass and luminosity 
2787: ranges considered. However, these systematics are independent, 
2788: and there is no single loophole which can simultaneously 
2789: reconcile the three constraints considered here with the possibility 
2790: that secular fueling dominates bright $\sim\lstar$ quasar activity 
2791: at $z\gtrsim1$. Although there are differences in detail, 
2792: all the methods we have considered empirically suggest a similar 
2793: scenario: secular (non-major merger related) fueling mechanisms 
2794: contribute little to quasar activity at $z\gtrsim1$, which involves 
2795: the most massive $\mbh\gtrsim10^{8}\,\msun$ BHs in the most 
2796: massive spheroids. By $z\sim0.5$, however, the most massive 
2797: BHs are no longer active, and a significant fraction of the quasar luminosity 
2798: density can come from $\sim10^{7}\,\msun$ BHs in undisturbed hosts. 
2799: By $z\sim0$, the local QLF is largely dominated by Seyfert activity in relatively 
2800: small BHs with late-type, undisturbed host disks \citep{heckman:local.mbh}.
2801: 
2802: \begin{figure}
2803:     \centering
2804:     \figexpand
2805:     %\plotone{lumden.vs.model.ps}
2806:     \plotone{f27.ps}
2807:     \caption{Bolometric quasar luminosity density as a function of redshift. Black stars 
2808:     show the observations from \citet{hopkins:bol.qlf}. Lines show estimates from 
2809:     different models (as labeled):  
2810:     the prediction from a merger-driven model (as in Figure~\ref{fig:lum.density}) and 
2811:     a moderate secular model in which BHs in pseudobulges at $z=0$ 
2812:     were formed in disk instabilities (as in Figure~\ref{fig:seyferts.win}, line 
2813:     in same style) are in good agreement with the luminosity density 
2814:     evolution and empirical constraints on clustering, host galaxy colors, spheroid kinematics, 
2815:     and disk/spheroid mass functions. We compare a maximal secular model, 
2816:     from \citet{bower:sam}, in 
2817:     which most BHs and (even classical) spheroids  
2818:     are initially formed via disk instabilities, and an ``extreme'' secular model, 
2819:     in which all $z=0$ BH mass is formed in such instabilities (same as the 
2820:     maximal secular model, but with no BH growth from cooling, accretion, 
2821:     or mergers; this is unphysical but serves as a strong upper limit). In order for 
2822:     disk instabilities to dominate BH growth, they must act very rapidly, before the 
2823:     (inevitable) major mergers can exhaust gas and form massive spheroids -- 
2824:     this forces such models to predict a luminosity density history offset to earlier 
2825:     times (higher redshifts) compared to the merger-driven model, in 
2826:     disagreement with the observations. 
2827:     \label{fig:lumden.z.models}}
2828: \end{figure}
2829: 
2830: Even if we ignore these constraints, a model in which secular fueling dominates 
2831: the growth of quasars and BHs has difficulty matching the observed rise and 
2832: fall of the quasar luminosity density with cosmic time. 
2833: Figure~\ref{fig:lumden.z.models} illustrates this. We show the observed 
2834: bolometric quasar luminosity density as a function of redshift, compared to our 
2835: estimate of the merger-driven luminosity density (as in Figure~\ref{fig:lum.density}). 
2836: We also show our estimate of the luminosity density which comes from 
2837: systems which, at $z=0$, live in pseudobulges, calculated as in 
2838: Figure~\ref{fig:seyferts.win}. Again, this fairly moderate, empirical model of 
2839: secular activity can account for the observed luminosity density at low 
2840: redshifts $z\lesssim0.5$, but provides only a small contribution at high redshifts 
2841: $z\gtrsim1$. 
2842: 
2843: We might, however, imagine a ``maximal'' secular 
2844: model in which {\em all} spheroids 
2845: are initially formed by disk instabilities. Equivalently (for our purposes), 
2846: albeit highly contrived, a model might invoke secular processes to rapidly 
2847: build up BH mass (to the final mass that will be given by the ``future'' 
2848: $\mbh-\sigma$ relation) before a spheroid is formed in later mergers and/or instabilities. 
2849: These have severe difficulty 
2850: reconciling with the kinematics of observed classical bulges 
2851: (see \S~\ref{sec:intro}) and the tightness of the BH-host spheroid correlations, respectively, 
2852: and are not favored by simple dynamical arguments \citep[see, e.g.][]{shen:size.mass}, 
2853: nor the constraints in Figure~\ref{fig:seyferts.win}, 
2854: but they could in principle be invoked. In fact, the semi-analytic model of 
2855: \citet{bower:sam} is effectively such a scenario, in which a 
2856: very strong disk instability mode is analytically adopted, which overwhelmingly 
2857: dominates initial bulge formation and BH growth (mergers contributing $\ll 1\%$ at all redshifts). 
2858: We therefore compare their estimate for the total quasar luminosity density 
2859: (accretion rate density) as a function of time. 
2860: Finally, in the default \citet{bower:sam} model, there is still some growth of BHs 
2861: via accretion from the diffuse ISM, cooling, and mergers (major and minor). We 
2862: therefore also adopt an even more extreme 
2863: secular model, in which we reproduce the \citet{bower:sam} analysis with an 
2864: even stronger disk instability mode -- essentially renormalizing the model such 
2865: that all $z=0$ bulge mass was formed in this ``secular'' mode (i.e.\ we allow 
2866: {\em no} subsequent growth via other mechanisms, and demand that the observed 
2867: integrated $z=0$ BH mass density be matched by the integrated secular mode growth). 
2868: This latter model is of course unphysical, but yields a hard upper limit to 
2869: secular-mode growth.
2870: 
2871: It is immediately clear that the ``maximal'' secular model predicts that the quasar luminosity 
2872: density should peak at much higher 
2873: redshifts $z\sim4$ than the observed $z\sim2$. In general, 
2874: the rise and fall of the quasar luminosity density in such a model are offset to earlier 
2875: times. The reason for this is simple: in a fully cosmological model, mergers are 
2876: {\em inevitable}. And, whether or not most quasars are triggered by mergers, it is 
2877: extremely difficult to contrive a major, gas-rich merger without BH accretion and 
2878: spheroid formation, with most of the gas being consumed by star formation. The only way that 
2879: a secular or disk instability model can dominate the integrated buildup of BH mass and 
2880: quasar luminosity density is to ``beat mergers to the finish,'' i.e.\ to generally operate 
2881: early and rapidly enough such that the BHs have been largely formed, and gas already 
2882: exhausted, by the time massive galaxies undergo their first major mergers. In such models, 
2883: then, one is forced to predict that the quasar luminosity density peaks at very early times 
2884: and has largely declined (i.e.\ most of the gas in massive 
2885: systems has already been exhausted) by $z\sim2$. 
2886: 
2887: Finally, this relates to a more general point. The quasar luminosity density 
2888: \citep[and especially the number density of bright quasars corresponding to 
2889: $\gtrsim10^{8}\,\msun$ BHs at high Eddington ratio; see][]{fan04:qlf,richards:dr3.qlf} declines 
2890: rapidly at $z\gtrsim2-3$ (roughly as $\sim(1+z)^{4-6}$), compared to the 
2891: global star formation rate density of the Universe, which is relatively flat 
2892: at these redshifts \citep[declining as $\sim(1+z)^{0-1.5}$ from $z\sim2-6$;][]{hopkinsbeacom:sfh}. 
2893: This has long been recognized, and cited as a reason why quasars and BH growth cannot 
2894: explain reionization at high redshifts (since, similar to the global star formation history, the 
2895: UV background declines slowly at these redshifts). It further implies that BH growth 
2896: (at least at the masses of interest for our predictions here) cannot generically 
2897: trace star formation. This places strong constraints on secular models, as above, as well as 
2898: models in which essentially all high-redshift star formation is in bulges or 
2899: some sort of dissipational collapse \citep[e.g.][]{granato:sam,lapi:qlf.sam}. Some process
2900: must delay the formation of massive BHs, while allowing star and galaxy formation to 
2901: proceed efficiently at high redshifts. A natural explanation is that massive BH formation 
2902: requires major mergers. In our model, at high redshifts, low-mass galaxies can efficiently form 
2903: (and potentially build low-mass BHs via secular instabilities), but they are 
2904: predominantly disks, which efficiently turn gas into stars and do not form very massive 
2905: bulges or BHs. Only later, once their hosts have grown more massive, are they likely to 
2906: undergo major mergers, which transform the disks into spheroids and build correspondingly 
2907: massive BHs. This automatically explains the much sharper rise and fall of the quasar 
2908: luminosity density and number density of bright quasars, relative to the 
2909: shallow evolution in the star formation rate density and ionizing background 
2910: of the Universe at high redshifts. 
2911: 
2912: \section{Discussion}
2913: \label{sec:discussion}
2914: 
2915: We have developed a theoretical model for the cosmological 
2916: role of galaxy mergers, which allows us to make predictions for various 
2917: merger-related populations such as starbursts, quasars, and 
2918: spheroidal galaxies. 
2919: By combining theoretically well-constrained 
2920: halo and subhalo mass functions as a function of redshift and 
2921: environment with empirical halo occupation models, we can estimate where 
2922: galaxies of given properties live at a given epoch. This allows us to 
2923: calculate, in an {\em a priori} cosmological manner, where major galaxy-galaxy 
2924: mergers occur and what kinds of galaxies merge, at all redshifts. 
2925: 
2926: We compare these estimates to a number of observations, including 
2927: observed merger mass functions; merger fractions as a function of 
2928: galaxy mass, halo mass, and redshift; the mass flux/mass density in 
2929: mergers; the large-scale clustering/bias of merger populations; 
2930: and the small-scale environments of mergers, and show 
2931: that this approach yields robust predictions in good agreement with
2932: observations, and can be extended to predict detailed properties 
2933: of mergers at all masses and redshifts. 
2934: There are some uncertainties in this approach. However, we 
2935: re-calculate all of our predictions adopting different estimates for the 
2936: subhalo mass functions and halo occupation model (and its redshift 
2937: evolution) and find this makes little difference (a factor $<2$) at all 
2938: redshifts. The largest uncertainty comes from our calculation of 
2939: merger timescales, where, at the highest redshifts ($z\gtrsim3$), merging via 
2940: direct collisional processes might be more efficient than 
2941: merging via dynamical friction, given the large physical densities. 
2942: More detailed study in very high-resolution numerical simulations will 
2943: be necessary to determine the effective breakdown between different 
2944: merger processes.
2945: Nevertheless, the difference in our predictions at these redshifts is still 
2946: within the range of observational uncertainty. 
2947: Ultimately, we find that our predictions are robust 
2948: above masses $\mgal\gtrsim10^{10}\,\msun$, regardless of these 
2949: possible changes to our model, as the theoretical 
2950: subhalo mass functions and empirical halo occupation models 
2951: are reasonably well-constrained in this regime. 
2952: 
2953: In addition to these specific observational predictions and tests, 
2954: our model allows us to examine the physical origins of the distribution of 
2955: major mergers of different galaxy masses and types. For example, 
2956: there is a naturally defined major-merger scale (host halo mass $\mhalo$) for 
2957: galaxies of mass $\mgal$ -- the ``small group scale,'' only slightly larger than 
2958: the average halo hosting a galaxy of mass $\mgal$. This is the scale at which 
2959: the probability to accrete a second galaxy of comparable mass $\sim\mgal$ (fuel for a 
2960: major merger) first becomes significant. At smaller (relative) 
2961: halo masses, the probability that the halo 
2962: hosts a galaxy as large as $\mgal$ declines rapidly. At larger masses, the 
2963: probability that the halo will merge with or accrete another halo hosting a comparable $\sim\mgal$ 
2964: galaxy increases, but the efficiency of the merger of these galaxies declines rapidly. 
2965: We stress that this small group scale is indeed small -- the 
2966: average small group halo will still host only 1 galaxy 
2967: of mass $\sim\mgal$, and groups will only consist of $2-3$ members of similar mass. 
2968: We also note that this does not mean that mergers occur (in a global sense) on a specific scale, 
2969: since the small group scale is different for different galaxy masses.
2970: In fact, a consequence of this model is that mergers occur in halos of 
2971: all masses and in all environments (including field and even void environments), as is observed 
2972: \citep{alonso:groups,goto:e+a.merger.connection,hogg:e+a.env}, although 
2973: the characteristic masses 
2974: and star formation histories 
2975: of galaxies merging may reflect their different environments/halo masses. 
2976: Similarly, our model allows us to accurately predict and understand the 
2977: (relatively weak) evolution of the merger fraction with redshift, and the 
2978: relative evolution in merger rates as a function of mass (evolution of the 
2979: major merger mass functions). The clustering properties and dependence of 
2980: merger rates on both large-scale and small-scale environment are natural 
2981: consequences of the fundamentally local nature of mergers, and we 
2982: study in detail the effects of environment on merger rates as a function of scale. 
2983: 
2984: Having characterized mergers in this way, we examine the role 
2985: that mergers play in triggering quasars. Even if there are other quasar ``triggers'' 
2986: dominant at some luminosities/redshifts, it is difficult to imagine a scenario in which the 
2987: strong nuclear gas inflows from a merger do not cause 
2988: rapid, near Eddington-limited accretion and ultimately yield some kind of quasar 
2989: -- and indeed such activity is ubiquitous in late-stage mergers 
2990: \citep{komossa:ngc6240,alexander:xray.smgs,
2991: borys:xray.ulirgs,brand:xray.ir.contrib}. We therefore make the simple 
2992: ansatz that gas-rich, major
2993: mergers will produce quasars (but do, in principle, allow for other 
2994: fueling mechanisms as well). This model, with just the contribution of mergers 
2995: to the quasar luminosity density, is able to account for 
2996: the observed quasar luminosity density from $z=0-6$. 
2997: The rise and fall of the luminosity density with redshift, as well as 
2998: the shape and evolution of the quasar luminosity function, are 
2999: accurately reproduced. This also yields predictions of the local black hole 
3000: mass function, cosmic X-ray background \citep[see][]{hopkins:qso.all}, 
3001: AGN fractions as a function of galaxy mass/luminosity and 
3002: redshift, large scale quasar clustering as a function of luminosity and redshift, 
3003: small-scale quasar clustering excesses, quasar host galaxy colors, 
3004: and infrared luminosity functions, all in good agreement with those observed.
3005: In particular, matching the history of the bolometric 
3006: luminosity density of quasars requires no knowledge or assumptions about 
3007: quasar duty cycles, light curves, or lifetimes, only our determination of the 
3008: global mass density in gas-rich major mergers. 
3009: 
3010: In our model, the sharp rise and fall of the quasar luminosity density over 
3011: cosmic time is the product of several factors. At high redshifts, the 
3012: buildup of BH mass from $z\gtrsim6$ to $z\sim2$ owes in part to 
3013: the growth of galaxy and halo mass, as most galaxies are rapidly forming, 
3014: and the galaxy mass density involved in major mergers steadily 
3015: increases with time. The rise is steeper than that in, for example, the 
3016: global star formation rate density of the Universe, as it tracks 
3017: just the major merger history (effectively, at these redshifts, the rise in the 
3018: density of relatively massive ``small group'' sized halos), as opposed to the global buildup of 
3019: the (relatively lower-mass) halos hosting the most rapidly star-forming galaxies. 
3020: Below redshift $z\sim2$, merger rates begin to decline 
3021: for all galaxies, and the exhaustion of gas in evolved systems 
3022: slows the growth of quasars in two ways. First, major 
3023: mergers of relatively gas-poor disks create shallower central potential 
3024: wells for the remnant spheroid (i.e.\ lower $\sigma$ values), and 
3025: as a consequence BH growth self-regulates at lower masses 
3026: \citep{hopkins:bhfp}, in agreement with the observed evolution of 
3027: the BH-host correlations with redshift \citep[e.g.,][]{peng:magorrian.evolution}. Second, an 
3028: increasing fraction of galaxies (especially around $\sim\lstar$, where 
3029: most of the mass density resides) have already undergone major 
3030: mergers and exist as ``quenched'' spheroids (with very 
3031: little remaining cold, rotationally supported gas) 
3032: whose major mergers will not excite quasar activity. 
3033: Recent high-resolution cosmological simulations which 
3034: attempt to resolve the relevant merger and feedback effects 
3035: regulating BH growth \citep{sijacki:radio,dimatteo:cosmo.bhs} further support this scenario, 
3036: with the combination of these effects and, primarily, the merger 
3037: history of the Universe regulating BH growth (at least at redshifts 
3038: $z\lesssim6$). The product of these 
3039: effects yields the observed steep rise and fall of the quasar population 
3040: with respect to its peak at $z\sim2$, in good agreement with the 
3041: observations and in contrast with the substantially more extended 
3042: global star formation history of the Universe. 
3043: 
3044: We compare this model to one in which quasar fueling is primarily 
3045: driven by secular processes -- i.e.\ disk instabilities, bars, harassment, 
3046: or any process which operates in non-merging, gas-rich systems. 
3047: We demonstrate that there are a number of robust, qualitatively distinct 
3048: predictions from these models, including: 
3049: 
3050: {\em Quasar Clustering:} A merger-driven model accurately predicts 
3051: the observed large-scale clustering of quasars (both at $\sim\lstar$ and as a detailed 
3052: function of luminosity) as a function of redshift for the observed 
3053: range $z\sim0.5-4$. 
3054: The clustering is, at all these redshifts, precisely that predicted for 
3055: ``small group'' halos in which major mergers of gas-rich galaxies should proceed 
3056: most efficiently. It is well-established empirically that quasar clustering 
3057: traces a characteristic host halo mass
3058: \citep{porciani2004,
3059: wake:local.qso.clustering,croom:clustering,porciani:clustering,
3060: myers:clustering,daangela:clustering,coil:agn.clustering,
3061: shen:clustering,hopkins:clustering}, 
3062: and investigations of the quasar proximity effect 
3063: reach a similar conclusion \citep{faucher:proximity,kim:proximity,guimaraes:proximity}.
3064: Comparing this to independent, direct measurements of the small group 
3065: scale of $\sim\lstar$ gas-rich galaxies, and to the small group 
3066: scale inferred from a wide variety of different halo occupation models, we show 
3067: in all cases that these trace the same mass. 
3068: In contrast, the clustering of typical star-forming galaxies is somewhat weaker 
3069: (as expected relative to their small group scale), and yields an underestimate of 
3070: quasar clustering at moderate and high redshifts. Only at low redshifts 
3071: ($z\lesssim0.5$) is there reasonable consistency between the clustering of 
3072: $\sim\lstar$ quasars and ``secular'' populations 
3073: \citep[for more details, see][]{hopkins:clustering}.
3074: 
3075: {\em Small-Scale Environments:} Mergers will preferentially occur in environments 
3076: with an overdensity of galaxies on small scales, and as a consequence their 
3077: clustering should reflect a bias (relative to a mean galaxy of the same mass) to 
3078: excess clustering on small scales. Furthermore, triggering of binary quasars in (even 
3079: a small fraction of) early interacting pairs can enhance this excess. 
3080: Indeed, in a purely empirical sense, both bright quasars at all redshifts 
3081: $z\sim0.5-3$ \citep{hennawi:excess.clustering,serber:qso.small.scale.env,
3082: myers:clustering.smallscale} and local 
3083: post-starburst merger remnant galaxies \citep{goto:e+a.merger.connection} are observed to 
3084: have similar, strong excess clustering on small scales, distinct from 
3085: quiescent (non-merger related) populations. 
3086: This is true both in terms of the quasar-quasar autocorrelation, and for 
3087: the quasar-galaxy cross-correlation, suggesting that it reflects a true tendency for quasars 
3088: to reside in regions of small-scale overdensity. Our model predicts the 
3089: magnitude of this excess clustering as a function of physical scale 
3090: and redshift well for both populations. Interestingly, low-luminosity 
3091: Seyfert galaxies ($M_{B}>-23$) are observed 
3092: without such an excess on small scales \citep{serber:qso.small.scale.env}, as expected if 
3093: AGN triggering at low luminosities (or typical $\mbh\lesssim10^{7}\,\msun$) 
3094: is dominated by secular processes (with the true quasar populations dominated 
3095: by mergers). However, systems of these low luminosities contribute 
3096: significantly to the quasar luminosity density at only very low redshifts $z\lesssim0.5$, 
3097: once more massive systems have predominantly quenched. 
3098: 
3099: {\em Host Galaxy Colors:} The stellar population colors of a 
3100: gas-rich merger remnant will rapidly redden, at least over the $\sim$\,Gyr period 
3101: over which subsequent infall or cooling can be ignored, and the system 
3102: will (even if only temporarily) cross the ``green valley'' between the blue cloud and 
3103: red sequence. If a quasar is triggered at the end of a merger, the decay of the 
3104: quasar lightcurve should be associated with the host crossing this interval, or 
3105: equivalently with the presence of a relatively young, blue host spheroid. 
3106: Observed quasar hosts at $z\sim0.5-1.1$ appear to preferentially occupy this 
3107: (otherwise relatively empty) locus in color-magnitude space 
3108: \citep{sanchez:qso.host.colors,nandra:qso.host.colors}, 
3109: and it is well-established that bright quasar hosts tend to be 
3110: massive spheroids with especially young stellar or post-starburst stellar 
3111: populations \citep[e.g.][and references therein]{canalizostockton01:postsb.qso.mergers,
3112: jahnke:qso.host.sf,vandenberk:qso.spectral.decomposition,barthel:qso.host.sf}. 
3113: We show that the color distribution of observed quasar hosts 
3114: is similar to that observed for clear post-starburst merger remnant 
3115: populations. In contrast, a secular model (regardless of the quasar duty cycle or lifetime) 
3116: would predict that quasar hosts trace the population of systems hosting 
3117: strong disk instabilities or bars (unless any quasar activity could somehow be suppressed 
3118: over the entire lifetime of a relatively long-lived bar) -- these actually 
3119: tend to be the most blue, gas-rich disk galaxies. We show that the observed colors of quasar 
3120: hosts are distinct from those of systems observed hosting strong bars.  
3121: 
3122: {\em Host Kinematics (Pseudobulges versus Classical Bulges):} Numerical 
3123: simulations and observations of both barred systems and merger remnants 
3124: have established that mergers yield systems with the observed kinematic and 
3125: photometric properties of classical bulges, whereas secular disk instabilities 
3126: generically give rise to pseudobulges with distinct properties
3127: (see the discussion in \S~\ref{sec:intro}). At high redshifts $z\gtrsim1$, the active 
3128: $\sim\lstar$ quasar populations (either from direct quasar BH mass measurements or 
3129: simply the Eddington argument) are dominated by massive BHs 
3130: ($\mbh\gtrsim10^{8}\,\msun$), which are directly observed to live in massive bulges 
3131: at those redshifts \citep{peng:magorrian.evolution}, and whose remnants clearly live in massive bulges 
3132: locally. These spheroids ($M_{\rm sph}\gtrsim10^{11}\,\msun$) 
3133: are overwhelmingly classical spheroids (in particular, classical true ellipticals), 
3134: whose kinematics argue that they were formed in mergers. To the extent that the buildup 
3135: of BH mass traces spheroid origin (true at all redshifts observed, albeit with 
3136: potentially redshift-dependent efficiency), this implies formation in mergers. 
3137: Adopting a number of different estimates of e.g.\ the pseudobulge fraction as a 
3138: function of host properties, pseudobulge mass distributions, or simply assuming 
3139: all bulges in star-forming/disk-dominated galaxies are formed via secular instabilities, 
3140: we compare with the distribution of active BH masses in the quasar luminosity function 
3141: at all redshifts, and show that these populations cannot dominate the 
3142: QLF at redshifts $z\gtrsim1$. Only at low redshifts $z\lesssim1$ are the 
3143: global QLF and buildup of BH mass occurring mainly
3144: in systems which typically reside in star-forming, disk-dominated hosts 
3145: with pseudobulges potentially formed via disk instabilities or bars. 
3146: 
3147: {\em Quasar Luminosity Density versus Redshift:} As noted above, a 
3148: merger-driven model predicts a sharp rise and fall of the quasar luminosity 
3149: density in good agreement with observations. If, for the sake of argument, we 
3150: adopt a model in which all BH growth is driven by disk instabilities, 
3151: we demonstrate that, once embedded in a proper cosmological context, 
3152: such a model is generically forced to predict a history of quasar luminosity density 
3153: which is offset to earlier times (in each of its rise, peak, and fall), in 
3154: conflict with the observations. This is because major mergers are dynamically 
3155: inevitable -- one cannot simply ``remove'' the mergers a galaxy will undergo 
3156: in a true cosmological model. In order for disk instabilities to dominate BH growth 
3157: or spheroid formation, they must, therefore, act before massive systems undergo 
3158: their major mergers. Since the global mass flux in gas-rich major mergers 
3159: peaks around $z\sim2-3$, a secular-dominant model is forced to assume a sufficiently 
3160: strong disk instability mode such that the progenitors of these systems 
3161: rapidly exhaust their gas supplies and build up most of their final BH/spheroid mass 
3162: at redshifts $z\gtrsim4$. By $z\sim2$, then, these models predict the quasar luminosity 
3163: density is already in rapid decline. We demonstrate this both for current state-of-the-art 
3164: semi-analytic models \citep{bower:sam}, constrained such that they cannot overproduce 
3165: the $z=0$ mass density in quenched systems nor ``avoid'' major mergers, and 
3166: simple illustrative toy models. 
3167: The only way to avoid this is to weaken the disk 
3168: instability criterion -- i.e.\ to assume disk instabilities are not so efficient at exhausting 
3169: systems, and can therefore act continuously over longer times. But then, one obtains 
3170: a prediction similar to our expectation from assuming all pseudobulges are formed 
3171: in disk instabilities -- namely, the high rate of gas-rich mergers at high redshifts will 
3172: dominate quasar activity at all $z>1$, and this ``gentler'' disk instability mode will 
3173: dominate at lower luminosities (i.e.\ only dominate BH mass buildup at low 
3174: masses $\mbh\lesssim10^{7}\,\msun$), becoming important to the 
3175: total luminosity density only at $z<1$.
3176: 
3177: These comparisons, despite the very different possible systematic effects 
3178: in the observations, all suggest a similar scenario. 
3179: Secular (non-merger related) fueling mechanisms may dominate 
3180: AGN activity in low-BH mass systems ($\mbh\lesssim10^{7}\,\msun$), 
3181: for which mergers are relatively rare 
3182: and hosts tend to be very gas-rich, potentially bar-unstable disks, but these 
3183: contribute little to quasar activity at $z\gtrsim1$, which involves 
3184: the most massive $\mbh\gtrsim10^{8}\,\msun$ BHs in the most 
3185: massive spheroids. By $z\sim0.5$, however, the most massive 
3186: BHs are no longer active (their hosts having primarily been gas exhausted and 
3187: quenched, and with overall merger rates declining), 
3188: and a significant fraction of the AGN luminosity 
3189: density can come from $\sim10^{7}\,\msun$ BHs in undisturbed hosts, corresponding 
3190: to relatively low-luminosity ($M_{B}>-23$) Seyfert galaxies. 
3191: By $z\sim0$, the local QLF is largely dominated by Seyfert activity in relatively 
3192: small BHs with late-type, undisturbed host disks \citep{heckman:local.mbh}. 
3193: Our models allow for secular mechanisms, such as the stochastic triggering 
3194: model of \citet{hopkins:seyferts}, to be important at low luminosities, and 
3195: a pure comparison between this secular model and our merger-driven 
3196: prediction here yields a transition to secular dominance at low luminosities 
3197: in good agreement with the empirical constraints. 
3198: 
3199: Ultimately, one would like to test this by directly studying the morphology of 
3200: true, bright quasar hosts at high redshifts. Unfortunately, 
3201: as discussed in \S~\ref{sec:intro}, this remains extremely difficult, and 
3202: results have been ambiguous.
3203: As noted previously, mock observations constructed from numerical major merger simulations 
3204: \citep{krause:mock.qso.obs}
3205: imply that, with the best presently attainable data, the faint, rapidly 
3206: fading tidal features associated with the quasar phase (i.e.\ final stages of the merger, 
3207: at which the spheroid is largely formed and has begun to relax) are difficult to 
3208: observe even locally and (for now) nearly impossible to identify at the 
3209: redshifts of greatest interest ($z\gtrsim1$). Similarly, experiments with automated, 
3210: non-parametric classification schemes \citep{lotz:gini-m20} suggest that the hosts will 
3211: generically be classified as ``normal'' spheroids, even with perfect resolution and no 
3212: surface brightness dimming. 
3213: This appears to be borne out, as recently 
3214: \citet{bennert:qso.hosts} have re-examined very 
3215: low-redshift quasars previously recognized from 
3216: deep HST imaging as having relaxed spheroid hosts, and found (after 
3217: considerably deeper integrations) that every such object shows clear evidence for 
3218: a recent merger. The ability to identify such features may be slightly improved if 
3219: one considers just the population of highly dust-reddened (but still dominated by quasar 
3220: light in the optical/near IR) or IR-luminous quasar expected to be associated with a 
3221: (brief) ``blowout'' stage preceding the more typical optical quasar phase in a merger, and 
3222: it does appear that observations of quasars in this stage, somewhat closer to the peak of 
3223: merger activity, show ubiquitous evidence of recent or ongoing mergers 
3224: \citep{hutchings:redqso.lowz,hutchings:redqso.midz,
3225: kawakatu:type1.ulirgs,guyon:qso.hosts.ir,urrutia:qso.hosts}, albeit still requiring 
3226: very deep integrations. 
3227: 
3228: On the other hand, it is increasingly possible to 
3229: improve the constraints we have studied in this paper, to break the degeneracy between 
3230: secular and merger-driven models of quasar fueling. Improving measurements of 
3231: merger fractions, mass functions, and clustering 
3232: at low redshifts, and extending these measurements to high redshifts, can break 
3233: the degeneracies in our cosmological models (regarding, for example, the appropriate 
3234: merger timescales at high redshifts) and enable more robust, tightly constrained predictions. 
3235: We have also made a large number of predictions in this paper and previous related 
3236: works \citep[e.g.][]{hopkins:qso.all,hopkins:clustering} which can be directly tested 
3237: without the large ambiguities presently inherent in quasar host morphology estimates. 
3238: Better observations of quasar 
3239: host galaxy colors (and corresponding estimates of their recent star formation history), 
3240: improved measurements of quasar clustering at redshifts $z\gtrsim3$ (especially 
3241: measurements which can resolve $\sim\lstar$ quasars at these redshifts), 
3242: detailed cross-correlation measurements of quasars and other galaxy populations 
3243: and clustering measurements which 
3244: can decompose the excess bias of quasars on small scales as a function of 
3245: e.g.\ redshift and luminosity, improved constraints on the bolometric corrections of 
3246: the brightest quasars and the history of the bolometric quasar luminosity density 
3247: at $z\gtrsim3-4$, and estimates of the evolution with redshift of pseudobulge populations 
3248: will all be able to test the models presented in this paper. The combination of these 
3249: observations can greatly strengthen the constraints herein, and ultimately allow for 
3250: more detailed modeling which attempts not just to predict the general origin of quasars in 
3251: mergers, but to fully break down the contribution of major mergers (or mergers of different 
3252: types) and other fueling 
3253: mechanisms to the quasar luminosity functions as a function of luminosity and redshift. 
3254: 
3255: \acknowledgments We thank Josh Younger, Volker Springel, Gordon Richards, 
3256: Chris Hayward, Alice Shapley, Jenny Greene, 
3257: and Yuexing Li for helpful discussions.
3258: This work was supported in part by NSF grant AST
3259: 03-07690, and NASA ATP grants NAG5-12140, NAG5-13292, and NAG5-13381.
3260: 
3261: 
3262: \bibliography{ms}
3263: 
3264: 
3265: 
3266: \end{document}
3267: 
3268: 
3269: