0809:0809.2800/ms.tex

1: % Created from:

2: % mn2esample.tex

3: % v2.1 released 22nd May 2002 (G. Hutton)

4:

5: \documentclass[usegraphicx,usenatbib,usedcolumn,useAMS]{mn2e}

6:

7: %%%%% AUTHORS - PLACE YOUR OWN MACROS HERE %%%%%

8:

9: \newcommand\W{W_{\rm{H}\alpha}}

10: \newcommand\EW{\rm{EW}(\rm{H}\alpha)}

11:

12: %%%-----------------------------------------------------------------------------

13:

14: %%% General maths and units commands

15: \newcommand{\unisim}{\sim\!}

16: %%% References

17: \newcommand{\refsec}[1]{Section \ref{#1}}

18: \newcommand{\reffig}[1]{Fig.~\ref{#1}}

19: \newcommand{\reftab}[1]{Table \ref{#1}}

20:

21: %%%-----------------------------------------------------------------------------

22: % make sure full page visible if processed as A4 or letter

23: \voffset-1.25cm

24: %%%-----------------------------------------------------------------------------

25:

26: \title[Components of the galaxy population]%

27: {Revealing components of the galaxy population

28:   through nonparametric techniques}

29: \author[S. P. Bamford et al.]{%

30: Steven P. Bamford$^{1,2}$\thanks{E-mail: steven.bamford@nottingham.ac.uk},

31: Alex L. Rojas$^{3,4}$, Robert C. Nichol$^{1}$, Christopher J. Miller$^{5}$,\newauthor

32: Larry Wasserman$^{3}$, Christopher R. Genovese$^{3}$, Peter

33: E. Freeman$^{3}$

34: \vspace{6pt}\\

35: $^{1}$Institute of Cosmology and Gravitation, University of

36: Portsmouth, Mercantile House, Hampshire Terrace, Portsmouth, PO1 2EG, UK\\

37: $^{2}$Centre for Astronomy \& Particle Theory, School of Physics \& Astronomy,

38: University of Nottingham, Nottingham, NG7 2RD, UK\\

39: $^{3}$Department of Statistics, Baker Hall, Carnegie Mellon

40: University, Pittsburgh, PA 15213, USA\\

41: $^{4}$Carnegie Mellon University in Qatar, c/o Qatar Foundation,

42: P.O. Box 24866, Doha, Qatar\\

43: $^{5}$Observatorio Cerro Tololo, Observatorio de AURA en Chile,

44: Casilla 603, La Serena, Chile

45: }

46:

47: \begin{document}

48:

49: \date{Accepted ???. Received ???; in original form ???}

50:

51: \pagerange{\pageref{firstpage}--\pageref{lastpage}} \pubyear{2008}

52:

53: \maketitle

54:

55: \label{firstpage}

56:

57: \begin{abstract}

58:   The distributions of galaxy properties vary with environment, and

59:   are often multimodal, suggesting that the galaxy population may be a

60:   combination of multiple components.  The behaviour of these

61:   components versus environment holds details about the processes of

62:   galaxy development.  To release this information we apply a novel,

63:   nonparametric statistical technique, identifying four components

64:   present in the distribution of galaxy H$\alpha$ emission-line

65:   equivalent-widths. We interpret these components as passive,

66:   star-forming, and two varieties of active galactic nuclei.

67:   Independent of this interpretation, the properties of each component

68:   are remarkably constant as a function of environment.  Only their

69:   relative proportions display substantial variation.  The galaxy

70:   population thus appears to comprise distinct components which are

71:   individually independent of environment, with galaxies rapidly

72:   transitioning between components as they move into denser

73:   environments.

74: \end{abstract}

75:

76: \begin{keywords}

77: methods: statistical -- galaxies: statistics -- galaxies: fundamental

78: parameters -- galaxies: clusters: general

79: \end{keywords}

80:

81: \section{Components of the galaxy population}

82: It has long been recognised that galaxies may be divided into at

83: least two distinct sub-populations. Originally this division was based

84: on visual appearance.  Most galaxies can be morphologically

85: classified as either elliptical or spiral.  Finer classification is

86: possible, discretizing an apparently continuous variation in galaxy

87: appearance.  However the dichotomy between elliptical and spiral

88: morphology is more pronounced than the variations within each

89: class.  Subsequently, it has been discovered that several other, more

90: quantitative, galaxy properties are distributed unevenly or in a

91: multi-modal manner.

92:

93: The colour distribution of SDSS galaxies is strongly bimodal

94: \citep{2001AJ....122.1861S}. Galaxies in the ``red'' and ``blue'' modes

95: can be roughly identified as those with elliptical and spiral

96: morphology, respectively

97: \citep{2002AJ....124..646H,2006MNRAS.368..414D}.  Whereas morphology

98: reflects the dynamical state of galaxies, colour is related to their

99: star-formation history, particularly over the last $\la 10^9$

100: years.  The colour bimodality thus implies a division of the galaxy

101: population into blue galaxies, which have recently formed stars, and

102: red galaxies, which have not.  Such a bimodality in the star-formation

103: properties of galaxies has also been observed using more direct

104: measures of current star-formation, such as emission-line strength

105: \citep{2004MNRAS.348.1355B}.

106:

107: The position of the red and blue galaxy sequences in the

108: colour--luminosity or colour--stellar mass planes display only a weak

109: dependence on environment.  However, the relative proportions of

110: galaxies in the two sequences vary strongly. In regions with a higher

111: local galaxy density the fraction of galaxies on the red sequence is

112: higher

113: \citep{2004ApJ...615L.101B,2004AIPC..743..106B,2006MNRAS.373..469B}.

114:

115: It remains a matter of debate whether colour is more closely related to

116: environment than morphology.  Some claim that trends in morphology

117: versus environment can be mostly explained via a morphology--colour

118: relation which is almost independent of environment

119: \citep{2006MNRAS.366....2W,2006astro.ph..8353B,2006astro.ph.10171B,2007MNRAS.376L...1W}.

120: However, other studies oppose this view \citep{2007ApJ...658..898P},

121: and it has been clearly shown that the colour and morphology

122: bimodalities behave differently with respect to environment and

123: stellar mass \citep{2008arXiv0805.2612B}.

124:

125: There are growing indications that, in a fraction of the galaxy

126: population, star-formation must be terminated rapidly

127: \citep{2004MNRAS.348.1355B,2006MNRAS.373..469B}.  The emission lines

128: in galaxy spectra provide a way of measuring the level of current star

129: formation on a timescale of $\la 10^7$ years.  They therefore trace

130: rapid star formation variations more sensitively than colour.  Another

131: important property of emission lines is that they are produced by

132: active galactic nuclei (AGN), in addition to star formation.  AGN are

133: present in many galaxies, and are thought to be produced by accretion

134: of material onto the super-massive black holes which appear to reside

135: at the centre of most, if not all, galaxies

136: \citep{1998Natur.395A..14R}.  Recently a variety of studies have

137: suggested that AGN strongly influence star-formation in their host

138: galaxies, and thus play an important role in defining the galaxy

139: population

140: \citep{2003MNRAS.346.1055K,2005MNRAS.364.1337S,2006MNRAS.365...11C,2006MNRAS.370..645B}.

141: The potential presence of an AGN contribution complicates the

142: traditional usage of emission-lines as an indicator of star

143: formation rate (SFR).  However, it also presents an opportunity to

144: study these two interdependent processes, star-formation and AGN,

145: through the distribution of a single quantity.

146:

147: Galaxies with contrasting properties are found to be distributed

148: differently in space.  Elliptical galaxies cluster together more

149: strongly than spirals \citep{2000ApJ...545....6B,2001ApJ...554..857G}.

150: Similarly, red galaxies are preferentially found in denser

151: environments than blue galaxies \citep{2005ApJ...630....1Z}.  We have a

152: well developed theory for how structure forms in the cosmos, at least

153: in terms of the underlying cold dark matter which dominates the mass

154: density \citep{2005Natur.435..629S}.  Baryonic matter is expected to be

155: similarly distributed, in broad terms.  This theory thus explains the

156: range of galaxy environments observed.  However, the properties of

157: galaxies as a function of environment is a much more complicated

158: issue, depending on the detailed physics of galaxy formation and

159: evolution.  By studying trends in the galaxy population with

160: environment we can learn about these physical processes.

161:

162: There has been a logical progression in studies of galaxy properties

163: as a function of environment.  Early work was based on dividing

164: galaxies into simple classes and looking at variations in the

165: fractions of galaxies of each class in bins of environment

166: \citep{1985ApJ...288..481D}.  As galaxy samples grew, this moved on to

167: examining trends in the mean properties of galaxies as a smooth

168: function of local galaxy density

169: \citep{2002MNRAS.334..673L,2003ApJ...584..210G}. A significant

170: development was fitting to the data functions that describe the

171: distribution of galaxies in two classes

172: \citep{2004ApJ...615L.101B,2004AIPC..743..106B,2006MNRAS.373..469B}.

173: Most of the approaches employed so far have relied upon enforcing a

174: predefined view of how to divide or classify the galaxy population in

175: increasingly complex ways.  However, our understanding of the physical

176: processes at work is highly uncertain and does not provide a

177: sufficient basis to make this decision.  Our only guide is the data

178: itself.  A natural next step is thus to turn to nonparametric methods,

179: where the components of the population are deduced consistently from

180: the data itself.

181:

182: Recently, several studies have performed multivariate statistical

183: analyses on datasets containing a wide variety of galaxy properties,

184: in order to identify components of the galaxy population, and

185: determine which properties are most important for identifying to which

186: component a galaxy belongs

187: \citep{2005MNRAS.363.1257E,2006MNRAS.373.1389C}.  Such studies are

188: highly informative, but become complicated when one wishes to

189: determine the behaviour of the identified components versus another

190: variable.  In this paper we are primarily concerned with variation in

191: the components of the galaxy population as a function of environment.

192: The statistical method we present below may be straightforwardly

193: applied to multivariate datasets.  However, for simplicity, in the

194: present work we consider the environmental dependence of just one

195: galaxy property.  Nevertheless, even with this elementary approach, we

196: are able to learn much about the galaxy population.

197:

198: \begin{figure*}

199: \includegraphics[clip=True,trim = 3cm 22.3cm 9.5cm 3.3cm,width=1.0\textwidth]{fig1.ps}

200: \caption{\label{fig:w13}

201:   The distribution of transformed H$\alpha$ equivalent width

202:   ($\W$) for (left) low and (right) high density environments.  The

203:   histogram displays the data, with Poisson uncertainties indicated by

204:   the grey shading.  The red, purple, green and blue lines show the

205:   components derived by applying the NMR technique.  The brown line

206:   gives the sum of these components, which is clearly a good

207:   representation of the data.}

208: \end{figure*}

209:

210: \section{Conditional density estimation}

211: A common problem in astronomy, and statistical sciences in general, is

212: that one wishes to understand how the behaviour of one variable depends

213: upon another.  This is relatively straightforward in the case where

214: there is a single relationship between the variables, albeit with

215: some, possibly variable, scatter or width to the distribution.  Much

216: statistical and astronomical literature has been devoted to the

217: development of such regression methods \citep{weisberg}.  However, in

218: the case where multiple components may be present in the overall

219: distribution, each with a different functional dependence on the

220: variables, the situation becomes substantially more difficult.  One

221: can still attempt to apply single-component statistical tools, for

222: example nonparametric quantile regression \citep{QR,lQR}, on the whole

223: distribution, but the understanding one gains from such an exercise is

224: limited and sometimes misleading.  Alternatively one may individually

225: analyse subsamples selected by defining regions in the parameter

226: space, or preferably using additional information\citep{MMR}.  This

227: approach, however, is unsuitable when the multiple components

228: significantly overlap, or when it is unclear how many components are

229: present.

230:

231: Most regression techniques focus on estimating the conditional mean,

232: the average value of one variable as a function of another variable;

233: for example, a line through a set of scattered points.  However, one

234: may get a better understanding of the relationship between a response

235: variable and a set of covariates by considering the estimation of the

236: conditional density as a whole; the \emph{distribution} of one

237: variable as a function of another.  (Note that \emph{density} here

238: refers to probability density as a function of the parameter set, not

239: a measure of environmental local galaxy density as elsewhere in this

240: paper.)  We use a new conditional density estimator based on finite

241: mixture models and local likelihood estimation, which describes the

242: underlying relationship between two variables by a set of

243: parameterised functions. This feature gives the proposed procedure the

244: advantage of being easily interpretable. This method is called

245: nonparametric mixture regression (NMR), and is described in detail in

246: Appendix \ref{sec:nmr}.

247:

248: The NMR technique has the potential to aid the understanding of many

249: datasets, across all fields of science.  In the present work, it

250: allows us to determine the environmental dependence for individual

251: components of the galaxy population, with minimal prior assumptions on

252: the number and properties of these components.

253:

254: \section{\boldmath Galaxy H$\alpha$ equivalent widths}

255: \label{sec:Halpha}

256: The strongest emission line in a galaxy optical spectrum is H$\alpha$.

257: The luminosity of H$\alpha$ is approximately proportional to the rate

258: of ongoing star-formation \citep{2006ApJ...642..775M}, when

259: uncontaminated by additional emission, such as from an AGN.  A

260: commonly employed quantity is the equivalent width (EW) of a spectral

261: line, the line flux normalised by the continuum flux at the same

262: wavelength.  The EW measurement has the advantages of being

263: approximately independent of uncertainties in the spectral flux

264: calibration and any extinction present in both the observed galaxy and

265: our own.  The H$\alpha$ line is in the red region of the spectrum,

266: where the continuum is dominated by the light from old stars.  The

267: H$\alpha$ continuum flux is therefore roughly proportional to stellar

268: mass, and hence $\EW$ is approximately proportional to the SFR

269: per unit stellar mass.

270:

271: \begin{figure*}

272: \includegraphics[clip=True,trim = 3cm 22.3cm 9.5cm 3.3cm,width=1.0\textwidth]{fig2.ps}

273: \caption{\label{fig:ew13}As \reffig{fig:w13}, but shown here in terms of the

274:   untransformed equivalent width, $\EW$.  The inset shows the same

275:   plot with axis-ranges chosen to better show the behaviour at small $\EW$.}

276: \end{figure*}

277:

278: The overall distribution of galaxy H$\alpha$ luminosity, equivalent

279: width, and hence absolute and normalised SFR, are

280: found to move to lower levels with increasing environmental density

281: \citep{2002MNRAS.334..673L,2003ApJ...584..210G}.  This generally agrees

282: with the colour and morphology trends described above, and the

283: variation of H$\alpha$ emission with morphological type

284: \citep{2004AJ....127.2511N}.  However, if the galaxy population is

285: separated into galaxies which are star-forming and those which are

286: not, the distribution of $\EW$ for each component

287: does not depend significantly on environment.  Only the relative

288: proportion of star-forming galaxies changes strongly

289: \citep{2004MNRAS.348.1355B} with environment.  This finding, of

290: distinguishable components in the galaxy population with properties

291: independent of environment but proportions which vary strongly,

292: mirrors the behaviour found in the colour distribution.  It also

293: motivates us to perform a more rigorous evaluation of the components

294: present in the galaxy population in this work.

295:

296: As mentioned earlier, an important feature of emission lines is that,

297: in addition to star formation, they are also produced by AGN.

298: Galaxies whose emission lines are dominated by star-formation or AGN

299: activity can be separated using various diagnostic diagrams.  The most

300: common of these plots the emission line ratios

301: ${\rm{[OIII]}\lambda5007}/{\rm{H}\beta}$ versus

302: ${\rm{[NII]}\lambda6583}/{\rm{H}\alpha}$, and is known as the BPT

303: diagram \citep{1981PASP...93....5B}. The usual approach is to use these

304: diagrams to reject objects inappropriate to the particular study.

305: Thus a study of galaxy star formation properties would exclude all

306: galaxies with signs of AGN contamination.  However, classifying a

307: galaxy using the BPT diagram requires multiple emission lines to be

308: detected, resulting in a fraction of objects which cannot be

309: classified.  In addition, the separation between galaxies dominated by

310: star-formation and AGN is not clear, and there appears to be a large

311: population of galaxies which host both star-formation and an AGN.

312: Roughly 20\% of all galaxies are unambiguously AGN-dominated, while it

313: is estimated that a further 20\% are star-forming galaxies with a

314: significant AGN contribution \citep{2003ApJ...597..142M}. This

315: ambiguity means a variety of SFR--AGN demarcations exist

316: \citep{2001ApJ...556..121K,2003MNRAS.346.1055K,2006MNRAS.371..972S}.

317: Star formation studies based on emission lines have therefore rejected

318: widely varying fractions of galaxies from their samples.  This

319: fraction is usually low, so significant numbers of AGN-contaminated

320: galaxies remain.  More importantly, if our aim is to gain knowledge of

321: star-formation properties across the whole galaxy population, then we

322: may be rejecting an important fraction of the population.  If there

323: are any intrinsic correlations between AGN and star-formation, as has

324: been suggested by other studies \citep{2003MNRAS.346.1055K}, then

325: information about these will be lost.

326:

327: A number of classes of AGN have been identified.  A primary

328: distinction is between Type 1 and Type 2 AGN.  In Type 1 objects our

329: viewing angle is such that we see the region immediately around the

330: central black hole directly, and thus the galaxy's light is dominated

331: by the AGN emission.  In this case the properties of the host galaxy

332: are generally very difficult to determine.  In Type 2 AGN, the central

333: region is obscured by a dusty torus surrounding it.  The observed AGN

334: emission is therefore due to material further removed from the central

335: ionising source, and mostly confined to emission lines.  Most

336: photometric and structural galaxy properties may therefore be reliably

337: measured, despite the presence of a Type 2 AGN.  In this work we

338: exclude all Type 1 AGN, identified by the large widths of their

339: emission lines, and consider only the more common Type 2 objects.  A

340: further subdivision within Type 2 AGN is between LINER and Seyfert 2

341: objects.  These are similar, and may simply be two parts of a

342: continuum of objects, with Seyfert 2 AGN being more powerful and

343: highly ionised.  However, there are signs that LINERs and Seyfert 2

344: AGN are truly physically distinct classes \citep{2006MNRAS.372..961K}.

345:

346: In this work we examine the components in the distribution of galaxy

347: $\EW$, interpretable as a proxy for star formation rate and

348: nuclear activity per unit stellar mass.  It is possible to estimate

349: the true star-formation rate and stellar mass, for galaxies which do

350: not host an AGN, using a combination of several spectral features.

351: However, such estimates are sensitive to the details of the assumed

352: model.  There is therefore a concern that any finding concerning the

353: components of the resulting distribution may be attributable to the

354: model.  The $\EW$, on the other hand, is a single, robust,

355: model-independent measurement.

356:

357: The data we use in our study is from Data Release 4 of the SDSS

358: \citep{2006ApJS..162...38A}. The emission line fluxes, continua and

359: resulting EW used in this study are those provided for DR4 by the

360: MPA-Garching group \citep{2004ApJ...613..898T}\footnote{available from

361:   http://www.mpa-garching.mpg.de/SDSS/DR4}.  All quantities used in

362: this paper were obtained from the CMU-PITT SDSS DR4 Value Added

363: Catalog\footnote{available from

364:   http://nvogre.phyast.pitt.edu/dr4\_value\_added} (VAC).  The SQL code for

365: the selection of each of our samples is given in \reftab{tab:sql}.

366: We construct a volume-limited sample by selecting galaxies with $0.05

367: < z < 0.095$ and $M_r < -20.4$.  In this work we thus focus on the

368: behaviour of fairly bright galaxies.  The lower redshift limit ensures

369: the spectra are based on a reasonable fraction of the galaxies' light;

370: at $z=0.05$ the $3$~arcsec diameter of each spectroscopic fibre

371: corresponds to $3$~kpc.  Throughout we convert to physical scales

372: assuming a flat Friedman-Robertson-Walker cosmology with $\Omega_m =

373: 0.3$, $\Omega_{\lambda} = 0.7$ and $H_0 = 70$ km~s$^{-1}$~Mpc$^{-1}$.

374:

375: \begin{table}

376:   \caption{\label{tab:sql}Definitions of the galaxy samples used in this

377:     study, given as `where' clauses of the SQL queries of the CMU-PITT SDSS

378:     DR4 VAC}

379: \begin{tabular}{p{0.06\textwidth}p{0.31\textwidth}c}

380: \hline

381: \centering sample & \centering SQL selection & $n$ \\

382: \hline \hline

383: density defining sample &

384: \texttt{!z between 0.02 and 0.10 and absolute\_Petro\_r <= -20.4 and Sort~=~0}

385: &

386: 117873\\

387: \hline

388: $\rho_{1.3}$ \mbox{sample} &

389: \texttt{!z between 0.05 and 0.095 and absolute\_Petro\_r <= -20.4 and

390:  2.4 < Dist\_right\_edge and 2.4 < Dist\_left\_edge and

391:  2.4 < Dist\_upper\_edge and 2.4 < Dist\_lower\_edge and

392:  H\_ALPHA\_FLUX > -99 and H\_ALPHA\_CONT > 0.0001 and

393:  H\_ALPHA\_FLUX/H\_ALPHA\_CONT > -0.4 and

394:  absolute\_Petro\_u > -990 and absolute\_Petro\_r > -990

395:  and Sort~=~0}

396: &

397: 76420\\

398: \hline

399: $\rho_{5.5}$ \mbox{sample} &

400: \texttt{!z between 0.05 and 0.095 and absolute\_Petro\_r <= -20.4 and

401:  11 < Dist\_right\_edge and 11 < Dist\_left\_edge and

402:  11 < Dist\_upper\_edge and 11 < Dist\_lower\_edge and

403:  H\_ALPHA\_FLUX > -99 and H\_ALPHA\_CONT > 0.0001 and

404:  H\_ALPHA\_FLUX/H\_ALPHA\_CONT > -0.4 and

405:  absolute\_Petro\_u > -990 and absolute\_Petro\_r > -990

406:  and Sort~=~0}

407: &

408: 46998\\

409: \hline

410: \end{tabular}

411: \end{table}

412:

413: \section{\boldmath Measuring galaxy environment}

414: Galaxy environment can be characterised in many ways, but a commonly

415: adopted value is the local number density of galaxies brighter than a

416: given luminosity, averaged over some volume or kernel.  We estimate

417: the local galaxy number density, $\rho_b$, within a fixed-scale,

418: spherical kernel with a Gaussian radial profile and bandwidth $b$.

419: Our local galaxy densities are thus simple to interpret physically.

420:

421: To select the bandwidth, or scale, $b$ of the kernel, we apply

422: leave-one-out cross-validation; that is, we select the value of $b$

423: which minimizes the estimated integrated mean squared error, $CV(b)$. This

424: error is obtained by estimating the density function $n$ times, each

425: time leaving out one galaxy from the estimation:

426: \begin{equation}

427: CV(b) = \int \widehat f_{n,b}^{\;2}(\bmath{x}) d\bmath{x} -

428: \frac{2}{n}\sum_{i=1}^n \widehat f_{(-i),b}(\bmath{X_i})

429: \end{equation}

430: where $\{\bmath{X_i}\}$ is the set of galaxy positions, and $\widehat

431: f_{n,b}$ and $\widehat f_{(-i),b}$ are the kernel density estimators

432: with bandwidth $b$, using all $n$ galaxies and after removing the

433: $i^{\rmn{th}}$ galaxy, respectively.  We compute $CV(b)$ for a range

434: of different bandwidth values to find that which minimizes the error.

435: Applying this cross-validation method we determine an optimum

436: bandwidth value of $1.3$~Mpc.  A similar optimum bandwidth for local

437: galaxy density estimation was found using cross-validation by

438: \citet{2004MNRAS.348.1355B}.

439:

440: Interestingly, this scale corresponds to the size of

441: galaxy clusters, and is thus highly appropriate for characterising

442: density from a physical, as well as a statistical, point of view.

443: However, while cross-validation provides the statistically optimum

444: bandwidth for the whole sample, any choice of bandwidth has its

445: limitations.  This density estimator loses resolution at low

446: densities, where there are no neighbouring galaxies within the kernel

447: bandwidth, and is thus unable to discriminate between densities lower

448: than $\rho_{1.3} \sim 0.03$~Mpc$^{-3}$, comprising 17\% of the sample.

449: In order to probe environments less dense than this, but necessarily

450: on larger physical scales, we additionally perform the analysis with

451: local densities measured using a larger bandwidth of $5.5$~Mpc.

452: Almost all galaxies have a neighbour within this radius.  One could

453: also consider estimating densities with a kernel bandwidth

454: significantly smaller than $1.3$~Mpc. However, such an estimator would

455: lose resolution below even moderate densities, where galaxies are

456: typically separated by more than the bandwidth.  It would also be less

457: able to discriminate between high density environments, because the

458: densities are estimated using galaxy positions uncorrected for

459: redshift-space distortions, and hence an increase in true-space

460: density no longer results in a higher redshift-space density within

461: the kernel.  We mostly show results based on the

462: statistically-motivated $1.3$~Mpc bandwidth in the main body of this

463: article, but provide figures using the $5.5$~Mpc bandwidth in Appendix

464: \ref{sec:55Mpc}, to demonstrate that we find similar results on larger

465: scales and to lower densities.

466:

467: We avoid biased density estimates for galaxies at the edges of our

468: sample volume by determining the densities using a larger volume

469: sample of galaxies with $0.02 < z < 0.10$ and $M_r < -20.4$.

470: We then limit the analysis sample to galaxies with $0.05 < z < 0.095$

471: and further than approximately twice the bandwidth from a survey

472: boundary.  We reject a further 3\% of galaxies with unreliable $\EW$

473: or $(u-r)$ rest-frame colour measurements.  The exact selections, and

474: corresponding sample sizes, are given in \reftab{tab:sql}.

475:

476: \begin{figure*}

477: \includegraphics[clip=True,trim = 3cm 22cm 5cm 3.3cm,width=1.0\textwidth]{fig3.ps}

478: \caption{\label{fig:wrho13}The behaviour of the NMR components versus

479:   environment.  The left panel plots the data as dots, along with the

480:   location of each component, indicated by thick, solid lines, and additionally

481:   their widths via the coloured shading and dashed lines.  These widths

482:   are shown explicitly in the middle panel.  The right panel displays

483:   the variation in the proportion of each component.  While the

484:   location and width of the components do not change significantly

485:   with environment, the proportions vary strongly.}

486: \end{figure*}

487:

488: \section{Applying the NMR technique}

489:

490: \begin{figure*}

491: \includegraphics[clip=True,trim = 5cm 7.5cm 10cm 2cm,width=1.0\textwidth]{fig4.ps}

492: \caption{\label{fig:3Dwrho55}A three-dimensional view of the NMR estimate of the

493:   $\W$--$\rho_{5.5}$ distribution, shown by the grey, transparent

494:   surface, and its constituent components, colour-coded is in the

495:   previous figures.  It can be clearly seen that the positions and widths of the

496:   components do not change significantly, while their relative

497:   proportions vary substantially.}

498: \end{figure*}

499:

500: A brief inspection of the sample $\EW$ distribution reveals a peak

501: around zero EW, with a long, asymmetric tail to high EW.  The NMR

502: technique is more computationally efficient when using symmetrical,

503: Gaussian functions to model the distribution.  Gaussians are also an

504: obvious choice due to their exceptional richness and flexibility.  For

505: convenience we therefore wish to transform the equivalent width

506: quantity to a space where its natural components appear to take a more

507: symmetrical, Gaussian, form.  Better matching the shape of the true

508: distribution components to that assumed in the NMR technique will also

509: naturally result in fewer NMR components being required to model the

510: distribution (but see Appendix \ref{sec:nmr}).  The EW extend slightly to

511: negative values, proscribing a simple logarithmic transformation.  We

512: therefore choose the transformation $\W = \log_{10}(\EW + \lambda)$.

513: The zero offset parameter, $\lambda$, must be large enough to make the

514: logarithm argument positive for the most negative EW value in our

515: sample. In constructing our sample we remove outliers by requiring

516: $\EW > -0.4$, thereby clipping the lowest 0.1\% of the sample.

517: Therefore, we must have $\lambda > 0.4$.  We have examined the

518: behaviour of our NMR fits and their likelihood with variations in

519: $\lambda$.  The chosen value has only a relatively small effect,

520: slightly altering the shape of the Gaussian basis functions once they

521: are transformed back into EW space, but not changing our results

522: significantly.  Here we adopt $\lambda = 1.4$ as a compromise between

523: maximising the fit likelihood and ensuring stable behaviour.  We must

524: also choose a reasonable bandwidth for the regression kernel in

525: $\rho$.  Following extensive tests we adopt an adaptive bandwidth

526: enclosing the nearest 5000 points (also see discussion in Appendix

527: \ref{sec:nmr}).

528:

529: We apply the NMR technique to the distribution of $\W$, and determine

530: the optimum number of components using the Bayesian Information

531: Criterion \citep[BIC;][]{BIC}.  Four components are strongly preferred

532: by the data, by $\Delta$BIC $>$ 7 (see Appendix \ref{sec:nmr} for more

533: details).  In \reffig{fig:w13} (\reffig{fig:w55}) we show the NMR

534: components we obtain for the $\rho_{1.3}$ ($\rho_{5.5}$) sample, at

535: two values of local galaxy density.  The components are plotted in

536: $\W$-space, in which the technique is applied.  We also show the

537: components and data transformed back into $\EW$-space in

538: \reffig{fig:ew13} (\reffig{fig:ew55}).  The properties of these

539: components as a function of environmental density are shown in

540: \reffig{fig:wrho13} (\reffig{fig:wrho55}).  In \reffig{fig:3Dwrho55}

541: we show a three-dimensional view of the components and their sum for

542: the $\rho_{5.5}$ sample, which includes all the relevant information

543: (location, width and relative proportion of each component) in a

544: single plot.  We show the results for the $\rho_{5.5}$ here simply

545: because they are smoother than those for $\rho_{1.3}$, and the

546: individual components are more clearly visible in this

547: three-dimensional view.  It is critical to note that the only data

548: which has been used to determine these components is the $\EW$

549: distribution.

550:

551: At this stage we make no attempt at interpreting the components as

552: physically distinct populations. Nevertheless, Figs. \ref{fig:w13},

553: \ref{fig:w55}, \ref{fig:ew13}, \ref{fig:ew55} indicate that the $\EW$

554: distribution can be well described by multiple components.  The

555: hypothesis that the galaxy population comprises distinct components,

556: or types, is strongly supported by the various property bimodalities

557: described earlier.  We find that the locations and widths of the

558: components of the $\EW$ distribution are independent of environment.

559: Only the relative proportions of the components are found to vary

560: strongly.  This implies that the variations with environment are

561: primarily the result of differences in the relative frequency of each

562: galaxy type, rather than changes in the intrinsic properties of each

563: type.

564:

565: Galaxies move to regions of higher density over time, under the

566: influence of gravity.  The variation of galaxy properties with

567: environment is therefore at least partly due to

568: environmentally-dependent changes in individual galaxy properties over

569: time.  If all galaxies in a given environment were affected similarly,

570: we would expect to see smooth changes in the property distributions of

571: each individual component.  However, we find that the individual

572: components remain mostly unchanged with environment.  This implies

573: that some galaxies are transformed directly from one type to another,

574: in an apparently stochastic manner.  If this transformation is

575: sufficiently slow, we would expect to see the transitioning galaxies

576: appearing as a separate component in the relevant range of local

577: density.  If it is rapid, then the fraction of transitioning galaxies

578: at any time would be too low to separate from the main distribution.

579:

580: \begin{figure}

581: \includegraphics[angle=270,width=0.45\textwidth]{fig5.ps}

582: \caption{\label{fig:bpt13}The BPT diagram for our $\rho_{1.3}$ sample, traditionally

583:   used to identify star-forming galaxies and AGN hosts.  For clarity,

584:   only one-fifth of our sample galaxies are plotted. The \emph{LINER},

585:   \emph{Seyfert 2} and \emph{SF dominated} regions are colour-coded to

586:   match our interpretation of their correspondence to the NMR

587:   components shown in the other figures (purple, blue and green,

588:   respectively).  Note that many galaxies cannot be placed on this

589:   diagram.  These are \emph{passive} galaxies, with no emission lines,

590:   and \emph{uncertain} galaxies, with some detected emission lines,

591:   but not all four of those required for inclusion in this diagram.}

592: \end{figure}

593:

594: \section{Identifying the components}

595: It is easy to identify the component at zero $\EW$ with passive

596: galaxies, containing no star-formation or AGN activity.  The dominant

597: component at high $\EW$ must be associated with star-forming galaxies

598: (with the above caveats concerning potential AGN contamination).  We

599: also find two intermediate $\EW$ components.  The principle change

600: with environment appears to be the movement of galaxies from the

601: star-forming component to the others, but primarily to the passive

602: component.  However, interpreting either of these intermediate EW

603: components as a population transitioning between star-forming and

604: passive is inconsistent with their existence as a significant fraction

605: of the galaxy population even at low environmental densities.

606:

607: To explore the physical interpretation of the components we have

608: found, we now turn to more traditional diagnostics to separate the

609: contributions from star formation (SF) and AGN to the emission lines.

610: The BPT diagram for our $\rho_{1.3}$ sample is shown in \reffig{fig:bpt13}.  In

611: order to appear on this plot, all four required emission lines must be

612: detected at $>2$~sigma significance.  The classifications we define

613: are as follows;

614: %

615: \emph{passive}: no emission lines detected,

616: %

617: \emph{SF dominated}: all four lines detected and below the curve of

618: \citet{2006MNRAS.371..972S},

619: %

620: \emph{AGN dominated}: above the line of \citet{2001ApJ...556..121K}

621: with either all four lines detected or with both lines for just one of

622: the ratios detected and ${\rm{[OIII]}}/{\rm{H}\beta} > 0.6$ or

623: ${\rm{[NII]}}/{\rm{H}\alpha} > 0.05$,

624: %

625: \emph{AGN+SF}: all four lines detected and between the curves of

626: \citet{2001ApJ...556..121K} and \citet{2003MNRAS.346.1055K},

627: %

628: \emph{SF+AGN}: all four lines detected and between the curves of

629: \citet{2006MNRAS.371..972S} and \citet{2003MNRAS.346.1055K},

630: %

631: \emph{uncertain}: at least one of the four emission lines detected,

632: but none of the other classification criteria met.

633: %

634: Note that the majority of AGN-dominated galaxies can be robustly

635: identified simply from their ${\rm{[NII]}}/{\rm{H}\alpha}$ ratio

636: \citep{2003ApJ...597..142M,2006MNRAS.371..972S}.

637:

638: Our classification method is such that galaxies classified as

639: \emph{AGN dominated} must contain a significant AGN component, and

640: will have low contribution to their emission lines from star

641: formation.  On the other hand \emph{SF dominated} galaxies may well

642: also contain up to $\sim 20$--$40$\% AGN contamination in their

643: emission lines \citep{2003MNRAS.346.1055K,2006MNRAS.371..972S}.  The

644: \emph{AGN dominated} galaxies can be further subdivided into

645: \emph{LINER} and \emph{Seyfert 2} sources using the BPT diagram

646: \citep{2003MNRAS.346.1055K}.

647:

648: \begin{figure*}

649: \includegraphics[clip=True,trim = 3cm 22cm 9.5cm 3.3cm,width=1.0\textwidth]{fig6.ps}

650: \caption{\label{fig:wrho13bpt}The $\W$--$\rho_{1.3}$ distribution for

651:   objects in our sample colour-coded by their location in the BPT

652:   diagram shown in Fig.~4.  The lines indicate the median $\W$ in bins

653:   of $\rho_{1.3}$ for each subsample.  The left panel shows

654:   \emph{passive}, \emph{LINER}, \emph{Seyfert 2} and \emph{SF

655:     dominated} galaxies (in order of increasing $\W$), while the right

656:   panel shows \emph{uncertain}, \emph{AGN+SF} and \emph{SF+AGN}

657:   galaxies (brown, orange and cyan, respectively, and again in order

658:   of increasing $\W$).  A comparison with Fig.~2 reveals a

659:   correspondence between the NMR components and, in order of

660:   increasing $\W$, (1) \emph{passive} galaxies, (2) \emph{LINER} and

661:   \emph{uncertain} galaxies, (3) \emph{Seyfert 2} and \emph{AGN+SF}

662:   galaxies, and (4) \emph{SF dominated} and \emph{SF+AGN} galaxies.}

663: \end{figure*}

664:

665: Figure \ref{fig:wrho13bpt} shows the $\W$--$\rho_{1.3}$ distributions

666: of galaxies classified using the BPT diagram. Comparing with

667: \reffig{fig:wrho13}, one can clearly identify the NMR components with

668: the \emph{passive}, \emph{LINER}, \emph{Seyfert 2} and \emph{SF

669:   dominated} BPT-classified galaxies.  The large fraction of galaxies

670: for which the BPT diagram gives an uncertain result may also be

671: identified with the components.  The galaxies with apparently mixed

672: star formation and AGN emission are found at similar $\W$ to the

673: \emph{Seyfert 2} objects, and the higher intermediate NMR component.

674: Galaxies with at least one emission line, but which cannot be

675: identified via the BPT diagram have similar $\W$ to \emph{LINER}

676: objects and the lower NMR component.  While not conclusive, this

677: strongly suggests that the components derived from the NMR technique

678: do represent physically distinct populations.  This is remarkable

679: given that the NMR components have been inferred from just a single

680: emission line.

681:

682: \section{A new insight into the galaxy population}

683:

684: By applying the newly developed NMR method to the H$\alpha$ equivalent

685: width distribution, a single astrophysical quantity that contains

686: information on both star formation and nuclear activity, we have

687: identified four distinct components in the galaxy population.  None of

688: these components vary significantly with environment, in terms of the

689: distribution of their H$\alpha$ equivalent widths.  However, the relative

690: proportions of galaxies in each component vary substantially with

691: environment.  This implies that any environmental processes at work do

692: not affect all galaxies in a gradual way, which would result in

693: changes in the component H$\alpha$ equivalent width distributions.

694: Rather, they must rapidly transform a fraction of galaxies from one

695: component to another, in a stochastic manner, in order to avoid

696: changing the properties of the individual components.

697:

698: The above conclusions stand without requiring us to identify the

699: components with more traditional galaxy sub-populations.  However,

700: when we attempt such an identification, we find that the extreme

701: components may be associated with passive and star-forming galaxies,

702: while the two intermediate components display similarities to galaxies

703: hosting LINERs and Seyfert 2 AGN.  Galaxies with an apparent mix of

704: star-formation and AGN may also be identified with these components.

705: However, in contrast to the usual methods of classifying the

706: star-formation and AGN properties of galaxies, which require multiple

707: emission lines to be significantly detected, the technique we describe

708: in this paper is applicable to all galaxies.  We thereby avoid the

709: issue of excluding objects for which traditional methods are

710: uncertain, and the biases which this may introduce.

711:

712: \section*{Acknowledgements}

713: SPB acknowledges support from an STFC postdoctoral grant.  AR

714: acknowledges the Qatar Foundation for Education, Science and Community

715: Development.  RCN holds a Marie Curie Excellence Chair from the

716: European Commission.  We thank the NSF for funding this

717: inter-disciplinary research through their KDI initiative.

718: Three-dimensional visualisation was conducted with the S2PLOT

719: programming library \citep{2006PASA...23...82B}.  We are grateful to

720: the referee, Dr. Nicholas Ball, for useful comments.

721:

722: \bsp

723:

724: % Bibliography generated by BibTeX and pasted in from bbl file

725: %\bibliographystyle{mn2e}

726: %\bibliography{sdss_halpha}

727: \begin{thebibliography}{}

728: \small

729:

730: \bibitem[\protect\citeauthoryear{{Adelman-McCarthy} et~al.,}{{Adelman-McCarthy}

731:    et~al.}{2006}]{2006ApJS..162...38A}

732: {Adelman-McCarthy} J.~K.,  et~al., 2006, ApJS, 162, 38

733:

734: \bibitem[\protect\citeauthoryear{{Baldry}, {Balogh}, {Bower}, {Glazebrook} \&

735:   {Nichol}}{{Baldry} et~al.}{2004}]{2004AIPC..743..106B}

736: {Baldry} I.~K.,  {Balogh} M.~L.,  {Bower} R.,  {Glazebrook} K.,    {Nichol}

737:   R.~C.,  2004, in {Allen} R.~E.,  {Nanopoulos} D.~V.,   {Pope} C.~N.,  eds,

738:   The New Cosmology: Conference on Strings and Cosmology Vol.~743 of American

739:   Institute of Physics Conference Series, {Color bimodality: Implications for

740:   galaxy evolution}.

741: pp 106--119

742:

743: \bibitem[\protect\citeauthoryear{{Baldry}, {Balogh}, {Bower}, {Glazebrook},

744:   {Nichol}, {Bamford} \& {Budavari}}{{Baldry}

745:   et~al.}{2006}]{2006MNRAS.373..469B}

746: {Baldry} I.~K.,  {Balogh} M.~L.,  {Bower} R.~G.,  {Glazebrook} K.,  {Nichol}

747:   R.~C.,  {Bamford} S.~P.,    {Budavari} T.,  2006, MNRAS, 373, 469

748:

749: \bibitem[\protect\citeauthoryear{{Baldwin}, {Phillips} \&

750:   {Terlevich}}{{Baldwin} et~al.}{1981}]{1981PASP...93....5B}

751: {Baldwin} J.~A.,  {Phillips} M.~M.,    {Terlevich} R.,  1981, PASP, 93, 5

752:

753: \bibitem[\protect\citeauthoryear{{Ball}, {Loveday} \& {Brunner}}{{Ball}

754:   et~al.}{2006}]{2006astro.ph.10171B}

755: {Ball} N.~M.,  {Loveday} J.,    {Brunner} R.~J.,  2008, MNRAS, 383, 907

756:

757: \bibitem[\protect\citeauthoryear{{Balogh} et~al.,}{{Balogh}

758:   et~al.}{2004}]{2004MNRAS.348.1355B}

759: {Balogh} M.,  et~al., 2004, MNRAS, 348, 1355

760:

761: \bibitem[\protect\citeauthoryear{{Balogh}, {Baldry}, {Nichol}, {Miller},

762:   {Bower} \& {Glazebrook}}{{Balogh} et~al.}{2004}]{2004ApJ...615L.101B}

763: {Balogh} M.~L.,  {Baldry} I.~K.,  {Nichol} R.,  {Miller} C.,  {Bower} R.,

764:   {Glazebrook} K.,  2004, ApJL, 615, 101

765:

766: \bibitem[\protect\citeauthoryear{{Bamford}, {Nichol}, {Baldry}, {Land},

767:   {Lintott}, {Schawinski}, {Slosar}, {Szalay}, {Thomas}, {Torki}, {Andreescu},

768:   {Edmondson}, {Miller}, {Murray}, {Raddick} \& {Vandenberg}}{{Bamford}

769:   et~al.}{2008}]{2008arXiv0805.2612B}

770: {Bamford} S.~P.,  {Nichol} R.~C.,  {Baldry} I.~K.,  {Land} K.,  {Lintott}

771:   C.~J.,  {Schawinski} K.,  {Slosar} A.,  {Szalay} A.~S.,  {Thomas} D.,

772:   {Torki} M.,  {Andreescu} D.,  {Edmondson} E.~M.,  {Miller} C.~J.,  {Murray}

773:   P.,  {Raddick} M.~J.,    {Vandenberg} J.,  2008, ArXiv:0805.2612

774:

775: \bibitem[\protect\citeauthoryear{{Barnes}, {Fluke}, {Bourke} \&

776:   {Parry}}{{Barnes} et~al.}{2006}]{2006PASA...23...82B}

777: {Barnes} D.~G.,  {Fluke} C.~J.,  {Bourke} P.~D.,    {Parry} O.~T.,  2006,

778:   Publications of the Astronomical Society of Australia, 23, 82

779:

780: \bibitem[\protect\citeauthoryear{{Beisbart} \& {Kerscher}}{{Beisbart} \&

781:   {Kerscher}}{2000}]{2000ApJ...545....6B}

782: {Beisbart} C.,  {Kerscher} M.,  2000, ApJ, 545, 6

783:

784: \bibitem[\protect\citeauthoryear{{Blanton}, {Berlind} \& {Hogg}}{{Blanton}

785:   et~al.}{2006}]{2006astro.ph..8353B}

786: {Blanton} M.~R.,  {Berlind} A.~A.,    {Hogg} D.~W.,  2007, ApJ, 664, 791

787:

788: \bibitem[\protect\citeauthoryear{{Bower}, {Benson}, {Malbon}, {Helly}, {Frenk},

789:   {Baugh}, {Cole} \& {Lacey}}{{Bower} et~al.}{2006}]{2006MNRAS.370..645B}

790: {Bower} R.~G.,  {Benson} A.~J.,  {Malbon} R.,  {Helly} J.~C.,  {Frenk} C.~S.,

791:   {Baugh} C.~M.,  {Cole} S.,    {Lacey} C.~G.,  2006, MNRAS, 370, 645

792:

793: \bibitem[\protect\citeauthoryear{Cherkassky \& Ma}{Cherkassky \&

794:   Ma}{2005}]{MMR}

795: Cherkassky V.,  Ma Y.,  2005, IEEE Transactions on Neural Networks, 16, 785

796:

797: \bibitem[\protect\citeauthoryear{Conselice}{2006}]{2006MNRAS.373.1389C}

798: Conselice C.~J., 2006, MNRAS, 373, 1389

799:

800: \bibitem[\protect\citeauthoryear{{Croton}, {Springel}, {White}, {De Lucia},

801:   {Frenk}, {Gao}, {Jenkins}, {Kauffmann}, {Navarro} \& {Yoshida}}{{Croton}

802:   et~al.}{2006}]{2006MNRAS.365...11C}

803: {Croton} D.~J.,  {Springel} V.,  {White} S.~D.~M.,  {De Lucia} G.,  {Frenk}

804:   C.~S.,  {Gao} L.,  {Jenkins} A.,  {Kauffmann} G.,  {Navarro} J.~F.,

805:   {Yoshida} N.,  2006, MNRAS, 365, 11

806:

807: \bibitem[\protect\citeauthoryear{{Dressler}, {Thompson} \&

808:   {Shectman}}{{Dressler} et~al.}{1985}]{1985ApJ...288..481D}

809: {Dressler} A.,  {Thompson} I.~B.,    {Shectman} S.~A.,  1985, ApJ, 288, 481

810:

811: \bibitem[\protect\citeauthoryear{{Driver} et~al.,}{{Driver}

812:   et~al.}{2006}]{2006MNRAS.368..414D}

813: {Driver} S.~P.,  et~al., 2006, MNRAS, 368, 414

814:

815: \bibitem[\protect\citeauthoryear{Ellis et al.}{2005}]{2005MNRAS.363.1257E}

816: Ellis S.~C., Driver S.~P., Allen P.~D., Liske J., Bland-Hawthorn J., De

817: Propris R., 2005, MNRAS, 363, 1257

818:

819: \bibitem[\protect\citeauthoryear{{Giuricin}, {Samurovi{\'c}}, {Girardi},

820:   {Mezzetti} \& {Marinoni}}{{Giuricin} et~al.}{2001}]{2001ApJ...554..857G}

821: {Giuricin} G.,  {Samurovi{\'c}} S.,  {Girardi} M.,  {Mezzetti} M.,

822:   {Marinoni} C.,  2001, ApJ, 554, 857

823:

824: \bibitem[\protect\citeauthoryear{{G{\'o}mez} et~al.,}{{G{\'o}mez}

825:   et~al.}{2003}]{2003ApJ...584..210G}

826: {G{\'o}mez} P.~L.,  et~al., 2003, ApJ, 584, 210

827:

828: \bibitem[\protect\citeauthoryear{{Hogg} et~al.,}{{Hogg}

829:   et~al.}{2002}]{2002AJ....124..646H}

830: {Hogg} D.~W.,  et~al., 2002, AJ, 124, 646

831:

832: \bibitem[\protect\citeauthoryear{Kass \& Raftery}{Kass \& Raftery}{1995}]{KR95}

833: Kass R.~E.,  Raftery A.~E.,  1995, Journal of the American Statistical

834:   Association, 90, 773

835:

836: \bibitem[\protect\citeauthoryear{{Kauffmann}, {Heckman}, {Tremonti},

837:   {Brinchmann}, {Charlot}, {White}, {Ridgway}, {Brinkmann}, {Fukugita}, {Hall},

838:   {Ivezi{\'c}}, {Richards} \& {Schneider}}{{Kauffmann}

839:   et~al.}{2003}]{2003MNRAS.346.1055K}

840: {Kauffmann} G.,  {Heckman} T.~M.,  {Tremonti} C.,  {Brinchmann} J.,  {Charlot}

841:   S.,  {White} S.~D.~M.,  {Ridgway} S.~E.,  {Brinkmann} J.,  {Fukugita} M.,

842:   {Hall} P.~B.,  {Ivezi{\'c}} {\v Z}.,  {Richards} G.~T.,    {Schneider} D.~P.,

843:    2003, MNRAS, 346, 1055

844:

845: \bibitem[\protect\citeauthoryear{{Kewley}, {Dopita}, {Sutherland}, {Heisler} \&

846:   {Trevena}}{{Kewley} et~al.}{2001}]{2001ApJ...556..121K}

847: {Kewley} L.~J.,  {Dopita} M.~A.,  {Sutherland} R.~S.,  {Heisler} C.~A.,

848:   {Trevena} J.,  2001, ApJ, 556, 121

849:

850: \bibitem[\protect\citeauthoryear{{Kewley}, {Groves}, {Kauffmann} \&

851:   {Heckman}}{{Kewley} et~al.}{2006}]{2006MNRAS.372..961K}

852: {Kewley} L.~J.,  {Groves} B.,  {Kauffmann} G.,    {Heckman} T.,  2006, MNRAS,

853:   372, 961

854:

855: \bibitem[\protect\citeauthoryear{{Koenker} \& {Bassett}}{{Koenker} \&

856:   {Bassett}}{1978}]{QR}

857: {Koenker} R.,  {Bassett} G.,  1978, Econometrica, 46, 33

858:

859: \bibitem[\protect\citeauthoryear{{Lewis} et~al.,}{{Lewis}

860:   et~al.}{2002}]{2002MNRAS.334..673L}

861: {Lewis} I.,  et~al., 2002, MNRAS, 334, 673

862:

863: \bibitem[\protect\citeauthoryear{{McLachlan} \& {Krishnan}}{{McLachlan} \&

864:   {Krishnan}}{1997}]{EM}

865: {McLachlan} G.,  {Krishnan} T.,  1997, The EM algorithm and extensions (Wiley

866:   series in probability and statistics).

867: John Wiley \& Sons

868:

869: \bibitem[\protect\citeauthoryear{{Miller}, {Nichol}, {G{\'o}mez}, {Hopkins} \&

870:   {Bernardi}}{{Miller} et~al.}{2003}]{2003ApJ...597..142M}

871: {Miller} C.~J.,  {Nichol} R.~C.,  {G{\'o}mez} P.~L.,  {Hopkins} A.~M.,

872:   {Bernardi} M.,  2003, ApJ, 597, 142

873:

874: \bibitem[\protect\citeauthoryear{{Moustakas}, {Kennicutt} Jr. \&

875:   {Tremonti}}{{Moustakas} et~al.}{2006}]{2006ApJ...642..775M}

876: {Moustakas} J.,  {Kennicutt} Jr. R.~C.,    {Tremonti} C.~A.,  2006, ApJ, 642,

877:   775

878:

879: \bibitem[\protect\citeauthoryear{{Nakamura}, {Fukugita}, {Brinkmann} \&

880:   {Schneider}}{{Nakamura} et~al.}{2004}]{2004AJ....127.2511N}

881: {Nakamura} O.,  {Fukugita} M.,  {Brinkmann} J.,    {Schneider} D.~P.,  2004,

882:   AJ, 127, 2511

883:

884: \bibitem[\protect\citeauthoryear{{Park}, {Choi}, {Vogeley}, {Gott} \&

885:   {Blanton}}{{Park} et~al.}{2007}]{2007ApJ...658..898P}

886: {Park} C.,  {Choi} Y.-Y.,  {Vogeley} M.~S.,  {Gott} J.~R.~I.,    {Blanton}

887:   M.~R.,  2007, ApJ, 658, 898

888:

889: \bibitem[\protect\citeauthoryear{{Richstone}, {Ajhar}, {Bender}, {Bower},

890:   {Dressler}, {Faber}, {Filippenko}, {Gebhardt}, {Green}, {Ho}, {Kormendy},

891:   {Lauer}, {Magorrian} \& {Tremaine}}{{Richstone}

892:   et~al.}{1998}]{1998Natur.395A..14R}

893: {Richstone} D.,  {Ajhar} E.~A.,  {Bender} R.,  {Bower} G.,  {Dressler} A.,

894:   {Faber} S.~M.,  {Filippenko} A.~V.,  {Gebhardt} K.,  {Green} R.,  {Ho} L.~C.,

895:    {Kormendy} J.,  {Lauer} T.~R.,  {Magorrian} J.,    {Tremaine} S.,  1998,

896:   Nature, 395, A14

897:

898: \bibitem[\protect\citeauthoryear{Schwarz}{Schwarz}{1978}]{BIC}

899: Schwarz G.,  1978, The Annals of Statistics, 6, 461

900:

901: \bibitem[\protect\citeauthoryear{{Silk}}{{Silk}}{2005}]{2005MNRAS.364.1337S}

902: {Silk} J.,  2005, MNRAS, 364, 1337

903:

904: \bibitem[\protect\citeauthoryear{{Springel}, {White}, {Jenkins}, {Frenk},

905:   {Yoshida}, {Gao}, {Navarro}, {Thacker}, {Croton}, {Helly}, {Peacock}, {Cole},

906:   {Thomas}, {Couchman}, {Evrard}, {Colberg} \& {Pearce}}{{Springel}

907:   et~al.}{2005}]{2005Natur.435..629S}

908: {Springel} V.,  {White} S.~D.~M.,  {Jenkins} A.,  {Frenk} C.~S.,  {Yoshida} N.,

909:    {Gao} L.,  {Navarro} J.,  {Thacker} R.,  {Croton} D.,  {Helly} J.,

910:   {Peacock} J.~A.,  {Cole} S.,  {Thomas} P.,  {Couchman} H.,  {Evrard} A.,

911:   {Colberg} J.,    {Pearce} F.,  2005, Nature, 435, 629

912:

913: \bibitem[\protect\citeauthoryear{{Stasi{\'n}ska}, {Cid Fernandes}, {Mateus},

914:   {Sodr{\'e}} \& {Asari}}{{Stasi{\'n}ska} et~al.}{2006}]{2006MNRAS.371..972S}

915: {Stasi{\'n}ska} G.,  {Cid Fernandes} R.,  {Mateus} A.,  {Sodr{\'e}} L.,

916:   {Asari} N.~V.,  2006, MNRAS, 371, 972

917:

918: \bibitem[\protect\citeauthoryear{{Strateva} et~al.,}{{Strateva}

919:   et~al.}{2001}]{2001AJ....122.1861S}

920: {Strateva} I.,  et~al., 2001, AJ, 122, 1861

921:

922: \bibitem[\protect\citeauthoryear{{Tremonti}, {Heckman}, {Kauffmann},

923:   {Brinchmann}, {Charlot}, {White}, {Seibert}, {Peng}, {Schlegel}, {Uomoto},

924:   {Fukugita} \& {Brinkmann}}{{Tremonti} et~al.}{2004}]{2004ApJ...613..898T}

925: {Tremonti} C.~A.,  {Heckman} T.~M.,  {Kauffmann} G.,  {Brinchmann} J.,

926:   {Charlot} S.,  {White} S.~D.~M.,  {Seibert} M.,  {Peng} E.~W.,  {Schlegel}

927:   D.~J.,  {Uomoto} A.,  {Fukugita} M.,    {Brinkmann} J.,  2004, ApJ, 613, 898

928:

929: \bibitem[\protect\citeauthoryear{{Weinmann}, {van den Bosch}, {Yang} \&

930:   {Mo}}{{Weinmann} et~al.}{2006}]{2006MNRAS.366....2W}

931: {Weinmann} S.~M.,  {van den Bosch} F.~C.,  {Yang} X.,    {Mo} H.~J.,  2006,

932:   MNRAS, 366, 2

933:

934: \bibitem[\protect\citeauthoryear{{Weisberg}}{{Weisberg}}{2005}]{weisberg}

935: {Weisberg} S.,  2005, Applied Linear Regression, 3rd Ed..

936: Wiley/Interscience

937:

938: \bibitem[\protect\citeauthoryear{{Wolf}, {Gray}, {Arag{\'o}n-Salamanca}, {Lane}

939:   \& {Meisenheimer}}{{Wolf} et~al.}{2007}]{2007MNRAS.376L...1W}

940: {Wolf} C.,  {Gray} M.~E.,  {Arag{\'o}n-Salamanca} A.,  {Lane} K.~P.,

941:   {Meisenheimer} K.,  2007, MNRAS, 376, L1

942:

943: \bibitem[\protect\citeauthoryear{{Yu} \& {Jones}}{{Yu} \& {Jones}}{1998}]{lQR}

944: {Yu} K.,  {Jones} M.~C.,  1998, Journal of the American Statistical

945:   Association, 93, 228

946:

947: \bibitem[\protect\citeauthoryear{{Zehavi} et~al.,}{{Zehavi}

948:   et~al.}{2005}]{2005ApJ...630....1Z}

949: {Zehavi} I.,  et~al., 2005, ApJ, 630, 1

950:

951: \end{thebibliography}

952:

953: \appendix

954:

955: \section{Nonparametric mixture regression}

956: \label{sec:nmr}

957:

958: \begin{figure}

959: \includegraphics[angle=270,width=0.45\textwidth]{fig7.ps}

960: \caption{\label{fig:bic13}Offsets in the Bayesian Information Criterion

961: (BIC) score versus local galaxy density, $\rho_{1.3}$, for NMR fits

962: utilising 2, 3, and 5 components, relative to the favoured 4 component

963: fit.  Where the 5 component fit BIC offset is zero at low

964: $\rho_{1.3}$, the NMR method only uses 4 of the 5 available components

965: as two of the components are degenerate.  Four components are thus

966: preferred, by significantly higher BIC values, at all local densities.}

967: \end{figure}

968:

969: This is a newly developed statistical method for determining the

970: dependences of one variable, $y$, on another, $x$, where there may be

971: multiple components present in the data, each with a different $y$ on

972: $x$ dependence.  For the analysis presented in the main body of this

973: article we use this technique, putting $x=\rho_{1.3}$ or $\rho_{5.5}$,

974: estimates of the local environmental density, and $y=\W$, a

975: transformed version of the H$\alpha$ equivalent width (see

976: \refsec{sec:Halpha}).  Here we give a technical description of the

977: method.

978:

979: We model the probability, $f(y|x)$, of $y$ given $x$ as a sum of

980: components, thus

981: \begin{equation}

982: f(y|x;{\bmath{\Theta}}(x)) =

983: \sum_{i=1}^{c(x)}{\pi_i(x) s_i(y|{\bmath{\eta}}_i(x))}

984: \end{equation}

985: where the $s_j(y|\bmath{\eta}_j(x))$, are density functions with

986: a vector of parameters ${\bmath{\eta}}_j(x)$ that depends on $x$,

987: and the $\pi_j(x)$'s are a set of mixing proportions that sums to one

988: for each $x$. In this paper we use Gaussian functions to model the

989: components, each with parameters ${\bmath{\eta}}_i =

990: (\mu_i,\sigma_i$), mean and standard deviation respectively. The

991: number of components is $c(x)$, and may vary as a function of $x$.

992: Gaussians are rich and flexible functions which are highly suited to

993: this task, particularly if one wishes to avoid the danger of overly

994: designing the method to fit one's expectations of the results.

995:

996: The parameter set, ${\bmath{\Theta}}(x)\left({\bmath{\theta}}_1 (x),

997:   \ldots,{\bmath{\theta}}_{c(x)} (x) \right)=(\pi_1(x), {\bmath{\eta}}_1(x),

998: \ldots, \pi_{c(x)}(x), {\bmath{\eta}}_{c(x)}(x))$, is determined using local

999: likelihood estimation.  The parameters are approximated locally by a

1000: polynomial of degree $p$, and hence vary smoothly with $x$.  The

1001: variation of the parameters can thus be described by a set of

1002: polynomial coefficients, $\bmath{B}$.  These coefficients may

1003: then be constrained by data, weighted using a kernel of bandwidth

1004: $b(x)$ about $x$.

1005:

1006: The log-likelihood function of the set of polynomial coefficients $\bmath{B}$

1007: given the data is therefore

1008: \begin{eqnarray}

1009: {\cal L}_p(\bmath{B};x,b,c(x)) &=& \sum_{m=1}^{n} w_m(x;b) \times \\

1010: & &

1011: \log_e f(Y_m,x;\bmath{T}(X_m - x,\bmath{B})), \nonumber

1012: \end{eqnarray}

1013: for $n$ measurements labelled by $m$, with locations

1014: $(x,y)=(X_m,Y_m)$.  The set of polynomial functions approximating the

1015: parameters $\bmath{\Theta}$ at $x$ are

1016: \begin{eqnarray}

1017: \lefteqn{\bmath{T}(\delta_m,\bmath{B}) =

1018: \big(t_{1,1}\big(\delta_m, \bmath{\beta}_{1,1}\big), \ldots,

1019: t_{1,1}\big(\delta_m, \bmath{\beta}_{1,q_1}\big), \ldots,}

1020: \nonumber\\

1021: & &

1022: t_{c(x),1}\big(\delta_m, \bmath{\beta}_{c(x),1}\big), \ldots,

1023: t_{c(x),1}\big(\delta_m, \bmath{\beta}_{c(x),q_{c(x)}}\big)\big),

1024: \end{eqnarray}

1025: defining $\delta_m = X_m - x$, with

1026: \begin{equation}

1027: t_{i,j}(\delta_m, \bmath{\beta}_{i,j}) =

1028: \sum_{k=0}^{p}{\beta_{i,j,k} (\delta_m)^k / k!},

1029: \end{equation}

1030: where $i = 1,\ldots,c(x)$ counts over the components, $j =

1031: 1,\ldots,q_i$ counts the parameters of component $i$ (in our case each

1032: density function is a Gaussian with parameters $\mu$ and $\sigma$, and

1033: with mixing weight $\pi$, thus $q_i=3$), and $k = 0,\ldots,p$

1034: counts the degrees of the polynomials used in $\bmath{T}$ to

1035: approximate the parameters $\bmath{\Theta}$.  The

1036: $\beta_{i,j,k}$, and hence their containing sets,

1037: $\bmath{\beta}_{i,j}$ and $\bmath{B}$, correspond to a

1038: particular value of $x$.  Note that the $\beta_{i,j,k}$ give

1039: approximations around $\delta_m = 0$ for the value and $k$-th

1040: derivative of the parameter $j$ of component $i$.  The contribution to

1041: ${\cal L}_p$ of data at distance $\delta_m$ from $x$ is specified by

1042: \begin{equation}

1043: w_m(x; b(x)) = W\left(\frac{X_m - x}{b(x)}\right),

1044: \end{equation}

1045: where $W(z)$ is a weighting function.

1046:

1047: One can then attempt to determine the $\bmath{B}$ which maximises

1048: the local log-likelihood, ${\cal L}_p$, which we denote

1049: $\widehat{\bmath{B}}(x; b(x), c(x))$, explicitly indicating its dependencies.

1050: Therefore,

1051: \begin{eqnarray}

1052: \label{eqn:betahat}

1053: \lefteqn{\widehat{\bmath{B}}(x; b(x), c(x)) =

1054: \begin{array}{c}\\\mathrm{argmax}\\^{\bmath{B}}\end{array} \sum_{j=1}^{n} w_j(x; b(x))\;\times}\\

1055: &&\log_e \sum_{i=1}^{c(x)} s_i\big(Y_j|t_{i,1}(X_j - x, {\bmath{\beta}}_{i,1}),

1056: \ldots, t_{i,q_i}(X_j - x, {\bmath{\beta}}_{i,q_i})\big).\nonumber

1057: \end{eqnarray}

1058: The local likelihood estimate for the set of parameters is then defined by

1059: $\widehat{\bmath{\Theta}}(x; b(x), c(x)) =

1060: \bmath{T}(0, \widehat{\bmath{B}}(x; b(x), c(x)))$, that is

1061: $\widehat{\theta}_{i,j} (x; b(x), c(x)) =

1062: \widehat{\beta}_{i,j,0} (x; b(x), c(x))$.

1063: Our conditional density estimate given $b(x)$ and

1064: $k(x)$ is therefore

1065: \begin{equation}

1066: \label{eqn:fhat}

1067: {\widehat f}(y|x;b(x), c(x)) \equiv

1068: f(y|x;{\widehat{\bmath{\Theta}}}(x; b(x),c(x))).

1069: \label{fhatHK}

1070: \end{equation}

1071: In general, given $b(x)$ and $c(x)$, the standard method of solving

1072: Eqn.~\ref{eqn:betahat} is to use the Expectation-Maximisation (EM)

1073: method \citep{EM}.

1074:

1075: The estimator Eqn.~\ref{eqn:fhat} is dependent upon the chosen

1076: bandwidth $b(x)$ and number of components $c(x)$.  If they are \emph{a

1077:   priori} unknown, we must therefore select them in some reliable way.

1078:

1079: In this work we have chosen the bandwidth for $x=\rho_{1.3}$

1080: or $\rho_{5.5}$ to be a function of the $K$th nearest neighbour.  We

1081: use $K=5000$, selected as a compromise between the smoothness of the

1082: resulting component regression lines and their ability to trace any

1083: variation in $\W$ versus environment.  We have checked that the exact

1084: choice of $K$ (within the range 1000--7500) does not affect our

1085: results.  The optimum number of components was determined using the

1086: Bayesian Information Criterion\citep[BIC;][]{BIC}:

1087: \begin{equation}

1088: \label{eqn:BIC}

1089: \rmn{BIC} =  {\cal L}_p - \frac{1}{2}(3c-1)\log_e(K)

1090: \end{equation}

1091: where ${\cal L}_p$ is the maximised log-likelihood, $c$ is the number

1092: of components, and $K$ is the sample size.  With this definition,

1093: otherwise known as the Schwarz Criterion, the preferred model is that

1094: which maximises the value of BIC.  Note that other definitions

1095: sometimes multiply the right hand side of Eqn. \ref{eqn:BIC} by $-2$.

1096: The difference between the BIC values of two models, $\Delta$BIC,

1097: approximates the natural logarithm of the Bayes factor, a summary of

1098: the evidence for one model over another.  A $\Delta$BIC of 7 indicates

1099: that the preferred model is truly better than the alternative model

1100: with odds better than a thousand to one.  A Bayes factor of $> 150$,

1101: i.e. $\Delta \rmn{BIC} > 5$, is generally taken to be very strong

1102: evidence for the preferred model \citep{KR95}.  Four components are

1103: thus very strongly favoured, by $\Delta$BIC = 147.1, 11.9 and 7.7

1104: versus 2, 3 and 5 components, respectively, averaged over

1105: $\log_{10}\rho_{1.3}$.  The $\Delta$BIC are shown versus $\rho_{1.3}$

1106: in \reffig{fig:bic13}.

1107:

1108: One might argue that choosing different density functions, other than

1109: Gaussians, or applying a different transformation, would result in our

1110: finding a different optimal number of components.  However, when

1111: varying the $\W = \log_{10}(\EW + \lambda)$ transformation by changing

1112: $\lambda$, and trying various combinations of Gaussians and lognormal

1113: functions in $\EW$-space, the optimum number of components has

1114: consistently turned out to be four.  A careful visual inspection of

1115: the $\EW$ and $\W$ distributions also supports this conclusion.

1116:

1117: Obviously one could examine the data and devise component density

1118: functions that would result in the NMR method finding any desired

1119: number of components.  However, this defeats the object of employing

1120: the NMR technique.  By `components' we mean simple, distinct elements

1121: of the overall population.  We must therefore make only simple

1122: assumptions and transformations in order to identify them, with

1123: minimal prior reference to the data.

1124:

1125: If two or more NMR components together represent only a single true

1126: component of the galaxy populations, then we would expect them to

1127: behave identically.  Otherwise, they could not represent a single

1128: component, by definition.  However, our four NMR components each

1129: demonstrate different behaviour with respect to local density,

1130: indicating they are truly distinct (see Figs. 1--3, B1--B3).

1131:

1132: Finally, the components we find using the NMR technique correspond

1133: remarkably well to traditional galaxy classifications (compare

1134: Figs. \ref{fig:wrho13} \& \ref{fig:wrho13bpt}).  This strongly

1135: supports our interpretation of the NMR components as physically

1136: distinct elements of the galaxy population.  However, the NMR

1137: components have the advantage of being based on all galaxies in our

1138: sample.  Traditional diagnostic diagrams can only be used for objects

1139: with multiple, significantly-detected, emission lines, and in many

1140: cases give ambiguous classifications (e.g. \emph{SF+AGN}).

1141:

1142: \section{Larger scale environment}

1143: \label{sec:55Mpc}

1144:

1145: The main text of the paper focuses on a measure of environment using a

1146: kernel of bandwidth $1.3$~Mpc, chosen by cross-validation.  This

1147: bandwidth performs well at the scales of galaxy clusters.  However, at

1148: low densities there is frequently only one galaxy within the kernel,

1149: and the estimator is unable to differentiate between different

1150: low-density environments.  We thus additionally perform our analysis

1151: using local densities estimated using a kernel with $5.5$~Mpc

1152: bandwidth.  The results are very similar to those from the $1.3$~Mpc

1153: densities, and thus our conclusions are robust to the precise

1154: definition of local density.  The figures corresponding to the

1155: $5.5$~Mpc kernel are given in this appendix.

1156:

1157: \begin{figure*}

1158: \includegraphics[clip=True,trim = 3cm 22.3cm 9.5cm 3.3cm,height=0.30\textheight]{figS1.ps}

1159: \caption{\label{fig:w55}As \reffig{fig:w13}, but for local galaxy densities

1160:   estimated using a $5.5$~Mpc bandwidth kernel, $\rho_{5.5}$.  The

1161:   results are very similar to those found using $\rho_{1.3}$.}

1162: \end{figure*}

1163:

1164: \begin{figure*}

1165: \includegraphics[clip=True,trim = 3cm 22.3cm 9.5cm 3.3cm,height=0.30\textheight]{figS2.ps}

1166: \caption{\label{fig:ew55}As \reffig{fig:w55}, but shown in terms of

1167:   the untransformed equivalent width, $\EW$.  The inset shows the same

1168:   plot with axis-ranges chosen to better show the behaviour at small

1169:   $\EW$.}

1170: \end{figure*}

1171:

1172: \begin{figure*}

1173: \includegraphics[clip=True,trim = 3cm 22cm 5cm 3.3cm,height=0.21\textheight]{figS3.ps}

1174: \caption{\label{fig:wrho55}As \reffig{fig:wrho13}, but for local

1175:   galaxy densities estimated using a $5.5$~Mpc bandwidth kernel,

1176:   $\rho_{5.5}$.  The results are very similar to those found using

1177:   $\rho_{1.3}$.}

1178: \end{figure*}

1179:

1180: \label{lastpage}

1181:

1182: \end{document}

1183: