0809.2800/ms.tex
1: % Created from:
2: % mn2esample.tex
3: % v2.1 released 22nd May 2002 (G. Hutton)
4: 
5: \documentclass[usegraphicx,usenatbib,usedcolumn,useAMS]{mn2e}
6: 
7: %%%%% AUTHORS - PLACE YOUR OWN MACROS HERE %%%%%
8: 
9: \newcommand\W{W_{\rm{H}\alpha}}
10: \newcommand\EW{\rm{EW}(\rm{H}\alpha)}
11: 
12: %%%-----------------------------------------------------------------------------
13: 
14: %%% General maths and units commands
15: \newcommand{\unisim}{\sim\!}
16: %%% References
17: \newcommand{\refsec}[1]{Section \ref{#1}}
18: \newcommand{\reffig}[1]{Fig.~\ref{#1}}
19: \newcommand{\reftab}[1]{Table \ref{#1}}
20: 
21: %%%-----------------------------------------------------------------------------
22: % make sure full page visible if processed as A4 or letter
23: \voffset-1.25cm
24: %%%-----------------------------------------------------------------------------
25: 
26: \title[Components of the galaxy population]%
27: {Revealing components of the galaxy population
28:   through nonparametric techniques}
29: \author[S. P. Bamford et al.]{%
30: Steven P. Bamford$^{1,2}$\thanks{E-mail: steven.bamford@nottingham.ac.uk},
31: Alex L. Rojas$^{3,4}$, Robert C. Nichol$^{1}$, Christopher J. Miller$^{5}$,\newauthor
32: Larry Wasserman$^{3}$, Christopher R. Genovese$^{3}$, Peter
33: E. Freeman$^{3}$
34: \vspace{6pt}\\
35: $^{1}$Institute of Cosmology and Gravitation, University of
36: Portsmouth, Mercantile House, Hampshire Terrace, Portsmouth, PO1 2EG, UK\\
37: $^{2}$Centre for Astronomy \& Particle Theory, School of Physics \& Astronomy,
38: University of Nottingham, Nottingham, NG7 2RD, UK\\
39: $^{3}$Department of Statistics, Baker Hall, Carnegie Mellon
40: University, Pittsburgh, PA 15213, USA\\
41: $^{4}$Carnegie Mellon University in Qatar, c/o Qatar Foundation,
42: P.O. Box 24866, Doha, Qatar\\
43: $^{5}$Observatorio Cerro Tololo, Observatorio de AURA en Chile,
44: Casilla 603, La Serena, Chile
45: }
46: 
47: \begin{document}
48:   
49: \date{Accepted ???. Received ???; in original form ???}
50: 
51: \pagerange{\pageref{firstpage}--\pageref{lastpage}} \pubyear{2008}
52: 
53: \maketitle
54: 
55: \label{firstpage}
56: 
57: \begin{abstract}
58:   The distributions of galaxy properties vary with environment, and
59:   are often multimodal, suggesting that the galaxy population may be a
60:   combination of multiple components.  The behaviour of these
61:   components versus environment holds details about the processes of
62:   galaxy development.  To release this information we apply a novel,
63:   nonparametric statistical technique, identifying four components
64:   present in the distribution of galaxy H$\alpha$ emission-line
65:   equivalent-widths. We interpret these components as passive,
66:   star-forming, and two varieties of active galactic nuclei.
67:   Independent of this interpretation, the properties of each component
68:   are remarkably constant as a function of environment.  Only their
69:   relative proportions display substantial variation.  The galaxy
70:   population thus appears to comprise distinct components which are
71:   individually independent of environment, with galaxies rapidly
72:   transitioning between components as they move into denser
73:   environments.
74: \end{abstract}
75: 
76: \begin{keywords}
77: methods: statistical -- galaxies: statistics -- galaxies: fundamental
78: parameters -- galaxies: clusters: general
79: \end{keywords}
80: 
81: \section{Components of the galaxy population}
82: It has long been recognised that galaxies may be divided into at
83: least two distinct sub-populations. Originally this division was based
84: on visual appearance.  Most galaxies can be morphologically
85: classified as either elliptical or spiral.  Finer classification is
86: possible, discretizing an apparently continuous variation in galaxy
87: appearance.  However the dichotomy between elliptical and spiral
88: morphology is more pronounced than the variations within each
89: class.  Subsequently, it has been discovered that several other, more
90: quantitative, galaxy properties are distributed unevenly or in a
91: multi-modal manner.
92: 
93: The colour distribution of SDSS galaxies is strongly bimodal
94: \citep{2001AJ....122.1861S}. Galaxies in the ``red'' and ``blue'' modes
95: can be roughly identified as those with elliptical and spiral
96: morphology, respectively
97: \citep{2002AJ....124..646H,2006MNRAS.368..414D}.  Whereas morphology
98: reflects the dynamical state of galaxies, colour is related to their
99: star-formation history, particularly over the last $\la 10^9$
100: years.  The colour bimodality thus implies a division of the galaxy
101: population into blue galaxies, which have recently formed stars, and
102: red galaxies, which have not.  Such a bimodality in the star-formation
103: properties of galaxies has also been observed using more direct
104: measures of current star-formation, such as emission-line strength
105: \citep{2004MNRAS.348.1355B}.
106: 
107: The position of the red and blue galaxy sequences in the
108: colour--luminosity or colour--stellar mass planes display only a weak
109: dependence on environment.  However, the relative proportions of
110: galaxies in the two sequences vary strongly. In regions with a higher
111: local galaxy density the fraction of galaxies on the red sequence is
112: higher
113: \citep{2004ApJ...615L.101B,2004AIPC..743..106B,2006MNRAS.373..469B}.
114:   
115: It remains a matter of debate whether colour is more closely related to
116: environment than morphology.  Some claim that trends in morphology
117: versus environment can be mostly explained via a morphology--colour
118: relation which is almost independent of environment
119: \citep{2006MNRAS.366....2W,2006astro.ph..8353B,2006astro.ph.10171B,2007MNRAS.376L...1W}.
120: However, other studies oppose this view \citep{2007ApJ...658..898P},
121: and it has been clearly shown that the colour and morphology
122: bimodalities behave differently with respect to environment and
123: stellar mass \citep{2008arXiv0805.2612B}.
124: 
125: There are growing indications that, in a fraction of the galaxy
126: population, star-formation must be terminated rapidly
127: \citep{2004MNRAS.348.1355B,2006MNRAS.373..469B}.  The emission lines
128: in galaxy spectra provide a way of measuring the level of current star
129: formation on a timescale of $\la 10^7$ years.  They therefore trace
130: rapid star formation variations more sensitively than colour.  Another
131: important property of emission lines is that they are produced by
132: active galactic nuclei (AGN), in addition to star formation.  AGN are
133: present in many galaxies, and are thought to be produced by accretion
134: of material onto the super-massive black holes which appear to reside
135: at the centre of most, if not all, galaxies
136: \citep{1998Natur.395A..14R}.  Recently a variety of studies have
137: suggested that AGN strongly influence star-formation in their host
138: galaxies, and thus play an important role in defining the galaxy
139: population
140: \citep{2003MNRAS.346.1055K,2005MNRAS.364.1337S,2006MNRAS.365...11C,2006MNRAS.370..645B}.
141: The potential presence of an AGN contribution complicates the
142: traditional usage of emission-lines as an indicator of star
143: formation rate (SFR).  However, it also presents an opportunity to
144: study these two interdependent processes, star-formation and AGN,
145: through the distribution of a single quantity.
146: 
147: Galaxies with contrasting properties are found to be distributed
148: differently in space.  Elliptical galaxies cluster together more
149: strongly than spirals \citep{2000ApJ...545....6B,2001ApJ...554..857G}.
150: Similarly, red galaxies are preferentially found in denser
151: environments than blue galaxies \citep{2005ApJ...630....1Z}.  We have a
152: well developed theory for how structure forms in the cosmos, at least
153: in terms of the underlying cold dark matter which dominates the mass
154: density \citep{2005Natur.435..629S}.  Baryonic matter is expected to be
155: similarly distributed, in broad terms.  This theory thus explains the
156: range of galaxy environments observed.  However, the properties of
157: galaxies as a function of environment is a much more complicated
158: issue, depending on the detailed physics of galaxy formation and
159: evolution.  By studying trends in the galaxy population with
160: environment we can learn about these physical processes.
161: 
162: There has been a logical progression in studies of galaxy properties
163: as a function of environment.  Early work was based on dividing
164: galaxies into simple classes and looking at variations in the
165: fractions of galaxies of each class in bins of environment
166: \citep{1985ApJ...288..481D}.  As galaxy samples grew, this moved on to
167: examining trends in the mean properties of galaxies as a smooth
168: function of local galaxy density
169: \citep{2002MNRAS.334..673L,2003ApJ...584..210G}. A significant
170: development was fitting to the data functions that describe the
171: distribution of galaxies in two classes
172: \citep{2004ApJ...615L.101B,2004AIPC..743..106B,2006MNRAS.373..469B}.
173: Most of the approaches employed so far have relied upon enforcing a
174: predefined view of how to divide or classify the galaxy population in
175: increasingly complex ways.  However, our understanding of the physical
176: processes at work is highly uncertain and does not provide a
177: sufficient basis to make this decision.  Our only guide is the data
178: itself.  A natural next step is thus to turn to nonparametric methods,
179: where the components of the population are deduced consistently from
180: the data itself.
181: 
182: Recently, several studies have performed multivariate statistical
183: analyses on datasets containing a wide variety of galaxy properties,
184: in order to identify components of the galaxy population, and
185: determine which properties are most important for identifying to which
186: component a galaxy belongs
187: \citep{2005MNRAS.363.1257E,2006MNRAS.373.1389C}.  Such studies are
188: highly informative, but become complicated when one wishes to
189: determine the behaviour of the identified components versus another
190: variable.  In this paper we are primarily concerned with variation in
191: the components of the galaxy population as a function of environment.
192: The statistical method we present below may be straightforwardly
193: applied to multivariate datasets.  However, for simplicity, in the
194: present work we consider the environmental dependence of just one
195: galaxy property.  Nevertheless, even with this elementary approach, we
196: are able to learn much about the galaxy population.
197: 
198: \begin{figure*}
199: \includegraphics[clip=True,trim = 3cm 22.3cm 9.5cm 3.3cm,width=1.0\textwidth]{fig1.ps}
200: \caption{\label{fig:w13}
201:   The distribution of transformed H$\alpha$ equivalent width
202:   ($\W$) for (left) low and (right) high density environments.  The
203:   histogram displays the data, with Poisson uncertainties indicated by
204:   the grey shading.  The red, purple, green and blue lines show the
205:   components derived by applying the NMR technique.  The brown line
206:   gives the sum of these components, which is clearly a good
207:   representation of the data.}
208: \end{figure*}
209: 
210: \section{Conditional density estimation}
211: A common problem in astronomy, and statistical sciences in general, is
212: that one wishes to understand how the behaviour of one variable depends
213: upon another.  This is relatively straightforward in the case where
214: there is a single relationship between the variables, albeit with
215: some, possibly variable, scatter or width to the distribution.  Much
216: statistical and astronomical literature has been devoted to the
217: development of such regression methods \citep{weisberg}.  However, in
218: the case where multiple components may be present in the overall
219: distribution, each with a different functional dependence on the
220: variables, the situation becomes substantially more difficult.  One
221: can still attempt to apply single-component statistical tools, for
222: example nonparametric quantile regression \citep{QR,lQR}, on the whole
223: distribution, but the understanding one gains from such an exercise is
224: limited and sometimes misleading.  Alternatively one may individually
225: analyse subsamples selected by defining regions in the parameter
226: space, or preferably using additional information\citep{MMR}.  This
227: approach, however, is unsuitable when the multiple components
228: significantly overlap, or when it is unclear how many components are
229: present.
230: 
231: Most regression techniques focus on estimating the conditional mean,
232: the average value of one variable as a function of another variable;
233: for example, a line through a set of scattered points.  However, one
234: may get a better understanding of the relationship between a response
235: variable and a set of covariates by considering the estimation of the
236: conditional density as a whole; the \emph{distribution} of one
237: variable as a function of another.  (Note that \emph{density} here
238: refers to probability density as a function of the parameter set, not
239: a measure of environmental local galaxy density as elsewhere in this
240: paper.)  We use a new conditional density estimator based on finite
241: mixture models and local likelihood estimation, which describes the
242: underlying relationship between two variables by a set of
243: parameterised functions. This feature gives the proposed procedure the
244: advantage of being easily interpretable. This method is called
245: nonparametric mixture regression (NMR), and is described in detail in
246: Appendix \ref{sec:nmr}.
247: 
248: The NMR technique has the potential to aid the understanding of many
249: datasets, across all fields of science.  In the present work, it
250: allows us to determine the environmental dependence for individual
251: components of the galaxy population, with minimal prior assumptions on
252: the number and properties of these components.
253:   
254: \section{\boldmath Galaxy H$\alpha$ equivalent widths}
255: \label{sec:Halpha}
256: The strongest emission line in a galaxy optical spectrum is H$\alpha$.
257: The luminosity of H$\alpha$ is approximately proportional to the rate
258: of ongoing star-formation \citep{2006ApJ...642..775M}, when
259: uncontaminated by additional emission, such as from an AGN.  A
260: commonly employed quantity is the equivalent width (EW) of a spectral
261: line, the line flux normalised by the continuum flux at the same
262: wavelength.  The EW measurement has the advantages of being
263: approximately independent of uncertainties in the spectral flux
264: calibration and any extinction present in both the observed galaxy and
265: our own.  The H$\alpha$ line is in the red region of the spectrum,
266: where the continuum is dominated by the light from old stars.  The
267: H$\alpha$ continuum flux is therefore roughly proportional to stellar
268: mass, and hence $\EW$ is approximately proportional to the SFR
269: per unit stellar mass.
270: 
271: \begin{figure*}
272: \includegraphics[clip=True,trim = 3cm 22.3cm 9.5cm 3.3cm,width=1.0\textwidth]{fig2.ps}
273: \caption{\label{fig:ew13}As \reffig{fig:w13}, but shown here in terms of the
274:   untransformed equivalent width, $\EW$.  The inset shows the same
275:   plot with axis-ranges chosen to better show the behaviour at small $\EW$.}
276: \end{figure*}
277: 
278: The overall distribution of galaxy H$\alpha$ luminosity, equivalent
279: width, and hence absolute and normalised SFR, are
280: found to move to lower levels with increasing environmental density
281: \citep{2002MNRAS.334..673L,2003ApJ...584..210G}.  This generally agrees
282: with the colour and morphology trends described above, and the
283: variation of H$\alpha$ emission with morphological type
284: \citep{2004AJ....127.2511N}.  However, if the galaxy population is
285: separated into galaxies which are star-forming and those which are
286: not, the distribution of $\EW$ for each component
287: does not depend significantly on environment.  Only the relative
288: proportion of star-forming galaxies changes strongly
289: \citep{2004MNRAS.348.1355B} with environment.  This finding, of
290: distinguishable components in the galaxy population with properties
291: independent of environment but proportions which vary strongly,
292: mirrors the behaviour found in the colour distribution.  It also
293: motivates us to perform a more rigorous evaluation of the components
294: present in the galaxy population in this work.
295: 
296: As mentioned earlier, an important feature of emission lines is that,
297: in addition to star formation, they are also produced by AGN.
298: Galaxies whose emission lines are dominated by star-formation or AGN
299: activity can be separated using various diagnostic diagrams.  The most
300: common of these plots the emission line ratios
301: ${\rm{[OIII]}\lambda5007}/{\rm{H}\beta}$ versus
302: ${\rm{[NII]}\lambda6583}/{\rm{H}\alpha}$, and is known as the BPT
303: diagram \citep{1981PASP...93....5B}. The usual approach is to use these
304: diagrams to reject objects inappropriate to the particular study.
305: Thus a study of galaxy star formation properties would exclude all
306: galaxies with signs of AGN contamination.  However, classifying a
307: galaxy using the BPT diagram requires multiple emission lines to be
308: detected, resulting in a fraction of objects which cannot be
309: classified.  In addition, the separation between galaxies dominated by
310: star-formation and AGN is not clear, and there appears to be a large
311: population of galaxies which host both star-formation and an AGN.
312: Roughly 20\% of all galaxies are unambiguously AGN-dominated, while it
313: is estimated that a further 20\% are star-forming galaxies with a
314: significant AGN contribution \citep{2003ApJ...597..142M}. This
315: ambiguity means a variety of SFR--AGN demarcations exist
316: \citep{2001ApJ...556..121K,2003MNRAS.346.1055K,2006MNRAS.371..972S}.
317: Star formation studies based on emission lines have therefore rejected
318: widely varying fractions of galaxies from their samples.  This
319: fraction is usually low, so significant numbers of AGN-contaminated
320: galaxies remain.  More importantly, if our aim is to gain knowledge of
321: star-formation properties across the whole galaxy population, then we
322: may be rejecting an important fraction of the population.  If there
323: are any intrinsic correlations between AGN and star-formation, as has
324: been suggested by other studies \citep{2003MNRAS.346.1055K}, then
325: information about these will be lost.
326: 
327: A number of classes of AGN have been identified.  A primary
328: distinction is between Type 1 and Type 2 AGN.  In Type 1 objects our
329: viewing angle is such that we see the region immediately around the
330: central black hole directly, and thus the galaxy's light is dominated
331: by the AGN emission.  In this case the properties of the host galaxy
332: are generally very difficult to determine.  In Type 2 AGN, the central
333: region is obscured by a dusty torus surrounding it.  The observed AGN
334: emission is therefore due to material further removed from the central
335: ionising source, and mostly confined to emission lines.  Most
336: photometric and structural galaxy properties may therefore be reliably
337: measured, despite the presence of a Type 2 AGN.  In this work we
338: exclude all Type 1 AGN, identified by the large widths of their
339: emission lines, and consider only the more common Type 2 objects.  A
340: further subdivision within Type 2 AGN is between LINER and Seyfert 2
341: objects.  These are similar, and may simply be two parts of a
342: continuum of objects, with Seyfert 2 AGN being more powerful and
343: highly ionised.  However, there are signs that LINERs and Seyfert 2
344: AGN are truly physically distinct classes \citep{2006MNRAS.372..961K}.
345: 
346: In this work we examine the components in the distribution of galaxy
347: $\EW$, interpretable as a proxy for star formation rate and
348: nuclear activity per unit stellar mass.  It is possible to estimate
349: the true star-formation rate and stellar mass, for galaxies which do
350: not host an AGN, using a combination of several spectral features.
351: However, such estimates are sensitive to the details of the assumed
352: model.  There is therefore a concern that any finding concerning the
353: components of the resulting distribution may be attributable to the
354: model.  The $\EW$, on the other hand, is a single, robust,
355: model-independent measurement.
356: 
357: The data we use in our study is from Data Release 4 of the SDSS
358: \citep{2006ApJS..162...38A}. The emission line fluxes, continua and
359: resulting EW used in this study are those provided for DR4 by the
360: MPA-Garching group \citep{2004ApJ...613..898T}\footnote{available from
361:   http://www.mpa-garching.mpg.de/SDSS/DR4}.  All quantities used in
362: this paper were obtained from the CMU-PITT SDSS DR4 Value Added
363: Catalog\footnote{available from
364:   http://nvogre.phyast.pitt.edu/dr4\_value\_added} (VAC).  The SQL code for
365: the selection of each of our samples is given in \reftab{tab:sql}.
366: We construct a volume-limited sample by selecting galaxies with $0.05
367: < z < 0.095$ and $M_r < -20.4$.  In this work we thus focus on the
368: behaviour of fairly bright galaxies.  The lower redshift limit ensures
369: the spectra are based on a reasonable fraction of the galaxies' light;
370: at $z=0.05$ the $3$~arcsec diameter of each spectroscopic fibre
371: corresponds to $3$~kpc.  Throughout we convert to physical scales
372: assuming a flat Friedman-Robertson-Walker cosmology with $\Omega_m =
373: 0.3$, $\Omega_{\lambda} = 0.7$ and $H_0 = 70$ km~s$^{-1}$~Mpc$^{-1}$.
374: 
375: \begin{table}
376:   \caption{\label{tab:sql}Definitions of the galaxy samples used in this
377:     study, given as `where' clauses of the SQL queries of the CMU-PITT SDSS
378:     DR4 VAC}
379: \begin{tabular}{p{0.06\textwidth}p{0.31\textwidth}c}
380: \hline
381: \centering sample & \centering SQL selection & $n$ \\
382: \hline \hline
383: density defining sample &
384: \texttt{!z between 0.02 and 0.10 and absolute\_Petro\_r <= -20.4 and Sort~=~0}
385: &
386: 117873\\
387: \hline
388: $\rho_{1.3}$ \mbox{sample} &
389: \texttt{!z between 0.05 and 0.095 and absolute\_Petro\_r <= -20.4 and
390:  2.4 < Dist\_right\_edge and 2.4 < Dist\_left\_edge and 
391:  2.4 < Dist\_upper\_edge and 2.4 < Dist\_lower\_edge and
392:  H\_ALPHA\_FLUX > -99 and H\_ALPHA\_CONT > 0.0001 and
393:  H\_ALPHA\_FLUX/H\_ALPHA\_CONT > -0.4 and
394:  absolute\_Petro\_u > -990 and absolute\_Petro\_r > -990
395:  and Sort~=~0}
396: &
397: 76420\\
398: \hline
399: $\rho_{5.5}$ \mbox{sample} &
400: \texttt{!z between 0.05 and 0.095 and absolute\_Petro\_r <= -20.4 and
401:  11 < Dist\_right\_edge and 11 < Dist\_left\_edge and 
402:  11 < Dist\_upper\_edge and 11 < Dist\_lower\_edge and
403:  H\_ALPHA\_FLUX > -99 and H\_ALPHA\_CONT > 0.0001 and
404:  H\_ALPHA\_FLUX/H\_ALPHA\_CONT > -0.4 and
405:  absolute\_Petro\_u > -990 and absolute\_Petro\_r > -990
406:  and Sort~=~0}
407: &
408: 46998\\
409: \hline
410: \end{tabular}
411: \end{table}
412: 
413: \section{\boldmath Measuring galaxy environment}
414: Galaxy environment can be characterised in many ways, but a commonly
415: adopted value is the local number density of galaxies brighter than a
416: given luminosity, averaged over some volume or kernel.  We estimate
417: the local galaxy number density, $\rho_b$, within a fixed-scale,
418: spherical kernel with a Gaussian radial profile and bandwidth $b$.
419: Our local galaxy densities are thus simple to interpret physically.
420: 
421: To select the bandwidth, or scale, $b$ of the kernel, we apply
422: leave-one-out cross-validation; that is, we select the value of $b$
423: which minimizes the estimated integrated mean squared error, $CV(b)$. This
424: error is obtained by estimating the density function $n$ times, each
425: time leaving out one galaxy from the estimation:
426: \begin{equation}
427: CV(b) = \int \widehat f_{n,b}^{\;2}(\bmath{x}) d\bmath{x} -
428: \frac{2}{n}\sum_{i=1}^n \widehat f_{(-i),b}(\bmath{X_i})
429: \end{equation}
430: where $\{\bmath{X_i}\}$ is the set of galaxy positions, and $\widehat
431: f_{n,b}$ and $\widehat f_{(-i),b}$ are the kernel density estimators
432: with bandwidth $b$, using all $n$ galaxies and after removing the
433: $i^{\rmn{th}}$ galaxy, respectively.  We compute $CV(b)$ for a range
434: of different bandwidth values to find that which minimizes the error.
435: Applying this cross-validation method we determine an optimum
436: bandwidth value of $1.3$~Mpc.  A similar optimum bandwidth for local
437: galaxy density estimation was found using cross-validation by
438: \citet{2004MNRAS.348.1355B}.
439: 
440: Interestingly, this scale corresponds to the size of
441: galaxy clusters, and is thus highly appropriate for characterising
442: density from a physical, as well as a statistical, point of view.
443: However, while cross-validation provides the statistically optimum
444: bandwidth for the whole sample, any choice of bandwidth has its
445: limitations.  This density estimator loses resolution at low
446: densities, where there are no neighbouring galaxies within the kernel
447: bandwidth, and is thus unable to discriminate between densities lower
448: than $\rho_{1.3} \sim 0.03$~Mpc$^{-3}$, comprising 17\% of the sample.
449: In order to probe environments less dense than this, but necessarily
450: on larger physical scales, we additionally perform the analysis with
451: local densities measured using a larger bandwidth of $5.5$~Mpc.
452: Almost all galaxies have a neighbour within this radius.  One could
453: also consider estimating densities with a kernel bandwidth
454: significantly smaller than $1.3$~Mpc. However, such an estimator would
455: lose resolution below even moderate densities, where galaxies are
456: typically separated by more than the bandwidth.  It would also be less
457: able to discriminate between high density environments, because the
458: densities are estimated using galaxy positions uncorrected for
459: redshift-space distortions, and hence an increase in true-space
460: density no longer results in a higher redshift-space density within
461: the kernel.  We mostly show results based on the
462: statistically-motivated $1.3$~Mpc bandwidth in the main body of this
463: article, but provide figures using the $5.5$~Mpc bandwidth in Appendix
464: \ref{sec:55Mpc}, to demonstrate that we find similar results on larger
465: scales and to lower densities.
466: 
467: We avoid biased density estimates for galaxies at the edges of our
468: sample volume by determining the densities using a larger volume
469: sample of galaxies with $0.02 < z < 0.10$ and $M_r < -20.4$.
470: We then limit the analysis sample to galaxies with $0.05 < z < 0.095$
471: and further than approximately twice the bandwidth from a survey
472: boundary.  We reject a further 3\% of galaxies with unreliable $\EW$
473: or $(u-r)$ rest-frame colour measurements.  The exact selections, and
474: corresponding sample sizes, are given in \reftab{tab:sql}.
475: 
476: \begin{figure*}
477: \includegraphics[clip=True,trim = 3cm 22cm 5cm 3.3cm,width=1.0\textwidth]{fig3.ps}
478: \caption{\label{fig:wrho13}The behaviour of the NMR components versus
479:   environment.  The left panel plots the data as dots, along with the
480:   location of each component, indicated by thick, solid lines, and additionally
481:   their widths via the coloured shading and dashed lines.  These widths
482:   are shown explicitly in the middle panel.  The right panel displays
483:   the variation in the proportion of each component.  While the
484:   location and width of the components do not change significantly
485:   with environment, the proportions vary strongly.}
486: \end{figure*}
487: 
488: \section{Applying the NMR technique}
489: 
490: \begin{figure*}
491: \includegraphics[clip=True,trim = 5cm 7.5cm 10cm 2cm,width=1.0\textwidth]{fig4.ps}
492: \caption{\label{fig:3Dwrho55}A three-dimensional view of the NMR estimate of the
493:   $\W$--$\rho_{5.5}$ distribution, shown by the grey, transparent
494:   surface, and its constituent components, colour-coded is in the
495:   previous figures.  It can be clearly seen that the positions and widths of the
496:   components do not change significantly, while their relative
497:   proportions vary substantially.}
498: \end{figure*}
499: 
500: A brief inspection of the sample $\EW$ distribution reveals a peak
501: around zero EW, with a long, asymmetric tail to high EW.  The NMR
502: technique is more computationally efficient when using symmetrical,
503: Gaussian functions to model the distribution.  Gaussians are also an
504: obvious choice due to their exceptional richness and flexibility.  For
505: convenience we therefore wish to transform the equivalent width
506: quantity to a space where its natural components appear to take a more
507: symmetrical, Gaussian, form.  Better matching the shape of the true
508: distribution components to that assumed in the NMR technique will also
509: naturally result in fewer NMR components being required to model the
510: distribution (but see Appendix \ref{sec:nmr}).  The EW extend slightly to
511: negative values, proscribing a simple logarithmic transformation.  We
512: therefore choose the transformation $\W = \log_{10}(\EW + \lambda)$.
513: The zero offset parameter, $\lambda$, must be large enough to make the
514: logarithm argument positive for the most negative EW value in our
515: sample. In constructing our sample we remove outliers by requiring
516: $\EW > -0.4$, thereby clipping the lowest 0.1\% of the sample.
517: Therefore, we must have $\lambda > 0.4$.  We have examined the
518: behaviour of our NMR fits and their likelihood with variations in
519: $\lambda$.  The chosen value has only a relatively small effect,
520: slightly altering the shape of the Gaussian basis functions once they
521: are transformed back into EW space, but not changing our results
522: significantly.  Here we adopt $\lambda = 1.4$ as a compromise between
523: maximising the fit likelihood and ensuring stable behaviour.  We must
524: also choose a reasonable bandwidth for the regression kernel in
525: $\rho$.  Following extensive tests we adopt an adaptive bandwidth
526: enclosing the nearest 5000 points (also see discussion in Appendix
527: \ref{sec:nmr}).
528: 
529: We apply the NMR technique to the distribution of $\W$, and determine
530: the optimum number of components using the Bayesian Information
531: Criterion \citep[BIC;][]{BIC}.  Four components are strongly preferred
532: by the data, by $\Delta$BIC $>$ 7 (see Appendix \ref{sec:nmr} for more
533: details).  In \reffig{fig:w13} (\reffig{fig:w55}) we show the NMR
534: components we obtain for the $\rho_{1.3}$ ($\rho_{5.5}$) sample, at
535: two values of local galaxy density.  The components are plotted in
536: $\W$-space, in which the technique is applied.  We also show the
537: components and data transformed back into $\EW$-space in
538: \reffig{fig:ew13} (\reffig{fig:ew55}).  The properties of these
539: components as a function of environmental density are shown in
540: \reffig{fig:wrho13} (\reffig{fig:wrho55}).  In \reffig{fig:3Dwrho55}
541: we show a three-dimensional view of the components and their sum for
542: the $\rho_{5.5}$ sample, which includes all the relevant information
543: (location, width and relative proportion of each component) in a
544: single plot.  We show the results for the $\rho_{5.5}$ here simply
545: because they are smoother than those for $\rho_{1.3}$, and the
546: individual components are more clearly visible in this
547: three-dimensional view.  It is critical to note that the only data
548: which has been used to determine these components is the $\EW$
549: distribution.
550: 
551: At this stage we make no attempt at interpreting the components as
552: physically distinct populations. Nevertheless, Figs. \ref{fig:w13},
553: \ref{fig:w55}, \ref{fig:ew13}, \ref{fig:ew55} indicate that the $\EW$
554: distribution can be well described by multiple components.  The
555: hypothesis that the galaxy population comprises distinct components,
556: or types, is strongly supported by the various property bimodalities
557: described earlier.  We find that the locations and widths of the
558: components of the $\EW$ distribution are independent of environment.
559: Only the relative proportions of the components are found to vary
560: strongly.  This implies that the variations with environment are
561: primarily the result of differences in the relative frequency of each
562: galaxy type, rather than changes in the intrinsic properties of each
563: type.
564: 
565: Galaxies move to regions of higher density over time, under the
566: influence of gravity.  The variation of galaxy properties with
567: environment is therefore at least partly due to
568: environmentally-dependent changes in individual galaxy properties over
569: time.  If all galaxies in a given environment were affected similarly,
570: we would expect to see smooth changes in the property distributions of
571: each individual component.  However, we find that the individual
572: components remain mostly unchanged with environment.  This implies
573: that some galaxies are transformed directly from one type to another,
574: in an apparently stochastic manner.  If this transformation is
575: sufficiently slow, we would expect to see the transitioning galaxies
576: appearing as a separate component in the relevant range of local
577: density.  If it is rapid, then the fraction of transitioning galaxies
578: at any time would be too low to separate from the main distribution.
579: 
580: \begin{figure}
581: \includegraphics[angle=270,width=0.45\textwidth]{fig5.ps}
582: \caption{\label{fig:bpt13}The BPT diagram for our $\rho_{1.3}$ sample, traditionally
583:   used to identify star-forming galaxies and AGN hosts.  For clarity,
584:   only one-fifth of our sample galaxies are plotted. The \emph{LINER},
585:   \emph{Seyfert 2} and \emph{SF dominated} regions are colour-coded to
586:   match our interpretation of their correspondence to the NMR
587:   components shown in the other figures (purple, blue and green,
588:   respectively).  Note that many galaxies cannot be placed on this
589:   diagram.  These are \emph{passive} galaxies, with no emission lines,
590:   and \emph{uncertain} galaxies, with some detected emission lines,
591:   but not all four of those required for inclusion in this diagram.}
592: \end{figure}
593: 
594: \section{Identifying the components}
595: It is easy to identify the component at zero $\EW$ with passive
596: galaxies, containing no star-formation or AGN activity.  The dominant
597: component at high $\EW$ must be associated with star-forming galaxies
598: (with the above caveats concerning potential AGN contamination).  We
599: also find two intermediate $\EW$ components.  The principle change
600: with environment appears to be the movement of galaxies from the
601: star-forming component to the others, but primarily to the passive
602: component.  However, interpreting either of these intermediate EW
603: components as a population transitioning between star-forming and
604: passive is inconsistent with their existence as a significant fraction
605: of the galaxy population even at low environmental densities.
606: 
607: To explore the physical interpretation of the components we have
608: found, we now turn to more traditional diagnostics to separate the
609: contributions from star formation (SF) and AGN to the emission lines.
610: The BPT diagram for our $\rho_{1.3}$ sample is shown in \reffig{fig:bpt13}.  In
611: order to appear on this plot, all four required emission lines must be
612: detected at $>2$~sigma significance.  The classifications we define
613: are as follows;
614: %
615: \emph{passive}: no emission lines detected,
616: %
617: \emph{SF dominated}: all four lines detected and below the curve of
618: \citet{2006MNRAS.371..972S},
619: %
620: \emph{AGN dominated}: above the line of \citet{2001ApJ...556..121K}
621: with either all four lines detected or with both lines for just one of
622: the ratios detected and ${\rm{[OIII]}}/{\rm{H}\beta} > 0.6$ or
623: ${\rm{[NII]}}/{\rm{H}\alpha} > 0.05$,
624: %
625: \emph{AGN+SF}: all four lines detected and between the curves of
626: \citet{2001ApJ...556..121K} and \citet{2003MNRAS.346.1055K},
627: %
628: \emph{SF+AGN}: all four lines detected and between the curves of
629: \citet{2006MNRAS.371..972S} and \citet{2003MNRAS.346.1055K},
630: %
631: \emph{uncertain}: at least one of the four emission lines detected,
632: but none of the other classification criteria met.
633: %
634: Note that the majority of AGN-dominated galaxies can be robustly
635: identified simply from their ${\rm{[NII]}}/{\rm{H}\alpha}$ ratio
636: \citep{2003ApJ...597..142M,2006MNRAS.371..972S}.
637: 
638: Our classification method is such that galaxies classified as
639: \emph{AGN dominated} must contain a significant AGN component, and
640: will have low contribution to their emission lines from star
641: formation.  On the other hand \emph{SF dominated} galaxies may well
642: also contain up to $\sim 20$--$40$\% AGN contamination in their
643: emission lines \citep{2003MNRAS.346.1055K,2006MNRAS.371..972S}.  The
644: \emph{AGN dominated} galaxies can be further subdivided into
645: \emph{LINER} and \emph{Seyfert 2} sources using the BPT diagram
646: \citep{2003MNRAS.346.1055K}.
647: 
648: \begin{figure*}
649: \includegraphics[clip=True,trim = 3cm 22cm 9.5cm 3.3cm,width=1.0\textwidth]{fig6.ps}
650: \caption{\label{fig:wrho13bpt}The $\W$--$\rho_{1.3}$ distribution for
651:   objects in our sample colour-coded by their location in the BPT
652:   diagram shown in Fig.~4.  The lines indicate the median $\W$ in bins
653:   of $\rho_{1.3}$ for each subsample.  The left panel shows
654:   \emph{passive}, \emph{LINER}, \emph{Seyfert 2} and \emph{SF
655:     dominated} galaxies (in order of increasing $\W$), while the right
656:   panel shows \emph{uncertain}, \emph{AGN+SF} and \emph{SF+AGN}
657:   galaxies (brown, orange and cyan, respectively, and again in order
658:   of increasing $\W$).  A comparison with Fig.~2 reveals a
659:   correspondence between the NMR components and, in order of
660:   increasing $\W$, (1) \emph{passive} galaxies, (2) \emph{LINER} and
661:   \emph{uncertain} galaxies, (3) \emph{Seyfert 2} and \emph{AGN+SF}
662:   galaxies, and (4) \emph{SF dominated} and \emph{SF+AGN} galaxies.}
663: \end{figure*}
664: 
665: Figure \ref{fig:wrho13bpt} shows the $\W$--$\rho_{1.3}$ distributions
666: of galaxies classified using the BPT diagram. Comparing with
667: \reffig{fig:wrho13}, one can clearly identify the NMR components with
668: the \emph{passive}, \emph{LINER}, \emph{Seyfert 2} and \emph{SF
669:   dominated} BPT-classified galaxies.  The large fraction of galaxies
670: for which the BPT diagram gives an uncertain result may also be
671: identified with the components.  The galaxies with apparently mixed
672: star formation and AGN emission are found at similar $\W$ to the
673: \emph{Seyfert 2} objects, and the higher intermediate NMR component.
674: Galaxies with at least one emission line, but which cannot be
675: identified via the BPT diagram have similar $\W$ to \emph{LINER}
676: objects and the lower NMR component.  While not conclusive, this
677: strongly suggests that the components derived from the NMR technique
678: do represent physically distinct populations.  This is remarkable
679: given that the NMR components have been inferred from just a single
680: emission line.
681: 
682: \section{A new insight into the galaxy population}
683: 
684: By applying the newly developed NMR method to the H$\alpha$ equivalent
685: width distribution, a single astrophysical quantity that contains
686: information on both star formation and nuclear activity, we have
687: identified four distinct components in the galaxy population.  None of
688: these components vary significantly with environment, in terms of the
689: distribution of their H$\alpha$ equivalent widths.  However, the relative
690: proportions of galaxies in each component vary substantially with
691: environment.  This implies that any environmental processes at work do
692: not affect all galaxies in a gradual way, which would result in
693: changes in the component H$\alpha$ equivalent width distributions.
694: Rather, they must rapidly transform a fraction of galaxies from one
695: component to another, in a stochastic manner, in order to avoid
696: changing the properties of the individual components.
697: 
698: The above conclusions stand without requiring us to identify the
699: components with more traditional galaxy sub-populations.  However,
700: when we attempt such an identification, we find that the extreme
701: components may be associated with passive and star-forming galaxies,
702: while the two intermediate components display similarities to galaxies
703: hosting LINERs and Seyfert 2 AGN.  Galaxies with an apparent mix of
704: star-formation and AGN may also be identified with these components.
705: However, in contrast to the usual methods of classifying the
706: star-formation and AGN properties of galaxies, which require multiple
707: emission lines to be significantly detected, the technique we describe
708: in this paper is applicable to all galaxies.  We thereby avoid the
709: issue of excluding objects for which traditional methods are
710: uncertain, and the biases which this may introduce.
711: 
712: \section*{Acknowledgements}
713: SPB acknowledges support from an STFC postdoctoral grant.  AR
714: acknowledges the Qatar Foundation for Education, Science and Community
715: Development.  RCN holds a Marie Curie Excellence Chair from the
716: European Commission.  We thank the NSF for funding this
717: inter-disciplinary research through their KDI initiative.
718: Three-dimensional visualisation was conducted with the S2PLOT
719: programming library \citep{2006PASA...23...82B}.  We are grateful to
720: the referee, Dr. Nicholas Ball, for useful comments.
721: 
722: \bsp
723: 
724: % Bibliography generated by BibTeX and pasted in from bbl file
725: %\bibliographystyle{mn2e}
726: %\bibliography{sdss_halpha}
727: \begin{thebibliography}{}
728: \small
729: 
730: \bibitem[\protect\citeauthoryear{{Adelman-McCarthy} et~al.,}{{Adelman-McCarthy}
731:    et~al.}{2006}]{2006ApJS..162...38A}
732: {Adelman-McCarthy} J.~K.,  et~al., 2006, ApJS, 162, 38
733: 
734: \bibitem[\protect\citeauthoryear{{Baldry}, {Balogh}, {Bower}, {Glazebrook} \&
735:   {Nichol}}{{Baldry} et~al.}{2004}]{2004AIPC..743..106B}
736: {Baldry} I.~K.,  {Balogh} M.~L.,  {Bower} R.,  {Glazebrook} K.,    {Nichol}
737:   R.~C.,  2004, in {Allen} R.~E.,  {Nanopoulos} D.~V.,   {Pope} C.~N.,  eds,
738:   The New Cosmology: Conference on Strings and Cosmology Vol.~743 of American
739:   Institute of Physics Conference Series, {Color bimodality: Implications for
740:   galaxy evolution}.
741: pp 106--119
742: 
743: \bibitem[\protect\citeauthoryear{{Baldry}, {Balogh}, {Bower}, {Glazebrook},
744:   {Nichol}, {Bamford} \& {Budavari}}{{Baldry}
745:   et~al.}{2006}]{2006MNRAS.373..469B}
746: {Baldry} I.~K.,  {Balogh} M.~L.,  {Bower} R.~G.,  {Glazebrook} K.,  {Nichol}
747:   R.~C.,  {Bamford} S.~P.,    {Budavari} T.,  2006, MNRAS, 373, 469
748: 
749: \bibitem[\protect\citeauthoryear{{Baldwin}, {Phillips} \&
750:   {Terlevich}}{{Baldwin} et~al.}{1981}]{1981PASP...93....5B}
751: {Baldwin} J.~A.,  {Phillips} M.~M.,    {Terlevich} R.,  1981, PASP, 93, 5
752: 
753: \bibitem[\protect\citeauthoryear{{Ball}, {Loveday} \& {Brunner}}{{Ball}
754:   et~al.}{2006}]{2006astro.ph.10171B}
755: {Ball} N.~M.,  {Loveday} J.,    {Brunner} R.~J.,  2008, MNRAS, 383, 907
756: 
757: \bibitem[\protect\citeauthoryear{{Balogh} et~al.,}{{Balogh}
758:   et~al.}{2004}]{2004MNRAS.348.1355B}
759: {Balogh} M.,  et~al., 2004, MNRAS, 348, 1355
760: 
761: \bibitem[\protect\citeauthoryear{{Balogh}, {Baldry}, {Nichol}, {Miller},
762:   {Bower} \& {Glazebrook}}{{Balogh} et~al.}{2004}]{2004ApJ...615L.101B}
763: {Balogh} M.~L.,  {Baldry} I.~K.,  {Nichol} R.,  {Miller} C.,  {Bower} R.,
764:   {Glazebrook} K.,  2004, ApJL, 615, 101
765: 
766: \bibitem[\protect\citeauthoryear{{Bamford}, {Nichol}, {Baldry}, {Land},
767:   {Lintott}, {Schawinski}, {Slosar}, {Szalay}, {Thomas}, {Torki}, {Andreescu},
768:   {Edmondson}, {Miller}, {Murray}, {Raddick} \& {Vandenberg}}{{Bamford}
769:   et~al.}{2008}]{2008arXiv0805.2612B}
770: {Bamford} S.~P.,  {Nichol} R.~C.,  {Baldry} I.~K.,  {Land} K.,  {Lintott}
771:   C.~J.,  {Schawinski} K.,  {Slosar} A.,  {Szalay} A.~S.,  {Thomas} D.,
772:   {Torki} M.,  {Andreescu} D.,  {Edmondson} E.~M.,  {Miller} C.~J.,  {Murray}
773:   P.,  {Raddick} M.~J.,    {Vandenberg} J.,  2008, ArXiv:0805.2612
774: 
775: \bibitem[\protect\citeauthoryear{{Barnes}, {Fluke}, {Bourke} \&
776:   {Parry}}{{Barnes} et~al.}{2006}]{2006PASA...23...82B}
777: {Barnes} D.~G.,  {Fluke} C.~J.,  {Bourke} P.~D.,    {Parry} O.~T.,  2006,
778:   Publications of the Astronomical Society of Australia, 23, 82
779: 
780: \bibitem[\protect\citeauthoryear{{Beisbart} \& {Kerscher}}{{Beisbart} \&
781:   {Kerscher}}{2000}]{2000ApJ...545....6B}
782: {Beisbart} C.,  {Kerscher} M.,  2000, ApJ, 545, 6
783: 
784: \bibitem[\protect\citeauthoryear{{Blanton}, {Berlind} \& {Hogg}}{{Blanton}
785:   et~al.}{2006}]{2006astro.ph..8353B}
786: {Blanton} M.~R.,  {Berlind} A.~A.,    {Hogg} D.~W.,  2007, ApJ, 664, 791
787: 
788: \bibitem[\protect\citeauthoryear{{Bower}, {Benson}, {Malbon}, {Helly}, {Frenk},
789:   {Baugh}, {Cole} \& {Lacey}}{{Bower} et~al.}{2006}]{2006MNRAS.370..645B}
790: {Bower} R.~G.,  {Benson} A.~J.,  {Malbon} R.,  {Helly} J.~C.,  {Frenk} C.~S.,
791:   {Baugh} C.~M.,  {Cole} S.,    {Lacey} C.~G.,  2006, MNRAS, 370, 645
792: 
793: \bibitem[\protect\citeauthoryear{Cherkassky \& Ma}{Cherkassky \&
794:   Ma}{2005}]{MMR}
795: Cherkassky V.,  Ma Y.,  2005, IEEE Transactions on Neural Networks, 16, 785
796: 
797: \bibitem[\protect\citeauthoryear{Conselice}{2006}]{2006MNRAS.373.1389C} 
798: Conselice C.~J., 2006, MNRAS, 373, 1389 
799: 
800: \bibitem[\protect\citeauthoryear{{Croton}, {Springel}, {White}, {De Lucia},
801:   {Frenk}, {Gao}, {Jenkins}, {Kauffmann}, {Navarro} \& {Yoshida}}{{Croton}
802:   et~al.}{2006}]{2006MNRAS.365...11C}
803: {Croton} D.~J.,  {Springel} V.,  {White} S.~D.~M.,  {De Lucia} G.,  {Frenk}
804:   C.~S.,  {Gao} L.,  {Jenkins} A.,  {Kauffmann} G.,  {Navarro} J.~F.,
805:   {Yoshida} N.,  2006, MNRAS, 365, 11
806: 
807: \bibitem[\protect\citeauthoryear{{Dressler}, {Thompson} \&
808:   {Shectman}}{{Dressler} et~al.}{1985}]{1985ApJ...288..481D}
809: {Dressler} A.,  {Thompson} I.~B.,    {Shectman} S.~A.,  1985, ApJ, 288, 481
810: 
811: \bibitem[\protect\citeauthoryear{{Driver} et~al.,}{{Driver}
812:   et~al.}{2006}]{2006MNRAS.368..414D}
813: {Driver} S.~P.,  et~al., 2006, MNRAS, 368, 414
814: 
815: \bibitem[\protect\citeauthoryear{Ellis et al.}{2005}]{2005MNRAS.363.1257E} 
816: Ellis S.~C., Driver S.~P., Allen P.~D., Liske J., Bland-Hawthorn J., De 
817: Propris R., 2005, MNRAS, 363, 1257 
818: 
819: \bibitem[\protect\citeauthoryear{{Giuricin}, {Samurovi{\'c}}, {Girardi},
820:   {Mezzetti} \& {Marinoni}}{{Giuricin} et~al.}{2001}]{2001ApJ...554..857G}
821: {Giuricin} G.,  {Samurovi{\'c}} S.,  {Girardi} M.,  {Mezzetti} M.,
822:   {Marinoni} C.,  2001, ApJ, 554, 857
823: 
824: \bibitem[\protect\citeauthoryear{{G{\'o}mez} et~al.,}{{G{\'o}mez}
825:   et~al.}{2003}]{2003ApJ...584..210G}
826: {G{\'o}mez} P.~L.,  et~al., 2003, ApJ, 584, 210
827: 
828: \bibitem[\protect\citeauthoryear{{Hogg} et~al.,}{{Hogg}
829:   et~al.}{2002}]{2002AJ....124..646H}
830: {Hogg} D.~W.,  et~al., 2002, AJ, 124, 646
831: 
832: \bibitem[\protect\citeauthoryear{Kass \& Raftery}{Kass \& Raftery}{1995}]{KR95}
833: Kass R.~E.,  Raftery A.~E.,  1995, Journal of the American Statistical
834:   Association, 90, 773
835: 
836: \bibitem[\protect\citeauthoryear{{Kauffmann}, {Heckman}, {Tremonti},
837:   {Brinchmann}, {Charlot}, {White}, {Ridgway}, {Brinkmann}, {Fukugita}, {Hall},
838:   {Ivezi{\'c}}, {Richards} \& {Schneider}}{{Kauffmann}
839:   et~al.}{2003}]{2003MNRAS.346.1055K}
840: {Kauffmann} G.,  {Heckman} T.~M.,  {Tremonti} C.,  {Brinchmann} J.,  {Charlot}
841:   S.,  {White} S.~D.~M.,  {Ridgway} S.~E.,  {Brinkmann} J.,  {Fukugita} M.,
842:   {Hall} P.~B.,  {Ivezi{\'c}} {\v Z}.,  {Richards} G.~T.,    {Schneider} D.~P.,
843:    2003, MNRAS, 346, 1055
844: 
845: \bibitem[\protect\citeauthoryear{{Kewley}, {Dopita}, {Sutherland}, {Heisler} \&
846:   {Trevena}}{{Kewley} et~al.}{2001}]{2001ApJ...556..121K}
847: {Kewley} L.~J.,  {Dopita} M.~A.,  {Sutherland} R.~S.,  {Heisler} C.~A.,
848:   {Trevena} J.,  2001, ApJ, 556, 121
849: 
850: \bibitem[\protect\citeauthoryear{{Kewley}, {Groves}, {Kauffmann} \&
851:   {Heckman}}{{Kewley} et~al.}{2006}]{2006MNRAS.372..961K}
852: {Kewley} L.~J.,  {Groves} B.,  {Kauffmann} G.,    {Heckman} T.,  2006, MNRAS,
853:   372, 961
854: 
855: \bibitem[\protect\citeauthoryear{{Koenker} \& {Bassett}}{{Koenker} \&
856:   {Bassett}}{1978}]{QR}
857: {Koenker} R.,  {Bassett} G.,  1978, Econometrica, 46, 33
858: 
859: \bibitem[\protect\citeauthoryear{{Lewis} et~al.,}{{Lewis}
860:   et~al.}{2002}]{2002MNRAS.334..673L}
861: {Lewis} I.,  et~al., 2002, MNRAS, 334, 673
862: 
863: \bibitem[\protect\citeauthoryear{{McLachlan} \& {Krishnan}}{{McLachlan} \&
864:   {Krishnan}}{1997}]{EM}
865: {McLachlan} G.,  {Krishnan} T.,  1997, The EM algorithm and extensions (Wiley
866:   series in probability and statistics).
867: John Wiley \& Sons
868: 
869: \bibitem[\protect\citeauthoryear{{Miller}, {Nichol}, {G{\'o}mez}, {Hopkins} \&
870:   {Bernardi}}{{Miller} et~al.}{2003}]{2003ApJ...597..142M}
871: {Miller} C.~J.,  {Nichol} R.~C.,  {G{\'o}mez} P.~L.,  {Hopkins} A.~M.,
872:   {Bernardi} M.,  2003, ApJ, 597, 142
873: 
874: \bibitem[\protect\citeauthoryear{{Moustakas}, {Kennicutt} Jr. \&
875:   {Tremonti}}{{Moustakas} et~al.}{2006}]{2006ApJ...642..775M}
876: {Moustakas} J.,  {Kennicutt} Jr. R.~C.,    {Tremonti} C.~A.,  2006, ApJ, 642,
877:   775
878: 
879: \bibitem[\protect\citeauthoryear{{Nakamura}, {Fukugita}, {Brinkmann} \&
880:   {Schneider}}{{Nakamura} et~al.}{2004}]{2004AJ....127.2511N}
881: {Nakamura} O.,  {Fukugita} M.,  {Brinkmann} J.,    {Schneider} D.~P.,  2004,
882:   AJ, 127, 2511
883: 
884: \bibitem[\protect\citeauthoryear{{Park}, {Choi}, {Vogeley}, {Gott} \&
885:   {Blanton}}{{Park} et~al.}{2007}]{2007ApJ...658..898P}
886: {Park} C.,  {Choi} Y.-Y.,  {Vogeley} M.~S.,  {Gott} J.~R.~I.,    {Blanton}
887:   M.~R.,  2007, ApJ, 658, 898
888: 
889: \bibitem[\protect\citeauthoryear{{Richstone}, {Ajhar}, {Bender}, {Bower},
890:   {Dressler}, {Faber}, {Filippenko}, {Gebhardt}, {Green}, {Ho}, {Kormendy},
891:   {Lauer}, {Magorrian} \& {Tremaine}}{{Richstone}
892:   et~al.}{1998}]{1998Natur.395A..14R}
893: {Richstone} D.,  {Ajhar} E.~A.,  {Bender} R.,  {Bower} G.,  {Dressler} A.,
894:   {Faber} S.~M.,  {Filippenko} A.~V.,  {Gebhardt} K.,  {Green} R.,  {Ho} L.~C.,
895:    {Kormendy} J.,  {Lauer} T.~R.,  {Magorrian} J.,    {Tremaine} S.,  1998,
896:   Nature, 395, A14
897: 
898: \bibitem[\protect\citeauthoryear{Schwarz}{Schwarz}{1978}]{BIC}
899: Schwarz G.,  1978, The Annals of Statistics, 6, 461
900: 
901: \bibitem[\protect\citeauthoryear{{Silk}}{{Silk}}{2005}]{2005MNRAS.364.1337S}
902: {Silk} J.,  2005, MNRAS, 364, 1337
903: 
904: \bibitem[\protect\citeauthoryear{{Springel}, {White}, {Jenkins}, {Frenk},
905:   {Yoshida}, {Gao}, {Navarro}, {Thacker}, {Croton}, {Helly}, {Peacock}, {Cole},
906:   {Thomas}, {Couchman}, {Evrard}, {Colberg} \& {Pearce}}{{Springel}
907:   et~al.}{2005}]{2005Natur.435..629S}
908: {Springel} V.,  {White} S.~D.~M.,  {Jenkins} A.,  {Frenk} C.~S.,  {Yoshida} N.,
909:    {Gao} L.,  {Navarro} J.,  {Thacker} R.,  {Croton} D.,  {Helly} J.,
910:   {Peacock} J.~A.,  {Cole} S.,  {Thomas} P.,  {Couchman} H.,  {Evrard} A.,
911:   {Colberg} J.,    {Pearce} F.,  2005, Nature, 435, 629
912: 
913: \bibitem[\protect\citeauthoryear{{Stasi{\'n}ska}, {Cid Fernandes}, {Mateus},
914:   {Sodr{\'e}} \& {Asari}}{{Stasi{\'n}ska} et~al.}{2006}]{2006MNRAS.371..972S}
915: {Stasi{\'n}ska} G.,  {Cid Fernandes} R.,  {Mateus} A.,  {Sodr{\'e}} L.,
916:   {Asari} N.~V.,  2006, MNRAS, 371, 972
917: 
918: \bibitem[\protect\citeauthoryear{{Strateva} et~al.,}{{Strateva}
919:   et~al.}{2001}]{2001AJ....122.1861S}
920: {Strateva} I.,  et~al., 2001, AJ, 122, 1861
921: 
922: \bibitem[\protect\citeauthoryear{{Tremonti}, {Heckman}, {Kauffmann},
923:   {Brinchmann}, {Charlot}, {White}, {Seibert}, {Peng}, {Schlegel}, {Uomoto},
924:   {Fukugita} \& {Brinkmann}}{{Tremonti} et~al.}{2004}]{2004ApJ...613..898T}
925: {Tremonti} C.~A.,  {Heckman} T.~M.,  {Kauffmann} G.,  {Brinchmann} J.,
926:   {Charlot} S.,  {White} S.~D.~M.,  {Seibert} M.,  {Peng} E.~W.,  {Schlegel}
927:   D.~J.,  {Uomoto} A.,  {Fukugita} M.,    {Brinkmann} J.,  2004, ApJ, 613, 898
928: 
929: \bibitem[\protect\citeauthoryear{{Weinmann}, {van den Bosch}, {Yang} \&
930:   {Mo}}{{Weinmann} et~al.}{2006}]{2006MNRAS.366....2W}
931: {Weinmann} S.~M.,  {van den Bosch} F.~C.,  {Yang} X.,    {Mo} H.~J.,  2006,
932:   MNRAS, 366, 2
933: 
934: \bibitem[\protect\citeauthoryear{{Weisberg}}{{Weisberg}}{2005}]{weisberg}
935: {Weisberg} S.,  2005, Applied Linear Regression, 3rd Ed..
936: Wiley/Interscience
937: 
938: \bibitem[\protect\citeauthoryear{{Wolf}, {Gray}, {Arag{\'o}n-Salamanca}, {Lane}
939:   \& {Meisenheimer}}{{Wolf} et~al.}{2007}]{2007MNRAS.376L...1W}
940: {Wolf} C.,  {Gray} M.~E.,  {Arag{\'o}n-Salamanca} A.,  {Lane} K.~P.,
941:   {Meisenheimer} K.,  2007, MNRAS, 376, L1
942: 
943: \bibitem[\protect\citeauthoryear{{Yu} \& {Jones}}{{Yu} \& {Jones}}{1998}]{lQR}
944: {Yu} K.,  {Jones} M.~C.,  1998, Journal of the American Statistical
945:   Association, 93, 228
946: 
947: \bibitem[\protect\citeauthoryear{{Zehavi} et~al.,}{{Zehavi}
948:   et~al.}{2005}]{2005ApJ...630....1Z}
949: {Zehavi} I.,  et~al., 2005, ApJ, 630, 1
950: 
951: \end{thebibliography}
952: 
953: \appendix
954: 
955: \section{Nonparametric mixture regression}
956: \label{sec:nmr}
957: 
958: \begin{figure}
959: \includegraphics[angle=270,width=0.45\textwidth]{fig7.ps}
960: \caption{\label{fig:bic13}Offsets in the Bayesian Information Criterion
961: (BIC) score versus local galaxy density, $\rho_{1.3}$, for NMR fits
962: utilising 2, 3, and 5 components, relative to the favoured 4 component
963: fit.  Where the 5 component fit BIC offset is zero at low
964: $\rho_{1.3}$, the NMR method only uses 4 of the 5 available components
965: as two of the components are degenerate.  Four components are thus
966: preferred, by significantly higher BIC values, at all local densities.}
967: \end{figure}
968: 
969: This is a newly developed statistical method for determining the
970: dependences of one variable, $y$, on another, $x$, where there may be
971: multiple components present in the data, each with a different $y$ on
972: $x$ dependence.  For the analysis presented in the main body of this
973: article we use this technique, putting $x=\rho_{1.3}$ or $\rho_{5.5}$,
974: estimates of the local environmental density, and $y=\W$, a
975: transformed version of the H$\alpha$ equivalent width (see
976: \refsec{sec:Halpha}).  Here we give a technical description of the
977: method.
978: 
979: We model the probability, $f(y|x)$, of $y$ given $x$ as a sum of
980: components, thus
981: \begin{equation}
982: f(y|x;{\bmath{\Theta}}(x)) =
983: \sum_{i=1}^{c(x)}{\pi_i(x) s_i(y|{\bmath{\eta}}_i(x))}
984: \end{equation}
985: where the $s_j(y|\bmath{\eta}_j(x))$, are density functions with
986: a vector of parameters ${\bmath{\eta}}_j(x)$ that depends on $x$,
987: and the $\pi_j(x)$'s are a set of mixing proportions that sums to one
988: for each $x$. In this paper we use Gaussian functions to model the
989: components, each with parameters ${\bmath{\eta}}_i =
990: (\mu_i,\sigma_i$), mean and standard deviation respectively. The
991: number of components is $c(x)$, and may vary as a function of $x$.
992: Gaussians are rich and flexible functions which are highly suited to
993: this task, particularly if one wishes to avoid the danger of overly
994: designing the method to fit one's expectations of the results.
995: 
996: The parameter set, ${\bmath{\Theta}}(x)\left({\bmath{\theta}}_1 (x),
997:   \ldots,{\bmath{\theta}}_{c(x)} (x) \right)=(\pi_1(x), {\bmath{\eta}}_1(x),
998: \ldots, \pi_{c(x)}(x), {\bmath{\eta}}_{c(x)}(x))$, is determined using local
999: likelihood estimation.  The parameters are approximated locally by a
1000: polynomial of degree $p$, and hence vary smoothly with $x$.  The
1001: variation of the parameters can thus be described by a set of
1002: polynomial coefficients, $\bmath{B}$.  These coefficients may
1003: then be constrained by data, weighted using a kernel of bandwidth
1004: $b(x)$ about $x$.
1005: 
1006: The log-likelihood function of the set of polynomial coefficients $\bmath{B}$
1007: given the data is therefore
1008: \begin{eqnarray}
1009: {\cal L}_p(\bmath{B};x,b,c(x)) &=& \sum_{m=1}^{n} w_m(x;b) \times \\
1010: & &
1011: \log_e f(Y_m,x;\bmath{T}(X_m - x,\bmath{B})), \nonumber
1012: \end{eqnarray}
1013: for $n$ measurements labelled by $m$, with locations
1014: $(x,y)=(X_m,Y_m)$.  The set of polynomial functions approximating the
1015: parameters $\bmath{\Theta}$ at $x$ are
1016: \begin{eqnarray}
1017: \lefteqn{\bmath{T}(\delta_m,\bmath{B}) =
1018: \big(t_{1,1}\big(\delta_m, \bmath{\beta}_{1,1}\big), \ldots,
1019: t_{1,1}\big(\delta_m, \bmath{\beta}_{1,q_1}\big), \ldots,}
1020: \nonumber\\
1021: & &
1022: t_{c(x),1}\big(\delta_m, \bmath{\beta}_{c(x),1}\big), \ldots,
1023: t_{c(x),1}\big(\delta_m, \bmath{\beta}_{c(x),q_{c(x)}}\big)\big),
1024: \end{eqnarray}
1025: defining $\delta_m = X_m - x$, with
1026: \begin{equation}
1027: t_{i,j}(\delta_m, \bmath{\beta}_{i,j}) =
1028: \sum_{k=0}^{p}{\beta_{i,j,k} (\delta_m)^k / k!},
1029: \end{equation}
1030: where $i = 1,\ldots,c(x)$ counts over the components, $j =
1031: 1,\ldots,q_i$ counts the parameters of component $i$ (in our case each
1032: density function is a Gaussian with parameters $\mu$ and $\sigma$, and
1033: with mixing weight $\pi$, thus $q_i=3$), and $k = 0,\ldots,p$
1034: counts the degrees of the polynomials used in $\bmath{T}$ to
1035: approximate the parameters $\bmath{\Theta}$.  The
1036: $\beta_{i,j,k}$, and hence their containing sets,
1037: $\bmath{\beta}_{i,j}$ and $\bmath{B}$, correspond to a
1038: particular value of $x$.  Note that the $\beta_{i,j,k}$ give
1039: approximations around $\delta_m = 0$ for the value and $k$-th
1040: derivative of the parameter $j$ of component $i$.  The contribution to
1041: ${\cal L}_p$ of data at distance $\delta_m$ from $x$ is specified by
1042: \begin{equation}
1043: w_m(x; b(x)) = W\left(\frac{X_m - x}{b(x)}\right),
1044: \end{equation}
1045: where $W(z)$ is a weighting function.
1046: 
1047: One can then attempt to determine the $\bmath{B}$ which maximises
1048: the local log-likelihood, ${\cal L}_p$, which we denote
1049: $\widehat{\bmath{B}}(x; b(x), c(x))$, explicitly indicating its dependencies.
1050: Therefore,
1051: \begin{eqnarray}
1052: \label{eqn:betahat}
1053: \lefteqn{\widehat{\bmath{B}}(x; b(x), c(x)) =
1054: \begin{array}{c}\\\mathrm{argmax}\\^{\bmath{B}}\end{array} \sum_{j=1}^{n} w_j(x; b(x))\;\times}\\
1055: &&\log_e \sum_{i=1}^{c(x)} s_i\big(Y_j|t_{i,1}(X_j - x, {\bmath{\beta}}_{i,1}),
1056: \ldots, t_{i,q_i}(X_j - x, {\bmath{\beta}}_{i,q_i})\big).\nonumber
1057: \end{eqnarray}
1058: The local likelihood estimate for the set of parameters is then defined by
1059: $\widehat{\bmath{\Theta}}(x; b(x), c(x)) =
1060: \bmath{T}(0, \widehat{\bmath{B}}(x; b(x), c(x)))$, that is
1061: $\widehat{\theta}_{i,j} (x; b(x), c(x)) =
1062: \widehat{\beta}_{i,j,0} (x; b(x), c(x))$.
1063: Our conditional density estimate given $b(x)$ and
1064: $k(x)$ is therefore
1065: \begin{equation}
1066: \label{eqn:fhat}
1067: {\widehat f}(y|x;b(x), c(x)) \equiv
1068: f(y|x;{\widehat{\bmath{\Theta}}}(x; b(x),c(x))).
1069: \label{fhatHK}
1070: \end{equation}
1071: In general, given $b(x)$ and $c(x)$, the standard method of solving
1072: Eqn.~\ref{eqn:betahat} is to use the Expectation-Maximisation (EM)
1073: method \citep{EM}.
1074: 
1075: The estimator Eqn.~\ref{eqn:fhat} is dependent upon the chosen
1076: bandwidth $b(x)$ and number of components $c(x)$.  If they are \emph{a
1077:   priori} unknown, we must therefore select them in some reliable way.
1078: 
1079: In this work we have chosen the bandwidth for $x=\rho_{1.3}$
1080: or $\rho_{5.5}$ to be a function of the $K$th nearest neighbour.  We
1081: use $K=5000$, selected as a compromise between the smoothness of the
1082: resulting component regression lines and their ability to trace any
1083: variation in $\W$ versus environment.  We have checked that the exact
1084: choice of $K$ (within the range 1000--7500) does not affect our
1085: results.  The optimum number of components was determined using the
1086: Bayesian Information Criterion\citep[BIC;][]{BIC}:
1087: \begin{equation}
1088: \label{eqn:BIC}
1089: \rmn{BIC} =  {\cal L}_p - \frac{1}{2}(3c-1)\log_e(K)
1090: \end{equation}
1091: where ${\cal L}_p$ is the maximised log-likelihood, $c$ is the number
1092: of components, and $K$ is the sample size.  With this definition,
1093: otherwise known as the Schwarz Criterion, the preferred model is that
1094: which maximises the value of BIC.  Note that other definitions
1095: sometimes multiply the right hand side of Eqn. \ref{eqn:BIC} by $-2$.
1096: The difference between the BIC values of two models, $\Delta$BIC,
1097: approximates the natural logarithm of the Bayes factor, a summary of
1098: the evidence for one model over another.  A $\Delta$BIC of 7 indicates
1099: that the preferred model is truly better than the alternative model
1100: with odds better than a thousand to one.  A Bayes factor of $> 150$,
1101: i.e. $\Delta \rmn{BIC} > 5$, is generally taken to be very strong
1102: evidence for the preferred model \citep{KR95}.  Four components are
1103: thus very strongly favoured, by $\Delta$BIC = 147.1, 11.9 and 7.7
1104: versus 2, 3 and 5 components, respectively, averaged over
1105: $\log_{10}\rho_{1.3}$.  The $\Delta$BIC are shown versus $\rho_{1.3}$
1106: in \reffig{fig:bic13}.
1107: 
1108: One might argue that choosing different density functions, other than
1109: Gaussians, or applying a different transformation, would result in our
1110: finding a different optimal number of components.  However, when
1111: varying the $\W = \log_{10}(\EW + \lambda)$ transformation by changing
1112: $\lambda$, and trying various combinations of Gaussians and lognormal
1113: functions in $\EW$-space, the optimum number of components has
1114: consistently turned out to be four.  A careful visual inspection of
1115: the $\EW$ and $\W$ distributions also supports this conclusion.
1116: 
1117: Obviously one could examine the data and devise component density
1118: functions that would result in the NMR method finding any desired
1119: number of components.  However, this defeats the object of employing
1120: the NMR technique.  By `components' we mean simple, distinct elements
1121: of the overall population.  We must therefore make only simple
1122: assumptions and transformations in order to identify them, with
1123: minimal prior reference to the data.
1124: 
1125: If two or more NMR components together represent only a single true
1126: component of the galaxy populations, then we would expect them to
1127: behave identically.  Otherwise, they could not represent a single
1128: component, by definition.  However, our four NMR components each
1129: demonstrate different behaviour with respect to local density,
1130: indicating they are truly distinct (see Figs. 1--3, B1--B3).
1131: 
1132: Finally, the components we find using the NMR technique correspond
1133: remarkably well to traditional galaxy classifications (compare
1134: Figs. \ref{fig:wrho13} \& \ref{fig:wrho13bpt}).  This strongly
1135: supports our interpretation of the NMR components as physically
1136: distinct elements of the galaxy population.  However, the NMR
1137: components have the advantage of being based on all galaxies in our
1138: sample.  Traditional diagnostic diagrams can only be used for objects
1139: with multiple, significantly-detected, emission lines, and in many
1140: cases give ambiguous classifications (e.g. \emph{SF+AGN}).
1141: 
1142: \section{Larger scale environment}
1143: \label{sec:55Mpc}
1144: 
1145: The main text of the paper focuses on a measure of environment using a
1146: kernel of bandwidth $1.3$~Mpc, chosen by cross-validation.  This
1147: bandwidth performs well at the scales of galaxy clusters.  However, at
1148: low densities there is frequently only one galaxy within the kernel,
1149: and the estimator is unable to differentiate between different
1150: low-density environments.  We thus additionally perform our analysis
1151: using local densities estimated using a kernel with $5.5$~Mpc
1152: bandwidth.  The results are very similar to those from the $1.3$~Mpc
1153: densities, and thus our conclusions are robust to the precise
1154: definition of local density.  The figures corresponding to the
1155: $5.5$~Mpc kernel are given in this appendix.
1156: 
1157: \begin{figure*}
1158: \includegraphics[clip=True,trim = 3cm 22.3cm 9.5cm 3.3cm,height=0.30\textheight]{figS1.ps}
1159: \caption{\label{fig:w55}As \reffig{fig:w13}, but for local galaxy densities
1160:   estimated using a $5.5$~Mpc bandwidth kernel, $\rho_{5.5}$.  The
1161:   results are very similar to those found using $\rho_{1.3}$.}
1162: \end{figure*}
1163: 
1164: \begin{figure*}
1165: \includegraphics[clip=True,trim = 3cm 22.3cm 9.5cm 3.3cm,height=0.30\textheight]{figS2.ps}
1166: \caption{\label{fig:ew55}As \reffig{fig:w55}, but shown in terms of
1167:   the untransformed equivalent width, $\EW$.  The inset shows the same
1168:   plot with axis-ranges chosen to better show the behaviour at small
1169:   $\EW$.}
1170: \end{figure*}
1171: 
1172: \begin{figure*}
1173: \includegraphics[clip=True,trim = 3cm 22cm 5cm 3.3cm,height=0.21\textheight]{figS3.ps}
1174: \caption{\label{fig:wrho55}As \reffig{fig:wrho13}, but for local
1175:   galaxy densities estimated using a $5.5$~Mpc bandwidth kernel,
1176:   $\rho_{5.5}$.  The results are very similar to those found using
1177:   $\rho_{1.3}$.}
1178: \end{figure*}
1179: 
1180: \label{lastpage}
1181: 
1182: \end{document}
1183: