0705.4020/ms.tex
1: 
2: 
3: \documentclass[apj]{emulateapj}
4: 
5: 
6: %\documentclass[12pt,preprint]{aastex}
7: 
8: 
9: 
10: \usepackage{graphicx}
11: \usepackage{apjfonts}
12: \usepackage{mathptmx}
13: 
14: 
15: \makeatletter
16: \newenvironment{inlinetable}{%
17: \def\@captype{table}%
18: \noindent\begin{minipage}{0.999\linewidth}\begin{center}\footnotesize}
19: {\end{center}\end{minipage}\smallskip}
20: 
21: \newenvironment{inlinefigure}{%
22: \def\@captype{figure}%
23: \noindent\begin{minipage}{0.999\linewidth}\begin{center}}
24: {\end{center}\end{minipage}\smallskip}
25: \makeatother
26: 
27: 
28: 
29: \def\***#1{{\sc #1}}
30: \def\plan#1{\relax}
31: \def\Plan#1{\relax}
32: \def\PLAN#1{\relax}
33: 
34: \renewcommand{\arraystretch}{1.2}
35: 
36: \newcommand{\gcc}{g~cm$^{-3}\ $}
37: \newcommand{\sfun}[2]{$#1(#2)\ $}
38: \newcommand{\rhonot}{$\rho_{\circ}\ $}
39: \newcommand{\msun}{$M_{\odot}\ $}
40: \newcommand{\greq}{$\stackrel{>}{ _{\sim}}$}
41: \newcommand{\lteq}{$\stackrel{<}{ _{\sim}}$}
42: \def\lta{\mathrel{\spose{\lower 3pt\hbox{$\mathchar"218$}}
43:      \raise 2.0pt\hbox{$\mathchar"13C$}}}
44: \def\gta{\mathrel{\spose{\lower 3pt\hbox{$\mathchar"218$}}
45:      \raise 2.0pt\hbox{$\mathchar"13E$}}}
46: \newcommand{\etal}{{\it et al. }}
47: 
48: \shorttitle{}
49: \shortauthors{}
50: 
51: 
52: \def\mathnew{\mathsurround=0pt}
53: 
54: \def\simov#1#2{\lower .5pt\vbox{\baselineskip0pt \lineskip-.5pt
55: \ialign{$\mathnew#1\hfil##\hfil$\crcr#2\crcr\sim\crcr}}}
56: 
57: \def\simgreat{\mathrel{\mathpalette\simov >}}
58: \def\simless{\mathrel{\mathpalette\simov <}}
59: 
60: 
61: \begin{document}
62: \title{Statistical Evidence for Three classes of Gamma-ray Bursts}
63: 
64: \author{Tanuka Chattopadhyay$^1$, Ranjeev Misra $^2$, Asis Kumar Chattopadhyay$^3$ and Malay Naskar $^4$\\ }
65: 
66: \altaffiltext{1}{Shibpur Dinobundhoo College, 412/1 G.T. Road
67: (South), Howrah 711102, India, Email: tanuka@iucaa.ernet.in}
68: 
69: \altaffiltext{2}{Inter-University Centre for Astronomy and
70: Astrophysics, Post Bag 4, Ganeshkhind, Pune 411007. Email:
71: rmisra@iucaa.ernet.in}
72: 
73: 
74: \altaffiltext{3}{Department of Statistics,Calcutta University,35
75: Ballygunge Circular Road, Kolkata 700019, India.
76: Email:akcstat@caluniv.ac.in}
77: 
78: 
79: \altaffiltext{4}{NIRJAIT, Indian Council of Agricultural
80: Research,12, Regent Park, Kolkata 700 040, India. Email:
81: malaynaskar@yahoo.com}
82: 
83: 
84: 
85: \begin{abstract}
86:  Two different multivariate clustering techniques, the K-means partitioning
87: method and the Dirichlet process of mixture modeling, have been applied to
88: the BATSE Gamma-ray burst (GRB) catalog, to obtain the optimum number of
89: coherent groups. In the standard paradigm, GRB are classified
90: in only two groups, the long and short bursts. However, for both the
91: clustering techniques, the optimal number of classes was found to be three,
92: a result which is consistent with previous statistical analysis.
93:  In this classification,
94: the long bursts are further divided into two groups which are primarily
95: differentiated by their total fluence and duration and hence are named low
96: and high fluence GRB. Analysis of GRB with known red-shifts and spectral
97: parameters suggests that low fluence GRB have nearly constant isotropic
98: energy output of $10^{52}$ ergs while for the high fluence ones, the
99: energy output ranges from $10^{52}$ to $10^{54}$ ergs. It is speculated that the
100: three kinds of GRBs reflect three different origins: mergers of
101: neutron star systems, mergers between white dwarfs and neutron stars,
102: and collapse of massive stars
103: 
104: 
105: \end{abstract}
106: 
107: \keywords{Gamma Rays: Bursts - Methods: Data Analysis - Methods: Statistical}
108: 
109: \section{Introduction}
110: 
111: Although it has now been well established that Gamma-Ray Bursts
112: (GRB) are of cosmological origin, their nature and source still
113: remains a mystery. Detailed observations and studies of their
114: afterglow emission have revealed important information regarding
115: the dynamic features and environments of these explosive events
116: (see \cite{Pir05} for a review). The detection of supernova light
117: curve in the afterglows of long duration nearby GRB has indicated
118: that a  fraction of the GRB occur during the the collapse of a
119: massive star (see \cite{Woo06} for a review). Other mechanism that could produce GRB are the
120: merger of compact objects like a pair of neutron stars or a
121: neutron star with a black hole \citep[e.g.][]{Pir92,Geh05,Blo06}. 
122: Thus GRB may be a heterogeneous
123: group and a  proper classification of the phenomena is crucial to
124: isolate and identify the possible different sources. Such a
125: classification will also enable the identification of spectral or
126: temporal correlations which may exist only for a particular class
127: of GRB.
128: 
129: In general, GRB have been classified into two groups of long ($> 2$ sec)
130: and short ($< 2$ sec) duration bursts.
131: This is based on visual inspection of the distribution of burst duration
132:  which clearly shows two peaks. Theoretically, this may be understood by identifying
133: long bursts with collapsing stars where the duration of the event is linked to
134: the dynamical collapse time-scale. On the other hand, the merger of two neutron
135: stars should occur on short timescales and hence may correspond to the
136: short duration bursts. In this scenario, the long duration bursts should
137: always be associated with a supernova explosion and occur in star burst regions, while
138: short bursts should have no relation to star burst regions and should have no associated
139: supernova. However, theoretically another mechanism to create a GRB could be the
140: merger of neutron stars with white dwarfs. These would be long duration bursts,
141: with no associated supernova and could be a significant fraction of the total
142: observed bursts \citep{Kin07}. Observationally, there have been some evidence that
143: there may be more than two classes of GRB. Some GRB have been recorded with low intrinsic
144: luminosity and do not comply with standard spectral relationships \citep{Saz04,Sod04}.
145: More compelling evidence was the absence of a
146: super-nova light curve in two nearby bursts, GRB 060614 and 060505, which suggested
147: that not all long duration bursts are due to  massive stellar collapses \citep{Geh06,Fyn06}
148: ,although for GRB 060614 this result has been disputed by \cite{Sch06},
149: who claim that this GRB is not a nearby one.
150:  While confirmation of these results are awaited, they do highlight the need to examine
151: the possibility that there are more than two types of GRB.
152: 
153: The Burst and Transient Source Experiment (BATSE) on board the
154: COMPTON Gamma-Ray Observatory (CGRO) has provided spectral and
155: temporal information for more than 1500 GRB. Although BATSE
156: provides several spectral parameters, the bimodal distribution of
157: GRB is based on  univariate analysis i.e. only duration is
158: considered as a parameter \citep[e.g.][]{Dez92,Kou93}. There is a
159: claim that even such a univariate analysis supports the existence
160: of three classes \citep{Hor98}. Classification analysis taking
161: into account more observed parameters, i.e. a multivariate
162: analysis, was first undertaken by \cite{Fei98}. Subsequently,
163: different type of analysis were undertaken to classify GRB.
164: \cite{Bau94} used a neural network technique while \cite{Bag98}
165: have undertaken a factor analysis. Nonparametric hierarchical
166: clustering techniques have been used by \cite{Muk98} for 797 GRB
167: with six variables and by \cite{Bal01} for 1599 GRB with nine
168: variables. The results were confirmed for the complete BATSE GRB
169: catalog \citep{Hor02}. \cite{Hak03} used a unsupervised pattern
170: recognition algorithm while \cite{Hor06} have classified GRB by
171: fitting bivariate distributions to the observed duration and
172: hardness ratios. In all these cases, the authors have claimed the
173: existence of at least three classes of GRB although it is not
174: clear whether the different classifications found are consistent
175: with each other, primarily because of the different techniques and
176: the choice of different observed parameters and data sets.
177: 
178: These analyses are based on the observed properties of the GRB and
179: are hence subject to observational biases. In fact, \cite{Hak00}
180: argue that such classification techniques are significantly
181: hampered and the three classes found is probably due to  such
182: biases (see \cite{Hor06} for a counter argument). Indeed, a proper
183: classification should be based on intrinsic rather than observed
184: properties. However since the number of GRB with known red-shift
185: and uniformly measured temporal and spectral parameters is small,
186: such an exercise is presently not possible. Although it can be
187: argued that certain correlations between observed parameters (e.g.
188: the positive correlation between duration and flux) cannot be
189: entirely due to observational bias, it is difficult to ascertain
190: whether the quantitative relationships are not affected. A prudent
191: approach may be to treat such classification as indicative of the
192: nature of systems which should be corroborated by theoretical
193: expectations and further observations. Since theoretical
194: expectations are on intrinsic properties, there is a need to
195: estimate how these intrinsic properties cluster, given the results
196: of a classification based on observed ones. The classification
197: itself should be tested for robustness using different types of
198: schemes. The result of such an analysis may guide or be supported
199: by theoretical models. Moreover, the results can lay down the
200: broad framework and requirements of future observations which can
201: confirm or rule out the proposed  theoretical scenario.
202: 
203: In this work, we use two different multivariate clustering
204: techniques, the K-means partitioning method and the Dirichlet
205: process of mixture modeling, to classify GRB based on their
206: observed properties. These two schemes, which have not been used
207: before for GRB, have the advantage that they do not follow any
208: prior assumption about the number of homogeneous classes. The
209: optimum classification comes out of the process itself. The two
210: schemes allow for post classification discriminant analysis which
211: can be used to verify the acceptability of the classification by
212: computing classification/misclassification probabilities. More
213: importantly, a  GRB which is not in the original sample and which
214: has only a subset of the observed properties used for the
215: classification can be assigned a probability for it to be a member
216: of a certain class. Thus although our analysis has been based on
217: the BATSE catalog, GRB with known red-shifts (which have been
218: observed by other instruments) can be assigned such probabilities
219: and suitably classified. As we shall see, this not only provides
220: qualitative estimation of any possible  observational bias, but
221: can constrain the intrinsic properties of the different clusters.
222: We have selected 21 GRB with known redshifts and well constrained
223: spectral parameters for such classifications and obtained
224: constrains on the cluster's average luminosity.
225: 
226: In the next section, we briefly describe the two classification
227: schemes and present the result of the analysis on the BATSE
228: catalog. In \S 3, the classification obtained from the BATSE data
229: are used to classify GRB with known red-shifts and inferences are
230: made on the intrinsic properties of the different GRB groups. In
231: \S 4 the work is summarized and the main results are discussed.
232: 
233: 
234: \section{Clustering Analysis for GRB data}
235: 
236: The BATSE catalog provides temporal and spectral information for more than 1500
237: GRB. The parameters include, two measures of burst
238: durations, the times within which 50\% ($T_{50}$) and 90\%
239: ($T_{90}$) of the flux arrive, three peak fluxes, $P_{64},
240: P_{256}, P_{1024}$ measured in 64, 256 and 1024 ms bins
241: respectively, four time integrated fluences $F_1-F_4$, in the
242: $20-50$, $50-100$, $100-300$ and $ > 300$ KeV spectral channels.
243: Many of the parameters are highly correlated and following previous
244: works \citep[e.g.][]{Muk98,Hak00} we use the following six parameter
245: set: log$T_{50}$,
246:  log$T_{90}$,  log$P_{256}$,  log$F_T$,  log$H_{32}$,  log$H_{321}$,
247: where  $F_T = F_1 + F_2 +F_3 +F_4$ is the total fluence while  $H_{32}=F_3/F_2$ and
248: $H_{321}=F_3/(F_1+F_2)$ are measures of spectral hardness. The sample consists of
249: $1594$ GRB that have non-zero detections of these parameters. We have not introduced
250: any completeness criteria (like a lower flux cutoff), since incompleteness primarily affects
251: the short duration bursts and hence is not expected to change the qualitative results
252: obtained.  We retain the $F_4$ flux (in the definition of $F_T$),
253: despite the uncertainties in its calibration and
254: sensitivity, because as we discuss later in \S 4, the $\gamma$-ray flux $> 300$ keV
255: is expected to have important spectral information. However, this fluence is not used
256: in the computation of spectral hardness.
257: 
258: \subsection{Partitioning (K-means clustering) method}
259: 
260: 
261:  Over the last several decades, different algorithms
262: have been developed for Cluster Analysis which is used to find groups in
263: a multivariate data set. The choice of a clustering algorithm
264: depends both on the type of data available and on the particular
265: purpose. Generally, clustering algorithms can be divided into two
266: principal types viz. partitioning and hierarchical methods.
267: A partitioning method constructs K clusters i.e. it classifies
268:  the data into K groups which together satisfy the requirement of
269:  a partition such that each group must contain at least one object and
270:  each object must belong to exactly one group. So there are at
271:  most as many groups as there are objects ($K <=n$). Two different
272:  clusters cannot have any object in common and the K groups
273:  together add up to the full data set.  The aim
274:  is usually to uncover a structure that is already present in the
275:  data.  On the other hand,
276: Hierarchical algorithms do not construct single partition with K
277:   clusters but they deal with all values of K in the same run. The
278:   extreme partitions with $K=1$ (all objects are
279:   together in one cluster)and $K = n$ (where each object forms a separate cluster)
280: is a part of the output. In between all values of
281:   $K=2,3,...n-1$ are covered in a kind of gradual transition. The
282:   only difference between $K=r$ \& $K=r+1$ is that one of the r
283:   clusters splits in order to obtain $r+1$ clusters or two of the
284:   $(r+1)$ clusters combined to yield $r$ clusters. In this method
285:   either one starts with $K=n$ and move hierarchically downwards
286:   where at each step two clusters are merged depending on
287:   similarity until only one is left i.e. $K=1$ (agglomerative) or
288:   the reverse way where one starts with $K=1$ and moves upwards
289:   where at each step one cluster is divided into two (depending on
290:   dissimilarity) until $K=n$ (divisive). Most of the previous
291:   works  on GRB \citep[e.g.][]{Muk98,Hak00,Bal01} have been based on
292: hierarchical clustering. However, for GRB classification, a
293:  partitioning  method may be more applicable because
294:   (a) it tries to select the best clustering with K
295:   groups which is not the goal of a hierarchical method,
296:   (b) a hierarchical method can never repair what was done in
297:   previous steps and (c) partitioning methods are designed to group items rather than
298:   variables into a collection of K clusters.
299: 
300: In this work, we apply the K- means method of \cite{Mac67} which is probably the
301:  most widely technique, to the BATSE catalog.  For this method,
302: the optimum value of K  can be obtained in different ways \citep{Har75}.
303: This is done by computing for each cluster formation ( i.e for number
304: of clusters $K = 2,3,4..$) a distance measure
305:  $d_{K} = (1/p) min_{x} E[( x_{K} - c_{K} )^{'}(x_{K} - c_{K})]$ which is defined
306: as the distance of
307: the $x_{K}$ vector (values of the parameters) from the center of a cluster
308: $c_{K}$ (which is estimated as mean value). $p$ is the order of the
309: $x_{K}$ vector, i.e. the number of parameters which for our case is six.
310: If $d_{K}^{\prime}$ is the estimate of $d_{K}$ at the
311: $K^{th}$ point, then $d_{K}^{\prime}$ is the minimum achievable
312: distortion associated with fitting K centers to the data.
313: A natural way of choosing the number of clusters is to plot
314: $d_{K}^{\prime}$ versus K and look for the resulting distortion
315: curve. This curve will monotonically decrease with increasing K, till
316: K is greater than true number of clusters, after which the curve will
317: level off with a smaller slope. This is expected since adding more clusters
318: beyond the true number, will simply create partitions within a group.
319: According to \cite{Sug03} it is more illustrative to consider the
320: transformation of the distortion curve to an appropriate negative power,
321: $J_{K} = (d_{K}^{\prime-(p/2)}  - d_{K-1}^{\prime-(p/2)}$), which will exhibit a
322: sharp "jump" when K equals the true number of clusters.
323: The optimum number of clusters is the value of  K at which the
324: distortion curve levels off as well as its value associated with
325: the largest jump for the transformed curve.
326: 
327: \begin{figure}
328: \begin{center}
329: {\includegraphics[width=1.0\linewidth,angle=0]{f1.eps}}
330: \end{center}
331: \caption{The distortion curve $d_{K}^{\prime}$ and the transformed jump curve $J_K$ for
332: different number of clusters. The largest value of $J_K$ and the leveling of $d_{K}^{\prime}$
333: indicates that for the K-means clustering technique the optimal number of clusters is three. }
334: \label{f1}
335: \end{figure}
336: 
337: 
338: Fig \ref{f1} shows the distortion curve and transformed jump curve
339: for the analysis of the BATSE data. The leveling of the distortion
340: curve and the shape of the jump function strongly suggests that
341: the optimum number of clusters is greater than two and is likely
342: to be three. The group means and the standard errors for the six
343: parameters for three cluster classification, are tabulated in
344: Table \ref{t1}. Cluster I with 423 members has an average
345: $<T_{90}> \sim 0.5$ s can be clearly identified with the short
346: duration bursts. The long duration bursts are cleanly separated
347: into two clusters (clusters 2 and 3) with 622 and 549 members.
348: 
349: 
350: \begin{deluxetable} {lrrr}
351: \tablewidth{0pt}
352: \tablecaption{Average Cluster properties based on K-means classification}
353: \tablehead{
354: \colhead{Parameters} & \colhead{Cluster I} & \colhead{Cluster II} &
355: \colhead{Cluster III} }
356: \startdata
357: $T_{50}$ (sec) &$0.19\pm0.01$& 5.37$\pm$ 0.25&22.9$\pm$ 1.0\\
358: $T_{90}$ (sec)&0.50$\pm$ 0.02&15.85$\pm$ 0.73&63.1$\pm$ 2.9\\
359: $P_{256}$ (\#/cm$^2$/sec) &1.66$\pm$ 0.08& 1.26$\pm$ 0.06& 2.88$\pm$ 0.13\\
360: $F_T$ ($\times 10^{-6}$ ergs/sec)&$0.62\pm{0.04}$& $2.34\pm{0.11}$&$17.8\pm{0.8}$\\
361: $H_{32}$&5.50$\pm$ 0.13& 2.45$\pm$ 0.06& 3.16$\pm$ 0.07\\
362: $H_{321}$&3.39$\pm$ 0.08& 1.32$\pm$ 0.03& 1.78$\pm$ 0.04\\
363: \enddata
364: \tablecomments{Errors quoted are standard errors. The number of members are
365: 423, 622 and 549 for Clusters I,II and III respectively.}
366: \label{t1}
367: \end{deluxetable}
368: 
369: 
370: 
371: 
372: Once the optimum classification (clustering) is obtained, using a
373: process called Discrimination Analysis \cite{Joh96}, one can
374: verify the acceptability of the classification by computing
375: classification/misclassification probabilities for the different
376: GRB.  Although the K-means clustering method is purely a data
377: analytic method, for classification it may be necessary to assume
378: that the underlying distribution is Multivariate Normal. In this
379: standard procedure, using the probability density functions in
380: parameter space for the different clusters, one can assign an
381: object (in this case a GRB) to be a member of a certain class. If
382: the original classification was robust, then every GRB
383: should be classified again as a member of the same class that it was
384: before. If a
385: significant number of objects are not reclassified then that would
386: mean that the original classification was not stable and hence not
387: trustworthy. Table \ref{t2} show the result of a Discrimination
388: Analysis, where the columns represent how the GRB of a cluster
389: were assigned by the analysis. The
390:  fraction of correct classification is $0.954$ which implies that the classification is
391: indeed robust.
392: 
393: \begin{deluxetable} {lccc}
394: \tablewidth{0pt}
395: \tablecaption{Discriminant Analysis for the  K-means classification}
396: \tablehead{
397: \colhead{} & \colhead{Cluster I} & \colhead{Cluster II} &
398: \colhead{Cluster III} }
399: \startdata
400: Cluster $I^*$ &417& 28&  0\\
401: Cluster $II^*$ &  6&578& 23\\
402: Cluster $III^*$ &  0& 21&526\\
403: &&&\\
404: Total         &423&622&549\\
405: \enddata
406: \tablecomments{Clusters I,II and III, are the clusters obtained from the K-means classification.
407: Clusters $I^*,II^*$ and $III^*$ are the clusters to which the GRB were assigned by the Discriminant
408: analysis.}
409: \label{t2}
410: \end{deluxetable}
411: 
412: 
413: 
414: 
415: 
416: 
417: 
418: 
419: 
420: 
421: \subsection{Dirichlet process model based clustering}
422: 
423: 
424: The standard approach to  model-based clustering analysis, is
425: based on modeling by finite mixture of parametric distributions.
426: For example, \cite{Muk98} used such a model based approach to analyze
427: GRB data where they assumed that the GRB population
428: consists of mixture of multivariate Gaussian classes. The number
429: of classes is, however, determined from an initial classification
430: method (e.g. via agglomerative hierarchical clustering).  The
431: Dirichlet process model based clustering is more general and avoids the
432: assumption of known number of possible classes. Since this method
433: is less commonly used as compared to the K-means technique, we describe here
434: the basic concept on the analysis in more detail.
435: 
436: The Dirichlet process avoids a prior assumption of the number of classes
437: by applying a Bayesian
438: nonparametric modeling of the unknown distribution for the multi component data. In this
439: particular case the six component GRB data can be represented by
440: $x_i=($log$T_{50},$log$T_{90},$log$F_{T},$log$P_{256},$log$H_{321},$log$H_{32})^\prime,i=1,2,\ldots,n$.
441: More specifically, $x_i$ is assumed to follow a multivariate normal
442: distribution whose mean vector is generated from a Dirichlet
443: Process (DP). Following \cite{Esc95}, the method is
444: best conceptualized by representing the model as
445: \begin{eqnarray}\label{eq:model}
446: x_i|\mu_i, \Sigma & \sim & \hbox {MVN}(\mu_i,\Sigma), \;\;\;\; i=1,2,\ldots,n; \nonumber \\
447: \mu_i|G & \sim & G ; \;\;\;\;\;\; G \sim DP(\alpha G_0)
448: \end{eqnarray}
449: where MVN means multivariate normal distribution, $G$ is a
450: discrete measure of the unknown  distribution, $\alpha$ is the precision
451: parameter and $G_0$ is a known base measure distribution. Since
452: $G$ is discrete, there can be ties among the $\mu_i$'s , which can
453: also be seen from polya urn representation of \cite{Bla73} as
454: \begin{equation}\label{eq:bm}
455: \mu_i|\mu_1,\mu_2,\ldots,\mu_{i-1} \sim
456: \frac{\alpha}{\alpha+i-1}G_0 + \frac{1}{\alpha+i-1}
457: \sum_{h=1}^{i-1} \delta(\mu_h)
458: \end{equation}
459:  where $\delta(x)$ is the distribution concentrated at the single
460:  point $x$. It is evident from Eqn. (\ref{eq:bm}) that $\mu_i$ are
461: marginally sampled from $G_0$ with positive probability and that some
462: of the $\mu_i$'s are identical. Thus, a partition of
463: $S=\{1,2,\ldots,n\}$ can be formed by defining classes under the
464: relation that $\mu_i$ belongs to the $j^{th}$ class if and only if
465: $\mu_i=\mu_j , j=1,2,\ldots,k$, $k$ being the number of distinct
466: $\mu_i$'s $, i=1,2,\ldots,n$. This induces a certain posterior
467: distribution of $S$ and a posterior inference  can then be used to
468: provide clustering procedure. There are various algorithms
469: available to obtain the posterior partitions of $S$ which are
470: useful for making inferences on clustering of $x_1,x_2, \ldots,
471: x_n$. We implement  the independent and identically distributed
472: Weighted Chinese Restaurant (iidWCR) algorithm \citep[see][]{Ish02}
473: which comes from its use of the partition
474: distribution of $S$. Let $p=\{C_1,C_2,\ldots,C_{n(p)}\}$ be a
475: partition of size $n(p)$ of $S$, where each $C_j$ contains $e_j$
476: elements. Assuming $G_0 $ as the multivariate normal with mean
477: vector $m$ and covariance matrix $B_0$ and denoting
478:  $N_p(x;\mu,\Sigma)=(2\pi)^{-\frac{p}{2}}|\Sigma|^{-\frac{1}{2}}
479:  exp[-\frac{1}{2}(x-\mu)^\prime \Sigma^{-1}(x-\mu)]$
480: as the density of a $p$-component multivariate normal
481: distribution, the iidWCR algorithm for inducing posterior
482: partition of $S$ consists of following steps:
483: \begin{description}
484: \item[\emph{Step 1:}] Assign $p_1 = \{1\}$ and the corresponding
485: importance weight $\lambda (1)=N_6(x_1;m,\Sigma_0+B_0)$ where
486: $\Sigma_0$ is the initial estimate of $\Sigma$.
487: 
488: \item[\emph{Step r:}] Given $p_{r-1}$, compute $\Sigma_{r-1}$ from
489: $x_1,x_2,\ldots, x_{r-1}$. Create $p_r$ by assigning label $r$ to
490: a new set with probability $\frac{\alpha}{(\alpha+r-1)\lambda(r)}
491: \times N_6(x_r;m,\Sigma_{r-1}+B_0)$. Otherwise, assign label $r$
492: to an existing set $C_{j,r-1}$ with probability $
493: \frac{e_{j,r-1}}{(\alpha+r-1)\lambda(r)} \times
494: N_6(x_r;\mu_{j,r-1}, \Sigma_{j,r-1})$ where
495: $\Sigma_{j,r-1}=(B_0^{-1} +e_{j,r-1}\Sigma_{r-1}^{-1})^{-1}$ and
496: $\mu_{j,r-1}=\Sigma_{j,r-1}(B_0^{-1}+e_{j,r-1}\Sigma_{r-1}^{-1}\overline{x}_{j,r-1})$.
497: Note that $e_{j,r-1}$ and $\overline{x}_{j,r-1}$ are the  number
498: of elements and observed mean in $C_{j,r-1}$ respectively and
499: $\lambda(r)$ is the normalizing constant.
500: \end{description}
501: Running step 1 followed by step $r$ for $r=2,3,\ldots,n$ gives a
502: draw from posterior partition of $S$. This $n-step$ draw, in fact,
503: provides an iid sample from WCR density given by
504: \begin{equation}
505: g(p) = \frac{f(x|p)\pi(p)}{\Delta(p)}\nonumber
506: \end{equation}
507: where $\pi(p)$ is the prior density of $p$, f(.) is the density of
508: $x$ and $\Delta(p)=\lambda(1) \times \lambda(2) \times \ldots
509: \times \lambda(n)$ is the importance weight. Repeating the above
510: algorithm $B$ times, one can obtain $p^1,p^2,\ldots,p^B$ iid
511: sample observations from posterior partition of $S$. Based on
512: these sample observations, Monte Carlo method can be devised to
513: estimate $E\{n(p)\}$, the expected number of clusters, as
514: \begin{equation}\label{eq:est-cluster}
515: \widehat{E\{n(p)\}} \approx \frac{\sum_{b=1}^B n(p^b)
516: \Delta(p^b)}{\sum_{b=1}^B  \Delta(p^b)}
517: \end{equation}
518: The key advantages of using this Dirichlet process model-based
519: clustering are that  the underlying distribution of $x_i$'s and
520: the number of clusters are unknown. Moreover, one can provide an
521: estimate of the expected number of clusters by using Eqn.
522: (\ref{eq:est-cluster}).
523: 
524: \begin{deluxetable} {lrrr}
525: \tablewidth{0pt}
526: \tablecaption{Average Cluster properties based on the Dirichlet Mixture Modeling method }
527: \tablehead{
528: \colhead{Parameters} & \colhead{Cluster I} & \colhead{Cluster II} &
529: \colhead{Cluster III} }
530: \startdata
531: $T_{50}$ (sec) &0.31$\pm$ 0.02& 6.76$\pm$ 0.31&16.22$\pm$ 1.50\\
532: $T_{90}$ (sec)& 0.45$\pm$ 0.03&19.05$\pm$ 0.88&43.65$\pm$ 3.02\\
533: $P_{256}$ (\#/cm$^2$/sec) &1.66$\pm$ 0.08& 1.35$\pm$ 0.03& 4.79$\pm$ 0.33\\
534: $F_T$ ($\times 10^{-6}$ ergs/sec)&$0.89\pm{0.06}$& $3.46\pm{0.08}$&$18.2\pm{0.2}$\\
535: $H_{32}$&4.68$\pm$ 0.22& 2.82$\pm$ 0.06& 3.31$\pm$ 0.15\\
536: $H_{321}$&2.75$\pm$ 0.13& 1.58$\pm$ 0.04& 1.86$\pm$ 0.09\\
537: \enddata
538: \tablecomments{Errors quoted are standard errors. The number of members are
539: 409, 892 and 293 for Clusters I,II and III respectively.}
540: \label{t3}
541: \end{deluxetable}
542: 
543: 
544: For fitting the model, we used $\alpha=1.0$ and a flat
545: prior $G_0 \sim N_6(0,\sigma^2\mathbf{I})$ with $\sigma^2=1000$.
546: The initial value of $\Sigma$, $\Sigma_0$ is obtained as the
547: sample covariance matrix. We applied the iidWCR algorithm for
548: $B=1000$ to obtain the estimate of the number of clusters.
549: Three classes were obtained consistent with the results from
550: the K-means technique.    In Table \ref{t3} the mean values of
551: the parameters with errors are tabulated. The values are consistent
552: with those found from the K-means method.
553: The number of members of cluster II, 892 is somewhat larger
554: than what was found by the K-means method, 622, but considering the different nature and
555: approach of the two techniques, such differences are perhaps expected. In summary,
556: these  two independent
557: clustering techniques indicate that there are three classes of GRB with
558: qualitatively similar properties.
559: 
560: 
561: 
562: 
563: \section{Classification of GRB with known red-shift}
564: 
565: 
566: Although the classification described in the previous section is based on six GRB parameters
567: the segregation of the classes can be visualized using the total fluence, $F_T$ and the
568: duration, $T_{90}$. This is illustrated in Fig (\ref{f2}) which shows  $F_T$ versus
569: $T_{90}$ for the members of the three clusters obtained using the K-means technique. The
570: Dirichlet process model gives qualitatively similar results. The solid line representing
571: $T_{90} = 2$ sec, differentiates the members of Cluster I (marked
572: by triangles) with those of Cluster II (marked by filled circles). Thus Cluster I is consistent
573: with the standard classification of short duration bursts. The standard long duration bursts
574: (with $T_{90} > 2$ s) are further classified into two groups with one of them (members
575: of Cluster III, marked using open circles) having typically higher fluence. Thus we have named
576: members of Cluster II and III as low and high  fluence GRB. The
577: solid line representing $F_T = 1.6 \times 10^{-4}/T_{90}$ ergs/cm$^2$ qualitatively separates two groups. There are eight GRB detected by BATSE for which
578: there are redshift estimates \citep[e.g.][]{Bag03}. These are marked by
579: squares in Figure 2. Six of them are in Cluster III while two are close to
580: the demarking line. One of these GRB (980425) is at a low redshift 
581: ($z = 0.0085$) and is associated with a supernova. 
582: 
583: \begin{figure}
584: \begin{center}
585: {\includegraphics[width=1.0\linewidth,angle=0]{f2.eps}}
586: \end{center}
587: \caption{The fluence $F_T$ in ergs/cm$^2$ versus the duration $T_{90}$ in seconds for members
588: of Cluster I (triangles), II (solid circles) and III (open circles) from the 
589: K-means clustering method. The solid lines represent
590: $T_{90} = 2$s and $F_T = 1.6 \times 10^{-4}/T_{90}$ ergs/cm$^2$ which qualitatively 
591: separate the three groups. The solid squares represent eight GRB detected by 
592: BATSE for which redshifts are also measured \citep[e.g.][]{Bag03}. }
593: \label{f2}
594: \end{figure}
595: 
596: \begin{figure}
597: \begin{center}
598: {\includegraphics[width=1.0\linewidth,angle=0]{f3.eps}}
599: \end{center}
600: \caption{ The fluence $F_T$ in ergs/cm$^2$ versus the duration $T_{90}$ in
601: seconds for $21$ GRB (filled circles) with known redshifts taken from
602: \cite{Ghi04} and references therein.  The solid lines represent $T_{90} = 2$s and $F_T =
603: 1.6 \times 10^{-4}/T_{90}$ which qualitatively distinguish the
604: three clusters (Fig \ref{f2}). For each GRB, the dotted lines
605: represent the predicted fluence and duration if it was located at
606: a redshift range $0.1 < z < 5$. The open triangles represent the
607: predicted fluence and duration if the GRB were located at
608: redshift $z = 1$. These predicted values allow for the
609: classification of the GRB into members of Cluster II and III. }
610: \label{f3}
611: \end{figure}
612: 
613: 
614: 
615: 
616: 
617: To identify a GRB with  a known red-shift as a member of a
618: cluster, broad band coverage of the prompt emission is required in
619: order to correctly estimate the total fluence $F_T$, which BATSE
620: would have observed for the burst. This is particularly important,
621: when the peak of the energy spectrum of a GRB is at high energies
622: $> 300$ keV. \cite{Ama02} analyzed GRB with known red-shifts and
623: well constraned spectral parameters over a broad energy range and
624: discovered that the intrinsic (i.e. red-shift corrected) peak of
625: the energy spectrum, $E_p$ correlates with the isotropic energy
626: output, $E_{iso}$. Apart from being a stringent condition and test
627: for any theoretical model that describes the GRB prompt emission,
628: this empirical relation highlights the possibility that GRB can be
629: used to probe and constrain the expansion of the Universe at early
630: times. \cite{Ghi04} added more GRB to the sample and  found that
631: the the beaming corrected luminosity, $E$ has a tighter
632: correlation with $E_p$ than the isotropic one. Nevertheless, there
633: is still significant dispersion in the relationship which needs to
634: be explained. In order to see how the classification  found in
635: this work affects such relationship, we  have used 21 GRB listed
636: in Table 1 of \cite{Ghi04} that have well measured temporal and
637: spectral parameters. The total fluence $F_T$ versus the duration
638: $T_{90}$ for these GRB are plotted in Fig. (\ref{f3}) (filled
639: circles). Overlaid on the plot are the two solid lines
640:  $F_T = 1.6 \times 10^{-4}/T_{90}$ ergs and $T_{90} = 2$s which qualitatively segregate the three
641: clusters (Fig \ref{f2}). Most of the GRB have high fluence and are of long duration (consistent with them
642: being members of Cluster III) which is probably a selection effect.  Two of the GRB have $F_T < 1.6 \times 10^{-4}/T_{90}$ ergs/cm$^2$
643: and are probably members of Cluster II. However, there are three GRB with
644:  $F_T \sim 1.6 \times 10^{-4}/T_{90}$ ergs/cm$^2$ and hence there is an ambiguity about their classification.
645: For each GRB, a corresponding dotted line is plotted in
646: Fig (\ref{f3}) which shows the variation of the observed $F_T$ versus $T_{90}$ if the same GRB was
647: located at different red-shifts. The lines are drawn for a red-shift range $0.1 < z < 5.0$ for a
648: $\Lambda$ CDM cosmology with $\Omega_\Lambda = 0.7$ and Hubble parameter $H = 65$ km/s/Mpc.
649: It is interesting to note that the red-shift trajectories for the high fluence GRB in general do
650: not cross the $F_T = 1.6 \times 10^{-4}/T_{90}$ line. In other words, if these GRB were located
651: at a wide range of red-shifts, their observed fluence would have been $F_T > 1.6 \times 10^{-4}/T_{90}$
652: and hence they would have been classified as members of Cluster III. On the other hand there are six
653: GRB whose red-shift trajectories mostly lie below the de-marking line. Their  observed fluence would be
654: $F_T < 1.6 \times 10^{-4}/T_{90}$ for a wide range of red-shifts and hence they would be classified
655: as members of Cluster II. The open triangles in the Figure mark the positions of the GRBs if they were
656: all located at a red-shift, $z = 1.0$. In this representation, the GRB are more clearly segregated into Clusters
657: II and III with five of them having a  predicted fluence less
658: than  $F_T > 1.6 \times 10^{-4}/T_{90}$ with the others being significantly brighter. This strongly suggests
659: that the classification described in this work is not due to observational bias arising from the use
660: of observed parameters instead of intrinsic ones.
661: 
662: \begin{figure}
663: \begin{center}
664: {\includegraphics[width=1.0\linewidth,angle=0]{f4.eps}}
665: \end{center}
666: \caption{ The intrinsic (red-shift corrected) peak of the energy flux, $E_{peak}$ versus the
667: isotropic energy output for 21 GRB with well defined spectral parameters \citep{Ama02,Ghi04}.
668: GRB that are members of Cluster II (triangles) have isotropic energy output of nearly
669: $10^{52}$ ergs, while members of Cluster III (circles) have a much wider range of energy
670: output.}
671: \label{f4}
672: \end{figure}
673: 
674: 
675: 
676: We classify the 21 GRB according to their predicted observed fluence and duration if they were situated
677: at $z = 1$. In this scheme, five GRBs
678: are classified as members of Cluster II while the remaining 18 are identified as
679: members of Cluster III. Fig (\ref{f4}) shows the variation of the intrinsic energy peak $E_{peak}$ versus
680: the isotropic energy realized, $E_{iso}$, which is the correlation discovered by \cite{Ama02}. GRB identified
681: as Cluster I are marked as triangles while those belonging to Cluster III are represented by filled circles.
682: GRB belonging to Cluster II all
683: have an isotropic energy output close to $\sim 10^{52}$ ergs, while those belonging  to Cluster III span
684: a much larger range of energy $10^{52-54}$ ergs and roughly follow the $E_p-E_{iso}$ correlation. Although the
685: number of GRB in this sample is small, this segregation of the GRB in intrinsic parameter space is a
686: another indication that the classification is robust and perhaps not due to observational bias.
687: Most of the members of Cluster III have a rest frame peak energy $E_{peak} > 300$ keV and hence would have
688: had significant flux in the highest energy channel of BATSE, i.e. $F_4$. Thus, the retention of $F_4$ in
689: the classification analysis of \S 2 is important even though there is systematic  uncertainty in
690: the measured value of the fluence. Indeed, if $F_4$ is not taken into account the evidence for  three
691: clusters in the BATSE sample decreases.
692: 
693: \section{Summary and Discussion}
694: 
695: 
696: Two multivariate clustering techniques, the K-means partitioning
697: method and the Dirichlet process of mixture modeling, have been applied for the first time to
698: the BATSE Gamma-ray burst (GRB) catalog. These two schemes do not make any a priori assumptions
699: about the number of clusters, but instead provide quantitative estimate of the optimal number of
700: groups. The jump curve for the K-means partitioning method suggests that this optimal number is
701: three which is further supported with the value of  the expected number of clusters, $E\{n(p)\}$, obtained
702: using Dirichlet process of mixture modeling. The two techniques group the GRBs in qualitatively similar
703: classes, which can be described as short bursts ($T_{90} < 2$ s, Cluster I), long duration, low fluence bursts
704: ($F_T < 1.6 \times 10^{-4}/T_{90}$ ergs/cm$^2$, Cluster II) and long duration, high fluence bursts ($F_T > 1.6 \times 10^{-4}/T_{90}$ ergs/cm$^2$, Cluster III).
705: 
706: To estimate how such a classification, based on observed spectral and temporal parameters,
707:  can arise from intrinsic GRB properties, a sample of 21 GRB with known red-shifts and well constrained
708: spectral parameters,  are classified within
709: this scheme. The observed total fluence, $F_T$ and duration $T_{90}$ for
710: these GRBs if they were located at different red-shifts ($0.1 < z < 5$) were estimated. For 16 of the
711: 21 GRB, the estimated fluence would have satisfied $F_T > 1.6 \times 10^{-4}/T_{90}$ ergs/cm$^2$
712: for nearly the entire
713: range of red-shift space and hence they were classified as high fluence bursts.
714: This invariance in red-shift, indicates that the  classification scheme is not strongly effected
715: by observational bias and by the use of observed parameters instead of intrinsic ones. For
716: five GRB, classified as low fluence bursts, the predicted fluence   $F_T < 1.6 \times 10^{-4}/T_{90}$
717: for a significant fraction of the red-shift space, which again signifies the physical nature
718: of the classification. Based on the classification of these GRB with known red-shift, it can
719: be inferred that the low fluence GRB have a  nearly constant isotropic Energy output of $10^{52}$ ergs
720: and have an intrinsic (red-shift corrected) duration of $T_{90} \sim 2$-$30$ secs.
721: On the other hand, the high fluence GRB (Cluster III)
722: have a much wider range of isotropic energy output $10^{52-54}$ ergs and a corresponding wide
723: range of intrinsic durations $10$ - $500$ secs.
724: 
725: 
726: We note with caution that the number of GRB with known red-shifts, used for this analysis
727: is small and a much larger sample is required before concrete conclusions can be drawn. It is
728: also important, for this analysis,  to have well constrained spectral parameters of these GRB.
729: In particular, the peak of the energy fluxes, $E_p$ are required to be well estimated
730: and since for high energetic
731: sources $E_p > 300$ keV, it is imperative to have well calibrated high energy information. Indeed, if the
732: highest energy channel of the BATSE measurement is not taken into account,
733: the significance of the classification is smaller.
734: 
735: The classification presented here, needs to be supported by
736: theoretical considerations. It is tempting to identify the low
737: fluence GRB with neutron star-white dwarf mergers \citep{Kin07}
738: and the higher fluences ones with massive stellar collapse. The
739: near constancy of the isotropic energy output of low fluence
740: bursts, seem to be consistent with them being neutron star-white
741: dwarf mergers. Since both neutron stars and white dwarfs do not
742: have significant mass variations, their initial conditions for the
743: binary merger could be similar, leading to the nearly constant
744: energy output. Moreover, their merger time may also be typically
745: smaller than massive stellar collapse time-scales, which is
746: consistent with the shorter intrinsic duration $2-30$ s, found in
747: this work. On the other hand, the energy output and duration of
748: GRB induced by massive stellar collapse may depend on the mass and
749: size of the progenitor which is consistent with the variation
750: inferred for high fluence bursts. The present observational
751: evidence for such a model is not clear. Evidence for supernova
752: light curves have been detected in GRB with different energy
753: output, including some low luminosity ones, e.g. GRB 0311203, $E
754: \sim 3 \times 10^{49}$ ergs \citep{Mal04}, the nearby GRB 060614,
755: for which no supernova was detected also had a low isotropic
756: energy output of $10^{51}$ ergs. Such low energy output are not
757: represented in the 21 GRB with well constrained spectral
758: parameters used in this analysis which all have energies $>
759: 10^{52}$ ergs. These rare GRB (since they have to be located
760: relatively nearby to be detected) may not represent a significant
761: fraction of the BATSE catalogue. BATSE did detect GRB 980425 which
762: is at low redshift ($z = 0.0086$) and is associated with a supernova. 
763: At this redshift, the GRB would be a borderline case between the high and 
764: low fluence GRB (Figure 2). Thus the interpretation of the
765: two different classes of long bursts as being due to stellar
766: collapse and white dwarf-neutron star mergers, is speculative and
767: more quantitative theoretical predictions and observational
768: evidences are required before a definite conclusion can be made.
769: 
770: 
771: 
772: \acknowledgements
773: 
774: 
775: TC and AKC thank the IUCAA associateship program for
776: support
777: 
778: \begin{thebibliography}{}
779: 
780: \bibitem[Amati et al.(2002)]{Ama02} Amati, L., et al.\ 2002,
781: \aap, 390, 81
782: 
783: \bibitem[Bagoly et al.(1998)]{Bag98} Bagoly, Z., Meszaros, 
784: A., Horvath, I., Balazs, L.~G., \& Meszaros, P.\ 1998, \apj, 498, 342 
785: 
786: \bibitem[Bagoly et al.(2003)]{Bag03} Bagoly, Z., Csabai, I., 
787: M{\'e}sz{\'a}ros, A., M{\'e}sz{\'a}ros, P., Horv{\'a}th, I., Bal{\'a}zs, 
788: L.~G., \& Vavrek, R.\ 2003, \aap, 398, 919 
789: 
790: \bibitem[Balastegui et al.(2001)]{Bal01} Balastegui, A., 
791: Ruiz-Lapuente, P., \& Canal, R.\ 2001, \mnras, 328, 283 
792: 
793: 
794: 
795: \bibitem [Baumgart (1994)]{Bau94} Baumgart,C.W. 1994, in SPIE
796:           Proc. 2243, Applications of Artificial Neural Networks V, eds. S.
797:           K. Rogers \& D. W. Ruck (Bellingham: SPIE), 552.
798: 
799: \bibitem [Blackwell \& MacQueen (1973)]{Bla73} Blackwell, D. \&
800:           MacQueen,J. B. 1973, {\it Annals of Statistics}, 1, 353.
801: 
802: \bibitem[Bloom et al.(2006)]{Blo06} Bloom, J.~S., et al.\ 
803: 2006, \apj, 638, 354 
804: 
805: 
806: 
807: 
808: 
809: 
810: 
811: 
812: 
813: 
814: \bibitem[Dezalay et al.(1992)]{Dez92} Dezalay, J.-P., Barat, 
815: C., Talon, R., Syunyaev, R., Terekhov, O., \& Kuznetsov, A.\ 1992, American 
816: Institute of Physics Conference Series, 265, 304 
817: 
818: 
819: 
820: \bibitem [Escober \& West(1995)]{Esc95} Escobar, M. D. \& West, M. 1995,
821:           {\it Journal of the American Statistical Association}, 90, 577.
822: 
823: \bibitem[Feigelson \& Babu(1998)]{Fei98} Feigelson, E.~D., \& 
824: Babu, G.~J.\ 1998, IAU Symp.~179: New Horizons from Multi-Wavelength Sky 
825: Surveys, 179, 363 
826: 
827: 
828: 
829: \bibitem[Fynbo et al.(2006)]{Fyn06} Fynbo, J.~P.~U., et al.\
830: 2006, \nat, 444, 1047
831: 
832: \bibitem[Gehrels et al.(2005)]{Geh05} Gehrels, N., et al.\ 
833: 2005, \nat, 437, 851 
834: 
835: \bibitem[Gehrels et al.(2006)]{Geh06} Gehrels, N., et al.\
836: 2006, \nat, 444, 1044
837: 
838: \bibitem[Ghirlanda et al.(2004)]{Ghi04} Ghirlanda, G.,
839: Ghisellini, G., \& Lazzati, D.\ 2004, \apj, 616, 331
840: 
841: \bibitem[Hakkila et al.(2000)]{Hak00} Hakkila, J., Haglin, 
842: D.~J., Pendleton, G.~N., Mallozzi, R.~S., Meegan, C.~A., \& Roiger, R.~J.\ 
843: 2000, \apj, 538, 165 
844: 
845: \bibitem[Hakkila et al.(2003)]{Hak03} Hakkila, J., Giblin, 
846: T.~W., Roiger, R.~J., Haglin, D.~J., Paciesas, W.~S., \& Meegan, C.~A.\ 
847: 2003, \apj, 582, 320 
848: 
849: 
850: 
851:  \bibitem [Hartigan (1975)]{Har75} Hartigan, J. A. 1975, Clustering
852:           Algorithms, New York, Wiley.
853: 
854: 
855: \bibitem[Horv{\'a}th (1998)]{Hor98} Horv{\'a}th, I.\ 1998,
856: \apj, 508, 757
857: 
858: \bibitem[Horv{\'a}th (2002)]{Hor02} Horv{\'a}th, I.\ 2002,
859: \aap, 392, 791
860: 
861: 
862: \bibitem[Horv{\'a}th et al.(2006)]{Hor06} Horv{\'a}th, I., 
863: Bal{\'a}zs, L.~G., Bagoly, Z., Ryde, F., \& M{\'e}sz{\'a}ros, A.\ 2006, 
864: \aap, 447, 23 
865: 
866: 
867: 
868: 
869: \bibitem [Ishwaran \& Takahara(2002)]{Ish02} Ishwaran, H. \& Takahara, G., 2002,
870: {\it Journal of the American Statistical Association}, 97, 1154.
871: 
872: \bibitem [Johnson (1996)]{Joh96} Johnson, R. A., \& Wichern, D.W. 1996,
873: Applied Multivariate Statistical Analysis, Prentice Hall of India.
874: 
875: \bibitem[King et al.(2007)]{Kin07} King, A., Olsson, E., \&
876: Davies, M.~B.\ 2007, \mnras, 374, L34.
877: 
878: \bibitem[Kouveliotou et al.(1993)]{Kou93} Kouveliotou, C., 
879: Meegan, C.~A., Fishman, G.~J., Bhat, N.~P., Briggs, M.~S., Koshut, T.~M., 
880: Paciesas, W.~S., \& Pendleton, G.~N.\ 1993, \apjl, 413, L101 
881: 
882: 
883: 
884: \bibitem [MacQueen (1967)]{Mac67} MacQueen, J., 1967, Fifth Berkeley
885:           Symp. Math. Statist. Prob.,1,281.
886: 
887: \bibitem[Malesani et al.(2004)]{Mal04} Malesani, D., et al.\
888: 2004, \apjl, 609, L5
889: 
890: \bibitem[Mukherjee et al.(1998)]{Muk98} Mukherjee, S., 
891: Feigelson, E.~D., Jogesh Babu, G., Murtagh, F., Fraley, C., \& Raftery, A.\ 
892: 1998, \apj, 508, 314 
893: 
894: 
895: \bibitem[Piran(1992)]{Pir92} Piran, T.\ 1992, \apjl, 389, L45 
896: 
897: \bibitem[Piran(2005)]{Pir05} Piran, T.\ 2005, Reviews of
898: Modern Physics, 76, 1143
899: 
900: 
901: \bibitem[Sazonov et al.(2004)]{Saz04} Sazonov, S.~Y.,
902: Lutovinov, A.~A., \& Sunyaev, R.~A.\ 2004, \nat, 430, 646
903: 
904: \bibitem[Schaefer \& Xiao(2006)]{Sch06} Schaefer, B.~E., \&
905: Xiao, L.\ 2006, ArXiv Astrophysics e-prints, arXiv:astro-ph/0608441
906: 
907: \bibitem [Sugar \& James(2003)]{Sug03} Sugar, A. S., \& James, G. M.
908:          2003, JASA, 98, 750.
909: 
910: \bibitem[Soderberg et al.(2004)]{Sod04} Soderberg, A.~M., et
911: al.\ 2004, \nat, 430, 648
912: 
913: \bibitem[Woosley \& Bloom(2006)]{Woo06} Woosley, S.~E., \& 
914: Bloom, J.~S.\ 2006, \araa, 44, 507 
915: 
916: 
917: 
918: \end{thebibliography}
919: 
920: 
921: \end{document}
922: