0111:astro-ph0111195/ms.tex

1: \documentclass[12pt,preprint]{aastex}

2:

3: \slugcomment{Preprint DAMTP-2001-96; April 18th, 2002}

4: \shorttitle{Modified Median Statistics and Type Ia Supernova Data}

5: \shortauthors{P.P. Avelino, C.J.A.P. Martins and P. Pinto}

6:

7: \begin{document}

8: \title{Modified Median Statistics and Type Ia Supernova Data}

9:

10: \author{P.P. Avelino\altaffilmark{1}, C.J.A.P. Martins\altaffilmark{2,3}

11: and P. Pinto\altaffilmark{1}}

12: \affil{Centro de Astrof\'{\i}sica da Universidade do Porto,\\

13: Rua das Estrelas s/n, 4150-762 Porto, Portugal}

14:

15: \altaffiltext{1}{Dep. de F{\' \i}sica da Faculdade de Ci\^encias da

16: Univ. do Porto, Rua do Campo Alegre 687, 4169-007 Porto, Portugal.

17: Email: pedro\,@\,astro.up.pt}

18: \altaffiltext{2}{Department of Applied Mathematics and Theoretical Physics,

19: Centre for Mathematical Sciences, University of Cambridge,

20: Wilberforce Road, Cambridge CB3 0WA, United Kingdom.

21: Email: C.J.A.P.Martins\,@\,damtp.cam.ac.uk}

22: \altaffiltext{3}{Institut d'Astrophysique de Paris,

23: 98 bis Boulevard Arago, 75014 Paris, France}

24:

25: \begin{abstract}

26: The median statistic has recently been discussed by Gott \textit{et al.}

27: as a more reliable alternative to the standard $\chi^2$ likelihood

28: analysis, in the sense of requiring fewer assumptions about the data

29: and being almost as constraining.

30: We apply this statistic to the currently available combined

31: dataset of 92 distant type Ia supernovae, and also to a mock SNAP-class

32: dataset. We find that the performances of the modified median and $\chi^2$

33: statistics are comparable, particularly in the latter case.

34: We further extend the work of Gott \textit{et al.} by modifying the median

35: statistic to account for the number and size of sequences of consecutive

36: points above or below the median. We also comment on how the performance

37: of the statistic depends on the choice of free parameters that one is

38: estimating.

39: \end{abstract}

40:

41: \keywords{Cosmology; data analysis; statistical methods; type Ia supernovae}

42:

43: \section{Introduction}

44:

45: In recent work \citet{Gott:2000mv} have argued, through

46: several convincing examples, that the median statistic is a reliable

47: alternative to the usual $\chi^2$ likelihood analysis. Even though it

48: usually has the caveat of not being as constraining (for the same data set),

49: it has the strong advantage of requiring much weaker assumptions about the

50: dataset itself and its errors. Furthermore, it is also less vulnerable than

51: the mean to the presence of bad data, such as when `outliers' exist.

52: Hence, if nothing else, median statistics can be useful for

53: the early stages of a data analysis pipeline, when one is still trying

54: to put together evidence that may justify the use of stronger hypotheses

55: on the dataset.

56:

57: In this work, we apply the median statistic to the combined dataset of

58: 92 type Ia supernovae taken from the Supernova Cosmology Project

59: (SCP) \citep{Perlmutter:1998np} and the High-z Supernova Search

60: Team (HzST) \citep{Riess:1998cb}\footnote{Note that in

61: \protect \citep{Gott:2000mv} median statistics was only separately applied to

62: earlier versions of each dataset.}. This is a particularly relevant example,

63: since the current data set is still fairly small. Furthermore, the

64: physics of supernova explosions is not at all well known, and hence the

65: possible presence of outliers in the available data is a particular concern.

66:

67: We also apply our results to a SNAP-class \citep{Nugent:2000vs,Weller:2001gf}

68: simulated dataset. We further extend the analysis of \citet{Gott:2000mv}

69: by considering simple modifications  of the median statistic which account

70: for the number and size of sequences of  consecutive points above or below

71: the median (to which the standard median statistic is `blind'). We discuss

72: the dependence of the performance of

73: the statistic on the choice of free parameters that one is estimating and

74: we find that the median statistic can be competitive with the standard

75: $\chi^2$ analysis, provided  one knows the Hubble parameter

76: $H_0$ (or equivalently the absolute magnitude of a standard type Ia

77: supernova).

78:

79: The rest of the paper is organised as follows. In Section 2 we

80: briefly recall the tools necessary to carry out the estimation of the

81: present day  values of the cosmological parameters $\Omega_m$ and

82: $\Omega_\Lambda$ from a type Ia

83: supernova dataset. In Section 3 we introduce the median statistic

84: as a data analysis method and motivate some simple modifications thereof.

85: We then present our results in Section 4, and finally in

86: Section 5 we draw some conclusions and discuss possible further

87: improvements.

88:

89: \section{Cosmological parameters from type Ia supernovae}

90:

91: Following the release of the results from the Supernova Cosmology

92: Project (SCP) \citep{Perlmutter:1998np} and the High-z Supernova Search Team

93: (HzST) \citep{Riess:1998cb}, which altogether include some 100 supernovae,

94: there has been an increased effort towards the parameterisation of the energy

95: content of the Universe using Type Ia Supernovae \citep{Weller:2001gf}.

96:

97: The currently available data, when combined with CMB

98: results \citep{Jaffe:2000tx}

99: indicates that about one third of the critical energy density in the

100: Universe is in the form of ordinary matter (and here we include the classic

101: dark matter), while the remaining two thirds are in the form of a so-called

102: dark energy component, the exact form of which is yet

103: unknown. The cosmological constant $\Lambda$ is arguably the simplest

104: candidate for this dark energy \citep{Bean:2001xy}, though there are various

105: other contenders, from frustrated topological defects \citep{Bucher:1998mh}

106: to a time varying cosmological constant \citep{ratra1,ratra2,ratra3,waga},

107: in particular what is commonly called

108: quintessence \citep{Caldwell:1998ii,Wang:1999fa}.

109: The main problem associated with the cosmological

110: constant is that theoretical predictions of its value are many orders of

111: magnitude off from the experimental results \citep{carroll}. On the other hand,

112: quintessence may suffer from considerable `coincidence' problems.

113:

114: Aiming to help settling the question of the constitution of dark energy, the

115: SNAP (Supernova Acceleration Probe) \citep{Nugent:2000vs}

116: satellite was recently proposed. Its goal is to obtain a supernova dataset

117: more than one order of magnitude larger than currently available datasets,

118: with much-improved control over systematic errors,

119: to redshifts up to about $z \simeq 1.7$.

120: Even though SNAP results are still years away, we can of course

121: simulate the expected results, and thus forecast the impact of this improved

122: dataset on the constraints on the energy density and equation of state of

123: dark energy that permeates the Universe \citep{Weller:2001gf}.

124:

125: As usual, the parameter fit is done through the luminosity distance $d_L$,

126: defined as

127: \begin{equation}

128: {\mathcal F} = \frac{{\mathcal L}}{4\pi d_L{}^2}\,, \label{flux}

129: \end{equation}

130: where ${\mathcal L}$ is the intrinsic luminosity of the source and

131: ${\mathcal F}$ the measured flux. From the Friedmann--Robertson--Walker

132: metric it follows that this distance is given,

133: as a function of redshift $z$, by

134: \begin{equation}

135: d_L = \frac{c(1 + z)}{H_0 \sqrt{|\Omega_k|}}\ S(\sqrt{|\Omega_k|} \int _0^z

136: (\Omega_m(z' + 1)^3 + \Omega_{\Lambda} + (z' + 1)^2

137: \Omega_k)^{-\frac{1}{2}}dz')\,, \label{ludist}

138: \end{equation}

139: where $\Omega_k = 1 - \Omega_m - \Omega_{\Lambda}$, and the function $S$

140: is defined as

141: \begin{equation}

142: S(x) = \left\{

143: \begin{array}{ll}

144: \sin x, & \Omega_k < 0 \\

145: x, & \Omega_k = 0 \\

146: \sinh x, & \Omega_k > 0

147: \end{array} \right.\,.\label{curvature}

148: \end{equation}

149:

150: The apparent magnitude $m$ of a supernova (a parameter more often used than

151: the measured flux ${\mathcal F}$, to which it is related) at a given redshift

152: is then given by

153: \begin{equation}

154: m = M + 5\log \left(\frac{d_L}{{\rm {Mpc}}}\right) + 25\,, \label{mags}

155: \end{equation}

156: $M$ being the absolute magnitude of the supernova (related to its

157: intrinsic luminosity ${\mathcal L}$).

158: Following Wang \citep{Wang:1999bz}, we use results from both the SCP

159: and HzST even though their published datasets differ in presentation.

160: We define the distance modulus

161: \begin{equation}

162: \mu _0 = 5\log \left(\frac{d_L}{{\rm {Mpc}}}\right) + 25\,, \label{distmod}

163: \end{equation}

164: as presented in the HzST results comprising 50 supernovae.

165: Comparatively, the SCP

166: published the estimated effective {\it B}-band magnitude

167: $m_b^{\textrm{eff}}$ for 60 supernovae which relates to the HzST results

168: through

169: \begin{equation}

170: \label{m-mu}

171: m_B^{\textrm{eff}} = M_B + \mu _0\,,

172: \end{equation}

173: where $M_B$ is the peak {\it B}-band absolute magnitude of a standard

174: type Ia supernova.

175: The published results of the SCP and HzST groups have 18 common supernovae,

176: 16 of which are from the Cal\'an--Tololo Survey data \citep{Phillips:1999vh}.

177: If we calculate $M_B$

178: by comparing results for these 18 supernovae (using the results from the HzST

179: estimated by means of the MLCS method) we get

180: \begin{equation}

181: M_B = m_B^{\textrm{eff}} - \mu _0 = -19.33\pm 0.25\,.\label{meff}

182: \end{equation}

183: Assuming the value $M_B = -19.33$ for the absolute luminosity, we convert the

184: SCP results to distance modulus using eqn. (\ref{m-mu}). We then add 42 of

185: these supernovae to the dataset from HzST, leaving out the 18 already present,

186: thus making a total of 92 supernovas.

187:

188: As for future results, our simulation assumes a specific set of parameters

189: based on the results from current available measurements

190: \begin{equation}

191: \Omega_m = 0.3,\quad \Omega_{\Lambda} = 0.7,\quad H_0 = 65.2\ {\rm km\

192: s^{-1}\ Mpc^{-1}}.

193: \end{equation}

194: We have simulated a supernova dataset with $\mu_0$

195: drawn from a Gaussian distribution with an

196: average computed from the above parameters and

197: $\sigma_{\mu_0} = 0.15$ standard

198: deviation, divided in bins with similar characteristics to those of SNAP

199: projections, as described in Table \ref{tabsnap}.

200:

201: \begin{table}

202: \caption{\label{tabsnap}Distribution of type Ia supernovae

203: by redshift bins in the mock SNAP-class simulated dataset.}

204: \begin{tabular}{|c|c|c|}

205: Min. redshift & Max. redshift & Number of SNe \\

206: \hline

207: 0.0 & 0.2 & 50 \\

208: 0.2 & 1.2 & 1800 \\

209: 1.2 & 1.4 & 50 \\

210: 1.4 & 1.7 & 15 \\

211: \end{tabular}

212: \end{table}

213:

214: \section{Standard median statistics, and how to improve it}

215:

216: Type Ia supernova data analysis is usually carried out using a $\chi^2$

217: analysis. Here, however, we shall describe median statistics as an alternative

218: analysis method, and then propose and motivate some simple

219: modifications of it and study the constraints which can be thus obtained.

220: Rigorous descriptions of the standard median statistic can be found in most

221: good statistics textbooks. For a more detailed review emphasizing

222: some aspects relevant to astrophysics see \citep{Gott:2000mv}.

223:

224: Recall that a $\chi^2$ statistical treatment requires that four hypothesis

225: be met, namely (1) that the experimental results are statistically independent;

226: (2) that there are no systematic errors present; (3) that the experimental

227: errors follow a Gaussian distribution; and (4) that the standard deviation of

228: these errors is known.

229:

230: The fewer assumptions one needs to make about a given dataset,

231: the higher will be the confidence in the results derived from it. It turns out

232: that keeping only assumptions (1) and (2) and relaxing the others one

233: can still make quite strong statements.

234: Assuming that the experimental results are statistically independent and that

235: there were no systematic errors made, one expects that upon performing a

236: large number of measurements approximately half of the values obtained

237: will be above the correct mean value (the other half being below it).

238: In the limit of an infinite number of measurements the middle value is,

239: by definition, the correct mean value.\footnote{Though strictly speaking

240: one should keep in mind that one can construct distributions that are

241: pathological enough to violate this.}

242:

243: If each measurement is statistically independent, and with no assumptions

244: about the probability density function (PDF) or standard deviation of the

245: errors, there is a $50 \%$ chance that each measurement will be above (or

246: below) the true median value of the distribution.

247: So, if we perform $n$ measurements, the probability

248: that $k$ of them will be above (or below) the median is simply given by

249: the binomial distribution,

250: \begin{equation}

251: P(k) = \frac{2^{-n}n!}{k!\ (n - k)!}\,.\label{binom}

252: \end{equation}

253: If we take $n$ measurements $M_n$ ordered from the smallest to the largest,

254: in such a way that $M_{i + 1} > M_i$, the probability of finding the median

255: between the measurements $M_i$ and $M_{i + 1}$ is again

256: \begin{equation}

257: P(i) = \frac{2^{-n}n!}{i!\ (n - i)!} \,, \label{probs}

258: \end{equation}

259: where we suppose $M_0 = - \infty$ and $M_{n + 1} = + \infty$.

260:

261: Given a Hubble diagram with the experimental results plotted against a

262: specific set of cosmological parameters,

263: \begin{equation}

264: \label{m_teo}

265: \mu _0(z) = 5 \log  \left(\frac{d_L(z, H_0, \Omega_m,

266: \Omega_\Lambda)}{{\rm {Mpc}}}\right) + 25\,,

267: \end{equation}

268: the relative likelihood associated with that set of parameters can be

269: simply computed by counting the number of points above (or below) the

270: expected curve and using eqn. (\ref{binom}).

271: When assuming Gaussian errors using a $\chi^2$ statistical treatment

272: we benefit from the fact that doing so the

273: precision increases as $n^{-1/2}$, where $n$ is number of measurements.

274: Nonetheless, one can show \citep{Gott:2000mv} that with median statistics, and

275: relaxing assumptions (3) and (4), this result still holds.

276:

277: It should be stressed that even though there is presently no evidence that

278: supernova luminosity errors are not Gaussian, calibrated light-curves are most

279: likely not Gaussian distributed. There are in fact indications that some

280: outliers are not well calibrated with the current methods of luminosity-curve

281: calibration \footnote{It is noteworthy that the use of different methods of

282: luminosity-curve normalisation carries an uncertainty of about

283: $\Delta m\sim 0.15$ magnitudes.}.

284: Median statistics are not as susceptible to these outliers as the more classic

285: $\chi^2$ analysis. \citet{Gott:2000mv} provide various

286: examples of how just one or very

287: few `fluke' data points could seriously distort a $\chi^2$ analysis,

288: and of why median statistics are much less vulnerable to such effects.

289:

290: On the other hand, one should be wary of the fact that when

291: computing probabilities, the median statistic only

292: accounts for the number of experimental points above or below the expected

293: value. It does not differentiate between the various ways in which these

294: points could be distributed. Suppose that one has 10 magnitude \textit{versus}

295: redshift supernova measurements ordered by increasing redshift. A binomial

296: distribution associates a probability to the case where the first five

297: supernovae are brighter than expected and the last five fainter, equal to the

298: case where the first is brighter, the second fainter, the third brighter, the

299: fourth fainter and so forth until the tenth. These two cases should not be

300: indistinct since the first could turn out to be a terrible fit to the

301: data that happened to have exactly half of its points above and the other half

302: below it \footnote{It is a simple example to consider a horizontal

303: swarm of data points crossed by an almost vertical line through the middle,

304: leaving half the points in each side.}.

305:

306: Note that this `sequence blindness' problem also exists, to some extent,

307: for the $\chi^2$ statistic. However, the crucial difference is that in this

308: case the error bars are know, which substantially attenuates the problem.

309:

310: In order to improve the median statistic we will consider some adjustments to

311: its theoretical framework. Instead of just counting the number of data

312: points above or

313: below the model prediction, we also take into account (a) the size of

314: the largest contiguous

315: sequence found above or below the model prediction or (b) the number of

316: sequences obtained. With either of these we expect to more explicitly

317: account for the way in which the model under consideration intersects the

318: experimental data.

319:

320: Let us be more specific about the modifications to the standard median

321: statistic that we are presently proposing. Consider a random variable,

322: $X$, with a probability distribution with median $M$ and a number, $N$,

323: of realizations of that variable. Assume for the sake of illustration the

324: following result of $N=10$ realizations of the variable $X$

325: \begin{equation}

326: (-,+,+,-,+,+,+,-,-,+)

327: \end{equation}

328: where a $+(-)$ means that the particular realization ($X_i$ where $i$ can

329: take any value between $1$ and $N$) of the variable $X$ is above

330: (below) the median of the distribution. In this particular case we can

331: see that there are $6$ sequences the largest of which has $3$ elements

332: $(+,+,+)$. We considered the probability, $P(k,N_s)$ of having at the

333: same time $k$ measurements above (or below) the median and a number, $N_s$,

334: of sequences using a Monte Carlo simulation. We have also considered the

335: probability $P(k,N_l)$ where in this case $N_l$ means the size of the

336: largest sequence. It is also important to refer that in our particular

337: application we consider that the measurements are naturally ordered by

338: increasing redshift.

339:

340: Through a simple Monte Carlo simulation we compute the probabilities

341: $P(k,N_s)$ and $P(k,N_l)$ considering the number of data points

342: above or below the model prediction,

343: as well as the required sequence counting for each of

344: these two alternatives

345: (a) and (b). As we'll show below, these two alterations slightly

346: improve the constraining process. We will also find that the performance of

347: median statistic (and its modifications) strongly depends on the choice of

348: the parameters being fitted for, due to reasons that will become apparent

349: in the discussion.

350:

351: \section{Results}

352:

353: We now proceed to apply median and modified median

354: statistics to the current and SNAP supernova

355: datasets, and present confidence regions for the present day values of

356: the cosmological parameters $\Omega_m$ and $\Omega_\Lambda$.

357: We also compare our results with the usual $\chi^2$ analysis.

358:

359: \subsection{Standard median statistics}

360:

361: Results for the current supernova data set, using standard median and

362: $\chi^2$ statistics are shown in Fig. \ref{fig1} \footnote{This

363: updates the results of \citet{Gott:2000mv}, which only analysed

364: the results of the two supernova groups separately.}.

365: Here we assume the knowledge of the Hubble's constant, which we take to be

366: $H_0 = 65.2\ {\rm km\ s^{-1}\ Mpc^{-1}}$, in

367: agreement with \citet{Riess:1998cb}.

368:

369: In a $\chi^2$ analysis it would be standard procedure to integrate over the

370: Hubble parameter,

371: \begin{equation}

372: P(\Omega_m, \Omega_\Lambda) = \int P(\Omega_m, \Omega_\Lambda, H_0) dH_0,

373: \end{equation}

374: but that is not the case with standard median statistics. In fact

375: possibly the main

376: problem with this method is its inability to cope very well with a

377: multi-dimensional fit specially if it depends on several parameters.

378: Clearly the Hubble parameter plays a very important part in this

379: analysis, and a good knowledge of it is necessary to obtain good results.

380: The factor $- 5\log H_0$, present in the distance modulus definition as an

381: additive constant, can move the zero-redshift

382: point up or down the magnitude scale leaving otherwise the curve

383: unaffected. Recall that the supernovae distance scale depends on

384: the Large Magellanic Cloud's distance modulus, and indeed this is

385: the largest source of systematic error.

386:

387: We note that given a sufficient number of local supernovae

388: it is possible to calibrate the value of $H_0$ since when $z \ll 1$ the

389: distance modulus is simply given by

390: \begin{equation}

391: \mu _0= 5\log \frac{cz}{H_0} + 25\,,

392: \end{equation}

393: independently of the other cosmological parameters. In this way the value

394: $H_0 = 65.2 \pm 1.3\ {\rm km\ s^{-1}\ Mpc^{-1}}$ (only the statistical error

395: is included) was found by HzST \citep{Riess:1998cb} which is in agreement

396: with the HST Key Project result \citep{Freedman:1999wh}.

397:

398: As expected, we find a somewhat larger confidence region in the case of

399: median statistics. Nevertheless, we can still exclude a

400: Universe with a vanishing cosmological constant with more than $99 \%$

401: confidence.

402: Similarly, the confidence region is above the

403: $q_0 = \Omega_m/2-\Omega_\Lambda = 0$ line that separates an

404: accelerated expansion of the Universe from a decelerating one.

405:

406: Fig. \ref{fig2} shows an analogous analysis but now using the SNAP

407: simulated results; again we assume the knowledge of the present day

408: value of the Hubble parameter $H_0$.

409: As expected, both statistics

410: can now accurately pin down a `degeneracy axis' but the error bars within

411: this axis are considerably larger for median statistics.

412:

413: \subsection{Modified median statistics}

414:

415: So far we have done the analysis using the standard median statistics.

416: We now consider the effect of the modifications which we described in

417: the in order to take into consideration

418: (a) the largest sequence or (b) the number of sequences obtained.

419: We shall see that

420: these modifications allow us to slightly improve the constraints on

421: the parameters being estimated.

422:

423: Fig. \ref{fig3} shows the results of

424: analysing a SNAP-class mock dataset using median statistics modified

425: to include either of the two effects (a) or (b).

426:

427: We can see that either modification seems to be

428: more constraining than standard median statistics, as it reduces the

429: error bars within the above-mentioned `degeneracy axis'.

430: This was expected, for the reasons already pointed out above: a fit

431: where points alternate above and below a theoretical line should in

432: principle be better than one where there are long continuous sequences

433: either above or below it.

434:

435: As one varies the present day values of the cosmological matter and

436: vacuum densities, $\Omega_m$ and $\Omega_{\Lambda}$, the luminosity

437: versus redshift curves also change. As a result of this tilting,

438: some of these curves will mostly be above (or below)

439: the data points at high redshift.

440: A $\chi^2$ analysis

441: will immediately disfavour these models. On the other hand, in the case of

442: the standard median, many such models which can `compensate' for this by

443: having a fair amount of points at lower redshift below the data points will

444: still survive. However, if one accounts for the presence of sequences and

445: reduces the likelihood of any model where such sequences are found,

446: then one will be able to reduce the range of allowed models. We have verified

447: that the performance of the modified median statistics upon integration

448: over the Hubble parameter is significantly better than that of standard

449: median statistic but still not competitive with the $\chi^2$ analysis.

450:

451: We also note that the gain from either of the modifications is fairly similar.

452: Of course we could also implement them together if

453: desired. One would obtain a further (slight) improvement, at the expense

454: of having to deal with a somewhat more complicated statistic.

455:

456: \subsection{The flat case}

457:

458: Most inflationary models predict

459: a flat Universe, and this seems to be confirmed by recent

460: results from CMB anisotropy measurements \citep{Jaffe:2000tx}.

461: Using this prior, the precision of the fits is quite substantially

462: increased, as we'll be fitting for a single parameter.

463: This can already be seen in Figs.

464: \ref{fig1}\textendash\ref{fig3} where the diagonal line that

465: intersects the confidence regions represents the combinations of the

466: cosmological parameters $\Omega_m$ and $\Omega_\Lambda$ which correspond

467: to a flat universe.

468:

469: In the case of a flat universe the modifications we made to median

470: statistics do not significantly improve

471: the constraints on $\Omega_m-\Omega_\Lambda$ plane obtained using

472: the conventional median analysis and so we'll restrict ourselves to the

473: standard case.

474:

475: In Fig. \ref{fig4} we show results obtained for both datasets through the

476: methods previously presented, assuming a flat universe prior using the

477: conventional median statistic and the $\chi^2$ statistic.

478: We present these results in a more convenient form

479: in table \ref{median_1D_results}.

480:

481: For a flat Universe we obtain $\Omega_m = 0.29^{+0.03}_{-0.10}$ (not including

482: the uncertainty in the Hubble parameter) using currently available

483: supernovae and simple median statistics.

484: For these 92 supernovae the median statistic is marginally less competitive

485: than the $\chi^2$ statistic and one notices that the median results are

486: slightly skewed towards low values of $\Omega_m$. Nevertheless the $90 \%$

487: confidence upper limit for $\Omega_m$ is slightly lower in the case of

488: median statistics and the confidence intervals still overlap nicely.

489:

490: Clearly the SNAP results are expected to significantly improve the

491: constraints on the energy density of the Universe. Note that the results

492: obtained with median statistics are almost indistinguishable from those

493: obtained with the standard $\chi^2$ analysis in this case.

494:

495: This analysis clearly show that the median and the

496: standard $\chi^2$ statistics have quite similar performances if the

497: assumptions about a Gaussian distribution for the errors and the

498: estimate of the standard deviations are correct. If this is not the

499: case, then obviously the results obtained with the median statistic are

500: the more reliable ones.

501:

502: Indeed, one could use this information to reverse the argument

503: and use both statistics together as a test on the assumptions being made

504: on the data. If in the case of SNAP the median and $\chi^2$ statistics

505: do not agree, then this indicate that either the errors do not have a

506: Gaussian distribution or that one is somehow underestimating

507: the statistical and/or systematic errors.

508:

509: \begin{table}

510: \caption{\label{median_1D_results}Values of $\Omega_m $ obtained

511: through median statistics for supernova data

512: (both the current and the SNAP datasets) with a flat universe prior.

513: Results for the usual $\chi^2$ statistic are also shown for comparison.

514: Here a value of $H_0 = 65.2\ {\rm km\ s^{-1}\ Mpc^{-1}}$ was

515: assumed for the Hubble parameter}

516: \begin{tabular}{|c|c|c|}

517: Statistic               & Current data set      & SNAP data set       \\

518: \hline

519: Median & \( 0.29^{+0.03}_{-0.10} \) & \( 0.301^{+0.004}_{-0.005} \) \\

520: Chi Squared  & \( 0.28^{+0.05}_{-0.04} \) & \( 0.300^{+0.004}_{-0.003} \) \\

521: \end{tabular}

522: \end{table}

523:

524: \section{Conclusions}

525:

526: We have discussed standard and modified median statistics in the context

527: of current and forthcoming type Ia supernova data sets. The purpose of the

528: modifications is to reduce some of its weaknesses, mainly by accounting for

529: the number and size of sequences of consecutive points above or

530: below the median. We found that in some circumstances the performances of the

531: median and $\chi^2$ statistics can be comparable, and if used together they

532: provide a simple test on the assumptions being made on the data.

533:

534: The main problem with the standard median statistics analysis is its

535: inability to cope well with a multi-dimensional fit specially if it

536: is dependent on several parameters. This is due to the very simple

537: assumptions it makes. On the other hand, when confronted

538: with a single parameter to fit, the ensuing results can be of similar

539: precision to the ones obtained with a $\chi^2$ analysis.

540:

541: Another advantage of median statistics is that it is an analysis method

542: which is extremely easy to implement. So even in the cases where

543: it is not expected to produce results as constraining as a $\chi^2$ analysis,

544: it can be used to complement it or to provide fast fits in the early stages

545: of the analysis, notably if one is still trying to gather supporting

546: evidence for the use of stronger hypothesis about the dataset.

547: Using median statistics we no longer have to suppose

548: that the errors follow a Gaussian distribution with known standard deviation,

549: and can therefore have greater confidence in the parameter ranges obtained.

550: This is an important concern for the particular case of type Ia supernovae:

551: recall that when studying them we are considering renormalised light curves,

552: and that the calibration data set (of nearby supernovae) is smaller than the

553: main data set (of distant supernovae). The median statistic is also less

554: sensitive to systematic effects such as weak lensing.

555:

556: We have studied some simple means by which to improve median statistics,

557: namely accounting for the size of the largest sequence or the number of

558: sequences present in the dataset above or below the model prediction,

559: Our adjustments did provide some improvement, even if in a

560: our confidence regions are still larger than that obtained

561: from a $\chi^2$ study. We note however that other more `baroque'

562: modifications are certainly conceivable. Other possibilities, which certainly

563: deserve further work, are to study how similar procedures could be

564: used to improve the standard $\chi^2$ analysis (which is sequence blind),

565: and to apply median statistics to other cosmological data sets,

566: most notably the cosmic microwave background. We shall report on these

567: issues in a forthcoming publication.

568:

569: \acknowledgments

570:

571: C.M. is funded by FCT (Portugal), under grant no. FMRH/BPD/1600/2000.

572: We are grateful to Alessandro Melchiorri, Gra\c ca Rocha and Paul Shellard

573: for useful comments and suggestions.

574:

575: \begin{thebibliography}{}

576: \bibitem[Bean \& Melchiorri(2002)]{Bean:2001xy}

577: Bean, R. \& Melchiorri, A. 2002, \prd 65, 041302

578: \bibitem[Bucher(1999)]{Bucher:1998mh}

579: Bucher, M. 1999, \prd 60, 043505

580: \bibitem[Caldwell, Dave, \& Steinhardt(1998)]{Caldwell:1998ii}

581: Caldwell, R. R., Dave, R. \& Steinhardt, P.1998, \prl 80, 1582

582: \bibitem[Carroll, Press \& Turner(1992)]{carroll}

583: Carroll, S., Press, W. \& Turner, E.L. 1992, Ann. Rev. Astron. Astrophys. 30, 499

584: \bibitem[Freedman(2000)]{Freedman:1999wh}

585: Freedman, W. L. 2000, Phys. Reports 333, 13

586: \bibitem[Gott et al.(2001)]{Gott:2000mv}

587: Gott, J. R. {\em et al.} 2001, \apj 549, 1

588: \bibitem[Jaffe et al.(2001)]{Jaffe:2000tx}

589: Jaffe, A. H. {\em et al.} 2001, \prl 86, 3475

590: \bibitem[Nugent(2000)]{Nugent:2000vs}

591: Nugent, P. 2000, SNAP: Supernova acceleration probe: An experiment to measure the

592: properties of the accelerating universe. See homepage at http://snap.lbl.gov

593: \bibitem[Peebles \& Ratra(1988)]{ratra1}

594: Peebles, P.J.E. \& Ratra, B. 1988, \apj 325, L17

595: \bibitem[Perlmutter et al.(1999)]{Perlmutter:1998np}

596: Perlmutter, S. {\em et al.} 1999, \apj 517, 565

597: \bibitem[Phillips et al.(1999)]{Phillips:1999vh}

598: Phillips, M. M. {\em et al.} 1999, \aj 118, 1766

599: \bibitem[Podariu \& Ratra(2000)]{ratra3}

600: Podariu, S. \& Ratra, B. 2000, \apj 532, 109

601: \bibitem[Ratra \& Peebles(1988)]{ratra2}

602: Ratra, B. \& Peebles, P.J.E. 1988, \prd 37, 3406

603: \bibitem[Riess et al.(1998)]{Riess:1998cb}

604: Riess, A. {\em et al.} 1998, \aj 116, 1009

605: \bibitem[Waga \& Frieman(2000)]{waga}

606: Waga, I. \& Frieman, J. 2000, \prd 62, 043501

607: \bibitem[Wang et al.(2000)]{Wang:1999fa}

608: Wang, L. {\em et al.} 2000, \apj 530, 17

609: \bibitem[Wang et al.(2000)]{Wang:1999bz}

610: Wang, Y. {\em et al.} 2000, \apj 536, 531

611: \bibitem[Weller \& Albrecht(2001)]{Weller:2001gf}

612: Weller, J., and Albrecht, A. 2001, astro-ph/0106079

613:

614: \end{thebibliography}

615:

616:

617: \clearpage

618:

619: \begin{figure}

620: \plottwo{f1a.eps}{f1b.eps}

621: \caption{The $68 \%$, $95 \%$ and $99 \% $ confidence regions

622: resulting from standard

623: median and $\chi^2$ statistics in the $\Omega_m-\Omega_{\Lambda}$ plane,

624: using the 92 supernovae from the combined SCP and HzST results. A value of

625: $H_0 = 65.2\ {\rm km\ s^{-1}\ Mpc^{-1}}$ was assumed for the Hubble

626: parameter (see main text). \label{fig1}}

627: \end{figure}

628:

629: \begin{figure}

630: \plottwo{f2a.eps}{f2b.eps}

631: \caption{The $68 \%$, $95 \%$ and $99 \% $ confidence

632: regions resulting from standard

633: median and $\chi^2$ statistics in the $\Omega_m-\Omega_{\Lambda}$ plane,

634: using the SNAP-class dataset. A value of

635: $H_0 = 65.2\ {\rm km\ s^{-1}\ Mpc^{-1}}$ was assumed for the Hubble

636: parameter (see main text). \label{fig2}}

637: \end{figure}

638:

639: \begin{figure}

640: \plottwo{f3a.eps}{f3b.eps}

641: \caption{The $68 \%$, $95 \%$ and $99 \% $ confidence

642: regions resulting from modified

643: median statistics in the $\Omega_m-\Omega_{\Lambda}$ plane,

644: using the SNAP-class dataset. A value of

645: $H_0 = 65.2\ {\rm km\ s^{-1}\ Mpc^{-1}}$ was assumed for

646: the Hubble parameter (see main text).

647: The first plot shows the results obtained using median statistics modified to

648: account for the largest sequence, while in the second we consider the number

649: of sequences. \label{fig3}}

650: \end{figure}

651:

652: \begin{figure}

653: \plottwo{f4a.eps}{f4b.eps}

654: \caption{Marginal distributions for $\Omega_m$ obtained using the standard

655: median (dot-dashed) and $\chi^2$ (solid line) statistics,

656: assuming a flat Universe, for the current and

657: the SNAP-class datasets. The $68 \%$, $95 \%$ and

658: $99 \%$ confidence limits are where the curves drop below the

659: (dashed) lines of constant $p$ (0.68,0.95 and 0.99 respectively).

660: Note that a value of $H_0 = 65.2\ {\rm km\ s^{-1}\ Mpc^{-1}}$ was

661: assumed for the Hubble parameter. \label{fig4}}

662: \end{figure}

663:

664: \end{document}

665: