0704.3601/ms.tex
1: %\documentclass[manuscript]{aastex}
2: %\documentclass[preprint2]{aastex}
3: %\documentclass[12pt,preprint]{aastex}
4: %\documentclass[10pt,preprint]{aastex}
5: \documentclass[]{emulateapj}
6: 
7: \newcommand{\myemail}{kanbur@oswego.edu}
8: \def\ni{\noindent}
9: 
10: %\slugcomment{To be determined}
11: 
12: \shorttitle{Cepheid P-L Relation with Testimator \& SIC}
13: \shortauthors{Kanbur et al.}
14: 
15: \begin{document}
16: 
17: \title{Investigations of the Non-Linear LMC Cepheid Period-Luminosity Relation with Testimator and Schwarz Information Criterion Methods}
18: 
19: \author{S. M. Kanbur}
20: \affil{Department of Physics, State University of New York at Oswego, Oswego, NY 13126}
21: 
22: \author{C. Ngeow}
23: \affil{Department of Astronomy, University of Illinois, Urbana, IL 61801}
24: 
25: \author{A. Nanthakumar and R. Stevens}
26: \affil{Department of Mathematics, State University of New York at Oswego, Oswego, NY 13126}
27: 
28: 
29: \begin{abstract}
30: In this paper, we investigate the linearity versus non-linearity of the Large Magellanic Cloud (LMC) Cepheid period-luminosity (P-L) relation using
31: two statistical approaches not previously applied to this problem: the testimator method and the Schwarz Information Criterion (SIC).
32: The testimator method is extended to multiple stages for the first time,
33: shown to be unbiased and the variance of the estimated slope can be proved to be smaller
34: than the standard slope estimated from linear regression theory.
35: The Schwarz Information Criterion (also known as the Bayesian Information Criterion) is more conservative than the Akaike Information
36: Criterion and tends to choose lower order models.
37: By using simulated data sets, we verify that these statistical techniques can be 
38: used to detect intrinsically linear and/or non-linear P-L relations. These methods are then applied to independent 
39: LMC Cepheid data sets from the OGLE project and the MACHO project, respectively. Our results imply that there is a change of slope 
40: in longer period ranges for all of the data sets. This strongly supports previous results, 
41: obtained from independent statistical tests, that the observed LMC P-L relation is non-linear with a break period at/around 10 days. 
42: 
43: \end{abstract}
44: 
45: \keywords{Cepheids --- distance scale -- Stars: fundamental parameters --  methods: statistical}
46: 
47: \section{Introduction}
48: 
49: The cornerstone of the extra-galactic distance scale is the Cepheid Period-Luminosity (P-L) relation defined by the 
50: Large Magellanic Cloud (LMC) Cepheids. The assumed linear relation of the LMC Cepheid P-L relation, which is linear 
51: in $\log(P)$, with $P$ the pulsation period in days, has been under debate due to recent results that this 
52: relation could be non-linear \citep{tam02,kan04,san04,nge05}. These authors contended that the existing Cepheid data in the LMC 
53: strongly suggested the LMC P-L relation is consistent with two lines of significantly differing slopes with a break 
54: at/around a period of 10 days. This is referred as the non-linearity of the Cepheid P-L relation in this paper. 
55: Arguments for choosing the fiducial period at 10 days can be found in \citet{kan04}, \citet{san04}, \citet{nge05} and \citet{nge06a}. 
56: Furthermore, \citet{kan04,kan06}, \citet{san04}, \citet{nge05} and \citet{nge06} examined various factors that may cause the 
57: non-linearity of the LMC P-L relation, including the observing strategies, photometric errors, extinction errors,
58: removal of outliers, sample selection, number of long period Cepheids in the samples and
59: contamination of overtone Cepheids. They found that none of these remedies or any combination of them
60: could be responsible for the observed non-linear LMC P-L relation. As 
61: argued in \citet{nge06}, rigorous statistical tests are needed to test the linearity versus the non-linearity of the LMC 
62: P-L relation.
63: 
64: In our previous studies, the $F$-test \citep[e.g.][]{wei80} has been applied to the OGLE (Optical Gravitational Lensing 
65: Experiment) and MACHO (MAssive Compact Halo Objects project) Cepheid data, in \citet{kan04} and \citet{nge05} respectively, 
66: to test for the non-linearity of the LMC P-L relation. In such a formulation, the full and reduced models are models with 
67: four and two parameters respectively. This test looks at the change in the mean residual sum of squares between the full 
68: and reduced model divided by the mean residual sum of squares in the full model (see equation [5] of \citealt{kan04}). This test 
69: statistic can be formulated as the difference in slopes between short and long period slopes divided by the standard error 
70: of that difference. Hence if the number and nature of the long/short period data are such that the long/short period slope 
71: is estimated with a large error, then the $F$-value will be low and return a non-significant result. Thus the $F$-test 
72: is sensitive to the number of data points on either side of the period cut at 10 days. The OGLE and MACHO data sets 
73: we used in \citet{kan04} and \citet{nge05}, respectively, do have adequate number of long and short period Cepheids for 
74: the application of the $F$-test. The $F$-test has returned a significant result when testing the non-linearity of the P-L 
75: relation in both of the data sets.
76: 
77: Nevertheless, the results that suggesting a non-linear LMC P-L relation are still controversial. As we emphasize that statistical tests are needed, however claims of linear LMC P-L relation in the literature lack of rigorous statistical tests. In this paper, we apply two additional statistical tests, the testimator method and the Schwarz Information Criterion method, to examine the non-linearity of the LMC Cepheid P-L relation. These tests will be complementary to the $F$-test carried out in previous studies since they will serve to check and verify the results obtained from the $F$-test. In this way previous conclusions about the non-linear LMC P-L relation are considerably strengthened. Furthermore, both testimator and Schwarz Information Criterion methods are also able to estimate the break period without any a priori assumption: recall that in previous work, the break period at 10 days is usually adopted. These two methods not only can be applied for Cepheid studies, as we did in this paper, but also to other astronomical and astrophysical hypothesis testing problems. We also emphasize that our use of the testimator has, for the first time, been generalized to more than two stages and hence is also a statistical result in its own right. In the next section, we outline these techniques in detail and their application to our problems. In Section 3 we apply these methods to LMC Cepheid data and present our results. The conclusions and discussion are given in the last section.
78: 
79: 
80: \section{The Statistical Methods}
81: 
82: \subsection{The Testimator}
83: 
84: The concept of a testimator (or test estimator) was first proposed by \citet{ban44} in the context of estimating a parameter 
85: where a prior guess will be used in place of the estimator of an unknown parameter. The testimator can be applied if the prior 
86: guess for the unknown parameter can be ascertained by a test of hypothesis, otherwise the traditional estimator will be used. 
87: Due to its superior efficiency compared to traditional estimators, the testimator method has been adapted and refined to suit 
88: other situations by \citet{pau50}, \citet{hun55}, \citet{ban64}, \citet{arn72}, \citet{boc73}, \citet{han78}, 
89: \citet{gho88}, \citet{yan89}, \citet{pan90}, \citet{pan95}, \citet{pan97} and \citet{pan01} to name a few. \citet{wai84} and 
90: \citet{wai01}, in work on two-stage shrinkage estimation, proposed a weighted testimator by placing a weight $1-k$ on the prior 
91: guess and weight $k$ on the traditional estimator, where $k$ is the probability that the guess will be true. They showed that 
92: the testimators have far superior efficiency and therefore are more powerful in estimating unknown parameters. This weighted
93: two stage testimation concept can be extended to cover multiple stages. In this paper we apply this ``weighted'' testimator to 
94: investigate the non-linearity of the LMC Cepheid P-L relation as mentioned in the Introduction. 
95: 
96: The description of the two-stage testimator method is summarized as follows. 
97: For a linear regression of the form of $y={\beta} x + a$, the usual least square estimation of the slope to $N$ data points is 
98: given as
99: 
100: \begin{eqnarray}
101: \hat{\beta} & = & {{\sum_{i=1}^{N}(x_i-\bar{x})(y_i-\bar{y})}\over{\sum_{i=1}^{N}(x_i-\bar{x})^2}}, 
102: \end{eqnarray}
103: 
104: \ni where $\bar{x}=N^{-1}\sum x_i$ and $\bar{y}=N^{-1}\sum y_i$ are the mean values of $x$ and $y$, respectively. In the standard
105: hypothesis testing procedure, the null and alternate hypotheses are constructed as $H_0: {\beta} = \beta_0$ and 
106: $H_a: {\beta} \neq \beta_0$, respectively, where $\beta_0$ is the assumed value of (true) slope given the prior knowledge on the
107: slope. For example, $\beta_0$ can be predicted from theoretical calculations. In case that the (true) variance of the slope is known, 
108: the $z$-statistical test (with normal-distribution) can be applied, otherwise the $t$-statistical test (with $T$-distribution) will be 
109: used for the hypothesis testing. In general the variance is not known, therefore we adopt the $t$-statistical test in this paper.
110: If the null hypothesis is accepted from the hypothesis testing, the testimator (of the slope), $\hat{\beta_\omega}$, is calculated as 
111: \citep{wai84}:
112: 
113: \begin{eqnarray}
114: \hat{\beta}_{\omega} = k{\hat {\beta}} + (1-k){\beta_0}.
115: \end{eqnarray}
116: 
117: \ni The constant $k$ in the above equation is defined as 
118: 
119: \begin{eqnarray}
120: k & = & \frac{|t_{\mathrm observed}|}{t_{\mathrm critical}}, \\ 
121: t_{\mathrm observed} & = & \frac{\hat{\beta}-\beta_0}{\sqrt{MSE/S_{XX}}}, \nonumber \\
122: t_{\mathrm critical} & = & t_{\alpha/2,\nu}, \nonumber
123: \end{eqnarray}
124: 
125: \ni where $MSE=(N-2)^{-1}\sum^{N}_{i=1}(y_i-\hat{a}-\hat{\beta}x_i)^2$, $S_{XX}=\sum_{i=1}^{N}(x_i-\bar{x})^2$ and $t_{\alpha/2,\nu}$
126: is the $t$-value for $100(1-\alpha/2)$\% confidence interval obtained from the associated $T$-distribution table with $\nu=N-2$ degree of 
127: freedom. Note that the null hypothesis is rejected if $k>1$. The properties of the testimator are such that:
128: 
129: \begin{enumerate}
130: \item The testimator is an unbiased estimator under $H_0$. 
131: \item The testimator has a smaller variance than the usual least square estimator, that is $\mathrm{Var}(\hat{{\beta}_{\omega}}) < \mathrm{Var}(\hat{{\beta}})$. 
132: \end{enumerate}
133: 
134: \ni The proofs for these two properties are given in the Appendix.
135: 
136: \subsubsection{Application to the Cepheid P-L Relation}
137: 
138: \begin{figure}
139: %\epsscale{.80}
140: \plotone{f1.eps}
141: \caption{Illustration of the testimator procedures. The Cepheid data points are divided to several subsets, sorted according to the $\log(P)$. For the first round, two slopes from the first and second subsets are compared under the hypothesis testing. The testimator, $\hat{\beta_\omega}$, is calculated if the null hypothesis is accepted. In the second round, the testimator from the previous round will be used to compare the estimated slope from the third subset. This is repeated several times until all the subsets have been used up or the null hypothesis is rejected. \label{testimator}}
142: \end{figure}
143: 
144: The motivation of this paper is to apply the testimator method to detect any non-linearity in the LMC P-L relation; this has been
145: detected using the $F$-test \citep{kan04,nge05}. To study any possible variation in slope as the period increases through
146: 10 days, we first
147: sorted the data according to period, from shortest to longest period in $\log(P)$. The sorted sample is then divided into $m$ number of
148: non-overlapping and hence independent subsets according to the Cepheid period. The purpose is to make the bi-variate 
149: observations independent for each of the subsets. Each of the subsets will then contain $n$ numbers of Cepheids (if the number of data points in 
150: the last subset is small, then the last subset will be combined with the previous subset). This enables us to apply the testimator method in multiple
151: stages, together with a conservative Bonferroni testing procedure\footnote{The Bonferroni testing procedure states that
152: for testing $n_g$ number of hypotheses, 
153: the confidence coefficient $(1-\alpha/2)$ is replaced by $(1-\alpha/2n_g)$ in each of the hypothesis testings. This is to ensure that the overall 
154: confidence coefficient will not be less than the original desired value of $(1-\alpha/2)$.}, for detecting such a slope variation in the sample. 
155: In essence the line of attack is to 
156: compute the slope of the first subset and then compare with the slope of the next subset. If the two slopes are ``similar'', we look at the slope of the 
157: third subset with the smoothed slope obtained from a combination of all the previous subsets.
158: Hence, at the $i^{th}$ round, the slope of a given subset of the data is computed and compared 
159: with the smoothed slope from the testimator of all the previous data points. If the two slopes are statistically equivalent, then the current
160: subset of data will be incorporated into the computation of the smoothed slope and compared with the slope of the next subset of data. This smoothness 
161: is an important feature since it helps to alleviate, to some extent, the influence of outliers.  However, if the slopes are ``different'', i.e. a 
162: rejection of the null hypothesis, then there is an indication of slope change in the P-L relation. Therefore there will be a total of $n_g=m-1$ number 
163: of hypothesis testings in the multi-stage testimator procedures. 
164: In short, the algorithm of applying  the testimator method in our case can be summarized as follows:
165: 
166: \begin{itemize}
167: \item[a.] In the first round, the slope of first subset, ${\beta}_1$, is calculated and denoted by $\hat{\beta}_1=\beta_0$. The slope of the second subset is then compared to ${\beta}_0$ under the null hypothesis of $H_0: {\beta}_2 = {\beta}_0$ and alternate hypothesis of $H_a: {\beta}_2\neq {\beta}_0$. If $H_0$ is accepted, then the testimator in this round, $\hat{{\beta}_{\omega}}^1$, is calculated using equation (2).
168: 
169: \item[b.] In the second round, the slope of the third subset, ${\beta}_3$, is calculated and denoted by $\hat{\beta}_3$. The testimator from the first round, represented as $\hat{{\beta}_{\omega}}^1=\beta_0$, is used in the hypothesis testing for this round. 
170: The null and alternate hypotheses in this round become $H_0: {\beta}_3 ={\beta}_0$ and $H_a:{\beta}_3 \neq {\beta}_0$.
171: If $H_0$ is accepted, a new testimator, $\hat{\beta_\omega}^2$, is calculated using equation (2).
172: 
173: \item[c.] In the $i^{th}$ round, the slope of the $(i+1)^{th}$ subset, ${\beta}_{i+1}$, estimated by $\hat{\beta}_{i+1}$, is calculated. The testimator from previous ($i-1$) round is denoted as $\hat{{\beta}_{\omega}}^{i-1}=\beta_0$. The null and alternative hypothesis in this round become $H_0:{\beta}_{i+1} = {\beta}_0$ and $H_a:{\beta}_{i+1} \neq {\beta}_0$. If $H_0$ is accepted, then $\hat{\beta_\omega}^i=k\hat{\beta}_{i+1}+(1-k)\beta_0$ with $k$ refined from equation (3). 
174: 
175: \item[d.] This is repeated until $i=n_g$ round or the null hypothesis is rejected in the $i^{th}$ round, which indicates a change in slope for the $(i+1)^{th}$ subset. 
176: 
177: \item[e.] Since in principle there will be a total of $n_g$ hypothesis testings, the Bonferroni testing procedure requires that 
178: $t_{\mathrm critical}=t_{\alpha/2n_g,\nu}$ in each round.
179:  
180: \end{itemize}
181: 
182: \ni Throughout the paper, we will adopt $\alpha=0.05$ to ensure the overall confidence level is more than 95\% in our test. 
183: The first two rounds of our testimator procedures to the study the possible non-linear LMC P-L relation is illustrated in Figure \ref{testimator}.
184: 
185: \begin{deluxetable*}{cccccccccc}
186: \tabletypesize{\scriptsize}
187: %\rotate
188: \tablecaption{Testimator results for the fake data sets.\label{tab1}}
189: \tablewidth{0pt}
190: \tablehead{
191: \colhead{Subset} & \colhead{Period range} & \colhead{$n$} & \colhead{$\hat{\beta}$} & \colhead{$\beta_0$} 
192: & \colhead{$|t_{\mathrm observed}|$} & \colhead{$t_{\mathrm critical}$} 
193: & \colhead{$k$} & \colhead{Decision} & \colhead{$\hat{\beta_{\omega}}$} \\
194: \colhead{(1)} & \colhead{(2)} & \colhead{(3)} & \colhead{(4)} 
195: & \colhead{(5)} & \colhead{(6)} & \colhead{(7)} & \colhead{(8)} & \colhead{(9)}
196: & \colhead{(10)}
197: }
198: \startdata
199: \multicolumn{10}{c}{``Fake'' data set from a linear P-L relation} \\
200: 1 & 0.2315-0.4421 & 200 & $-2.182\pm0.403$ & ---    & ---   & ---   &  ---  & ---          & ---    \\
201: 2 & 0.4422-0.5005 & 200 & $-3.658\pm0.949$ & $-2.182$ & 1.556 & 2.718 & 0.572 & accept $H_0$ & $-3.027$ \\
202: 3 & 0.5006-0.5508 & 200 & $-1.955\pm1.128$ & $-3.027$ & 0.951 & 2.718 & 0.350 & accept $H_0$ & $-2.652$ \\
203: 4 & 0.5512-0.6079 & 200 & $-3.006\pm1.025$ & $-2.652$ & 0.345 & 2.718 & 0.127 & accept $H_0$ & $-2.697$ \\
204: 5 & 0.6080-0.7349 & 200 & $-2.733\pm0.442$ & $-2.697$ & 0.081 & 2.718 & 0.030 & accept $H_0$ & $-2.698$ \\
205: 6 & 0.7349-0.9610 & 200 & $-2.841\pm0.234$ & $-2.698$ & 0.611 & 2.718 & 0.225 & accept $H_0$ & $-2.730$ \\
206: 7 & 0.9610-1.3553 & 200 & $-2.493\pm0.155$ & $-2.730$ & 1.531 & 2.718 & 0.563 & accept $H_0$ & $-2.597$ \\
207: 8 & 1.3652-2.6170 & 100 & $-2.684\pm0.095$ & $-2.597$ & 0.921 & 2.748 & 0.335 & accept $H_0$ & $-2.626$ \\
208: \multicolumn{10}{c}{``Fake'' data set from a non-linear P-L relation} \\
209: 1 & 0.2315-0.4421 & 200 & $-2.442\pm0.403$ & ---    & ---   & ---   &  ---  & ---          & ---    \\
210: 2 & 0.4422-0.5005 & 200 & $-3.918\pm0.949$ & $-2.442$ & 1.556 & 2.718 & 0.572 & accept $H_0$ & $-3.287$ \\
211: 3 & 0.5006-0.5508 & 200 & $-2.215\pm1.128$ & $-3.287$ & 0.951 & 2.718 & 0.350 & accept $H_0$ & $-2.912$ \\
212: 4 & 0.5512-0.6079 & 200 & $-3.266\pm1.025$ & $-2.912$ & 0.345 & 2.718 & 0.127 & accept $H_0$ & $-2.957$ \\
213: 5 & 0.6080-0.7349 & 200 & $-2.993\pm0.442$ & $-2.957$ & 0.081 & 2.718 & 0.030 & accept $H_0$ & $-2.958$ \\
214: 6 & 0.7349-0.9610 & 200 & $-3.101\pm0.234$ & $-2.958$ & 0.611 & 2.718 & 0.225 & accept $H_0$ & $-2.990$ \\
215: 7 & 0.9610-1.3553 & 200 & $-2.170\pm0.155$ & $-2.990$ & 5.281 & 2.718 & 1.943 & reject $H_0$ & ---    
216: \enddata
217: \tablecomments{See text for the description for each columns. Period ranges are given in $\log(P)$.}
218: \end{deluxetable*}
219: 
220: In order to demonstrate the reliability of this procedure, we apply the testimator method to two simulated data sets: one built from a non-linear P-L 
221: relation with a break at 10 days and another one developed from a linear P-L relation. For demonstration purpose, we select one set of the simulated data (out of many simulations) in each cases as representation for testing the testimator method.
222: The plots of these two fake data sets, each of them containing 1500 data points, can be found in figure 1 of \citet{nge06}. 
223: Full details of our procedure for developing these two ``fake'' 
224: data sets are described in \citet{nge06}. The results of applying the testimator procedures as described to these two fake data sets are given in Table \ref{tab1}. 
225: In this table, column 1 denotes the subsets; column 2 gives the range of the period in each subsets; column 3 lists the number of data points, $n$,
226: in each subsets; column 4 \& 5 are the fitted slopes in each subsets and the assigned values of $\beta_0$ that used in the hypothesis testing; column 6 \& 7 are 
227: the observed and critical $t$-values for each of the hypothesis testing; column 8 \& 9 are the corresponding $k$-value and the outcome of the 
228: hypothesis testing; finally column 10 is the values of testimator if the null hypothesis is accepted. Since we know which fake data set is intrinsic 
229: linear and non-linear when constructing the P-L relation, we can verify the results found in Table \ref{tab1}. For the fake data with linear P-L
230: relation, our testimator results show that the slopes for each subsets are consistent with the smoothed slopes given from the previous subsets, and
231: the hypothesis testings correctly indicate that there is no changes in slope across all the period ranges. In contrast, the hypothesis testings for the 
232: fake data with non-linear P-L relation show that subset 7 has a different slopes than the previous subsets, which indicates a change of slope in this 
233: subset. Furthermore, the testimator procedures also correctly pick up the ``break period'' in subset 7, which brackets
234: the input break period at 10 days, from the outcome of hypothesis testing. Therefore the testimator method can pick up the P-L relation which is 
235: intrinsically non-linear.
236: 
237: \subsection{The Schwarz Information Criterion}
238: 
239: The problem of deciding whether the LMC Cepheid data are more consistent with two lines of significantly different slopes rather than a 
240: single line is exactly analogous to deciding the dimensionality of the model that will fit the given LMC Cepheid data.
241: The method of maximizing the likelihood tends to choose the highest possible dimension. \citet{aka74} suggested maximizing the likelihood
242: subject to a penalty depending on the dimensionality of the model under consideration (Akaike Information Criterion, AIC): $AIC=-2\ln L + 2k_p$,
243: where $L$ is the likelihood function of the model of dimension $k_p$ \citep[see][as an example in the application of astronomy]{tak00}. However, \citet{sch78} showed that maximum likelihood
244: estimators can be obtained from large sample limits of Bayes estimates for certain classes of a priori distributions. These distributions only
245: put positive probability on the subspaces of the parameter space corresponding to the competing models. \citet{sch78} derived the following
246: criterion (Schwarz Information Criterion, SIC; or sometimes also referred as Bayesian Information Criterion, BIC, in the literature): choose the model for which 
247: 
248: \begin{eqnarray}
249: SIC = -2\ln L + k_p\ln N
250: \end{eqnarray}
251: 
252: \ni is a minimum, where $N$ is the total number of data points and $k_p=p+1$ \citep[with $p$ being the number of fitted parameters, see][]{sch78}. Some use of the BIC for models selection in astronomical and astrophysical literature can be found, for examples, in \citet{are01}, \citet{han00,han02}, \citet{koe96,koen99,koe06}, \citet{koe99}, \citet{koe00}, \citet{koe93,koe03}, \citet{lid04,lid07}, \citet{muk98}, \citet{por06} and \citet{ste99}.
253: 
254: \subsubsection{Application to the Cepheid P-L relation}
255: 
256: To test the non-linearity of the Cepheid P-L relation with the SIC method, we consider the models with a linear P-L relation (the null hypothesis) and a non-linear P-L relation with a break period (in days) at $P_0$ (the alternate hypothesis) in this paper. For the former case, we have:
257: 
258: \begin{eqnarray}
259: H_0: m = \hat{m} & = & \hat{\beta} \log(P) + \hat{a},  \nonumber \\
260:  & &   \mathrm{with} \ \hat{\sigma}^2=\frac{1}{N-2}\sum_{i=1}^{i=N} (m_i-\hat{m}_i),  \nonumber \\
261: L & = & \frac{1}{(\sqrt{2\pi \hat{\sigma}^2})^N} \exp{[-\frac{1}{2\hat{\sigma}^2} \sum^{N}_{i=1} (m_i - \hat{m_i})^2]}, \nonumber \\
262: SIC(H_0) & = & -2\ln L + 3\ln N. \nonumber
263: \end{eqnarray}
264: 
265: \ni Similarly, for the alternate model, we have:
266: 
267: \begin{eqnarray}
268: H_A: m = \hat{m}  & = &\left\{  \begin{array}{l}
269:                    \hat{\beta}_S \log(P) + \hat{a}_S, \ \mathrm{for} \log(P) < \log(P_0),  \\
270:                      \ \ \ \ \mathrm{with}\ \hat{\sigma}_S^2=\frac{1}{N_S-2}\sum_{i=1}^{i=N_S} (m_i-\hat{m}_i),  \\
271:                    \hat{\beta}_L \log(P) + \hat{a}_L, \ \mathrm{for} \log(P) \geq \log(P_0),  \\
272:                      \ \ \ \ \mathrm{with}\ \hat{\sigma}_L^2=\frac{1}{N_L-2}\sum_{i=1}^{i=N_L} (m_i-\hat{m}_i),
273:                   \end{array}\right. \nonumber 
274: \end{eqnarray}
275: \begin{eqnarray}
276: L & = & \frac{1}{(\sqrt{2\pi})^N}\frac{1}{(\hat{\sigma}_S)^{N_S}}\frac{1}{(\hat{\sigma}_L)^{N_L}} \nonumber \\ 
277: & & \exp{[-\frac{1}{2\hat{\sigma}_S^2} \sum^{N_S}_{i=1} (m_i - \hat{m_i})^2 -\frac{1}{2\hat{\sigma}_L^2} \sum^{N_L}_{i=1} (m_i - \hat{m_i})^2]}, \nonumber 
278: \end{eqnarray}
279: \begin{eqnarray}
280: SIC(H_A)  =  -2\ln L + 5\ln N. \nonumber
281: \end{eqnarray}
282: 
283: \ni In these expressions, $N_S+N_L=N$ and $m$ is the observed magnitudes after correcting for extinction. The slope ($\beta$) and zero-point ($a$) parameters in the above models are obtained from the maximum likelihood estimation (MLE, which is equivalent to standard least square estimation in our case). Note that the sample variance ($\sigma^2$) from MLE is a biased estimate. We corrected the bias with $N_{(L,S)}-2$ degrees of freedom. For the alternate models, $SIC(H_A)$ is calculated with a range of $\log(P_0)$ that increment in steps of, for example, $0.001$. Therefore, a model with linear P-L relation and a range of models with non-linear P-L relations at different break period are tested with the SIC method. The model with smallest $SIC$ value is the preferred model. In case of $SIC(H_A)<SIC(H_0)$,  the minimum value of $SIC(H_A)$ not only suggests that the P-L relation is non-linear, but also gives an estimate of the break period. 
284: 
285: To test the SIC method, the same simulated data sets as in the case of the testimator method in Section 2.1.1 were used. For the ``fake'' data set a with linear P-L relation, the values of $SIC(H_0)$ and $SIC(H_A)$ is $-164.65$ and $-161.21$, respectively. While for the ``fake'' data set with non-linear P-L relation, we found $SIC(H_0)=-100.56$ and the minimum value of $SIC(H_A)=-154.62$ occurs at $\log(P_0)=0.983$. We then test the SIC method for our application further with various simulations. We first ran two sets of simulations: one set of simulations use the linear P-L relation as input P-L relation, and another set of simulations include the non-linear P-L relation with a break at $\log(P_0)=1.0$. These simulations mimic the period distribution and the observed dispersions along the P-L relation from the real data. The details for constructing these simulations can be found in \citet{nge06}. In either sets of the simulations, a large number of simulations is run (typically 1000) and the break period (in $\log[P_0]$) is searched with the SIC method. If the break period cannot be found then this implies the linear P-L relation is the preferred model, and vice versa. The top panels of Figure \ref{simu_dist} display the distributions of the break periods from these two sets of data. For the case of a linear P-L relation, the SIC method did not find any break period $\sim90$\% of the time. While for the case of no-linear P-L relation, the SIC method detects a range of break period with a peak at $\log(P_0)\sim1.0$. Therefore, the SIC method can be used to correctly identify the P-L relation that is either intrinsically linear or non-linear at a given break period. 
286: 
287: \begin{figure*}
288: %\epsscale{.80}
289: \plottwo{f2a.eps}{f2b.eps}
290: \caption{Distributions of the estimated break periods, $\log(P_0)$, using the SIC method with various simulations. The top panels show the histograms from two simulations at which the input P-L relations to the simulation is linear (dashed histogram) and non-linear with a break period at $\log(P_0)=1.0$ (thick histogram), respectively. The bottom panels show the histograms from two additional simulations using the same non-linear P-L relation with a break period again at $\log(P_0)=1.0$ as input P-L relation: one simulation has uniform distribution of the periods (in $\log[P]$) in the simulated data, and another simulation did not include the intrinsic dispersion of the P-L relation. The right panels show the blown-up region of the left panels. \label{simu_dist}}
291: \end{figure*}
292: 
293: The relatively large dispersion around $\log(P_0)\sim1.0$ and the long tail toward shorter period that is exhibited in the top panels of Figure \ref{simu_dist} can be due to a combination of two effects: (1) the existence of the intrinsic dispersion along the P-L relation; and (2) the non-uniform distribution of the periods in the data (see \citealt{nge05} and \citealt{nge06} for more discussion about the period distribution for Cepheid variables). To portray the impact of these effects on the application of the SIC method for detecting the break period, we ran two additional experiments. The first retains the original period distribution but the intrinsic dispersion of the P-L relation is excluded (however the random photometric errors still persist in the simulation), while the second simulation use a uniform period distribution (in $\log[P]$) and the intrinsic dispersion of the P-L relation is not excluded. The resulted distributions of the break period from SIC method are presented in the bottom panels of Figure \ref{simu_dist}. It can be seen that the long tail of the distribution present in the top panels is reduced when a uniform period distribution is assumed. Furthermore, if the intrinsic dispersion does not exist in the Cepheid P-L relation, then the SIC method is very efficient to detect the intrinsic break period (at $\log[P_0]=1.0$ in our case). In reality, the intrinsic dispersion along the P-L relation cannot be eliminated or reduced (at least in the optical bands) and the period distribution of the Cepheid variables will not be uniform (for the reasons discussed in \citealt{nge06}). We emphasize that the theoretical pulsation modelings are needed to identify the location of the break period or to confirmation the break period at $\log(P_0)\sim1.0$ \citep{nge05}. 
294: 
295: 
296: \section{Data and Results}
297: 
298: In this section, we apply both the testimator and SIC methods to the real LMC Cepheids data in order to investigate whether the $V$-band Cepheid P-L 
299: relation at {\it mean light} is non-linear or not. We concentrate on the $V$-band mean light data because the data 
300: available in the literature are mostly in the $V$-band mean light and also because of the evidence for non-linearity as a function of phase is clear 
301: \citep{nge06a}. The data sets we used in this study include the OGLE data adopted from \citet{kan06} and the 
302: MACHO data adopted from \citet{nge05}. They are referred as the ``OGLE'' sample (with 641 Cepheids) and the ``MACHO'' data (with 1216 Cepheids), 
303: respectively. Note that both data sets have been corrected for extinction using the method described in the corresponding papers. It
304: is also important to point out that these two are independent data sets. To investigate the influence of longer period stars in our testing
305: as well as increasing the number of Cepheids in the OGLE sample, we append the data from \citet{seb02} to the OGLE sample after proper removal of 
306: duplication of the Cepheids in both samples and the correction of extinction. This third data set is called ``OGLE+SEBO'' sample (with 723 Cepheids) and 
307: it extends to $\log(P)\sim2.0$. 
308: 
309: 
310: \begin{deluxetable*}{cccccccccc}
311: \tabletypesize{\scriptsize}
312: %\rotate
313: \tablecaption{Testimator results for the real data sets.\label{tab2}}
314: \tablewidth{0pt}
315: \tablehead{
316: \colhead{Subset} & \colhead{Period range} & \colhead{$n$} & \colhead{$\hat{\beta}$} & \colhead{$\beta_0$} 
317: & \colhead{$|t_{\mathrm observed}|$} & \colhead{$t_{\mathrm critical}$} 
318: & \colhead{$k$} & \colhead{Decision} & \colhead{$\hat{\beta_{\omega}}$} \\
319: \colhead{(1)} & \colhead{(2)} & \colhead{(3)} & \colhead{(4)} 
320: & \colhead{(5)} & \colhead{(6)} & \colhead{(7)} & \colhead{(8)} & \colhead{(9)}
321: & \colhead{(10)}
322: }
323: \startdata
324: \multicolumn{10}{c}{OGLE sample, Test 1} \\
325: 1 & 0.4022-0.4771 & 100 & $-1.427\pm0.967$ & ---    & ---   & ---   &  ---  & ---          & ---    \\
326: 2 & 0.4787-0.5293 & 100 & $-2.273\pm1.399$ & $-1.427$ & 0.605 & 2.627 & 0.230 & accept $H_0$ & $-1.622$ \\
327: 3 & 0.5294-0.5889 & 100 & $-0.746\pm1.095$ & $-1.622$ & 0.800 & 2.627 & 0.304 & accept $H_0$ & $-1.355$ \\
328: 4 & 0.5891-0.6703 & 100 & $-1.887\pm0.675$ & $-1.355$ & 0.788 & 2.627 & 0.300 & accept $H_0$ & $-1.515$ \\
329: 5 & 0.6704-0.7891 & 100 & $-3.055\pm0.703$ & $-1.515$ & 2.193 & 2.627 & 0.835 & accept $H_0$ & $-2.801$ \\
330: 6 & 0.7900-1.6768 & 141 & $-2.462\pm0.082$ & $-2.801$ & 4.106 & 2.612 & 1.572 & reject $H_0$ & --- \\
331: \multicolumn{10}{c}{OGLE sample, Test 2} \\
332: 1 & 0.4022-0.5043 & 150 & $-2.547\pm0.647$ & ---    & ---   & ---   &  ---  & ---          & ---    \\
333: 2 & 0.5043-0.5889 & 150 & $-1.783\pm0.641$ & $-2.547$ & 1.193 & 2.421 & 0.493 & accept $H_0$ & $-2.171$ \\
334: 3 & 0.5891-0.7083 & 150 & $-2.347\pm0.401$ & $-2.171$ & 0.438 & 2.421 & 0.181 & accept $H_0$ & $-2.203$ \\
335: 4 & 0.7103-1.6768 & 191 & $-2.590\pm0.075$ & $-2.203$ & 5.139 & 2.415 & 2.128 & reject $H_0$ & --- \\
336: \multicolumn{10}{c}{OGLE+SEBO sample, Test 1} \\
337: 1 & 0.4022-0.4746 & 100 & $-0.989\pm0.882$ & ---    & ---   & ---   &  ---  & ---          & ---    \\
338: 2 & 0.4752-0.5242 & 100 & $-2.476\pm1.202$ & $-0.989$ & 1.237 & 2.693 & 0.459 & accept $H_0$ & $-1.672$ \\
339: 3 & 0.5245-0.5729 & 100 & $-4.743\pm1.339$ & $-1.672$ & 2.292 & 2.693 & 0.851 & accept $H_0$ & $-4.286$ \\
340: 4 & 0.5734-0.6469 & 100 & $-2.743\pm0.907$ & $-4.286$ & 1.701 & 2.693 & 0.632 & accept $H_0$ & $-3.311$ \\
341: 5 & 0.6491-0.7320 & 100 & $-2.921\pm0.933$ & $-3.311$ & 0.418 & 2.693 & 0.155 & accept $H_0$ & $-3.250$ \\
342: 6 & 0.7330-0.9071 & 100 & $-3.315\pm0.400$ & $-3.250$ & 0.162 & 2.693 & 0.060 & accept $H_0$ & $-3.254$ \\
343: 7 & 0.9112-2.1268 & 123 & $-2.497\pm0.089$ & $-3.254$ & 8.535 & 2.682 & 3.181 & reject $H_0$ & --- \\
344: \multicolumn{10}{c}{OGLE+SEBO, Test 2} \\
345: 1 & 0.4022-0.4977 & 150 & $-2.545\pm0.546$ & ---    & ---   & ---   &  ---  & ---          & ---    \\
346: 2 & 0.4891-0.5729 & 150 & $-2.826\pm0.706$ & $-2.545$ & 0.398 & 2.529 & 0.157 & accept $H_0$ & $-2.589$ \\
347: 3 & 0.5734-0.6831 & 150 & $-2.557\pm0.432$ & $-2.589$ & 0.073 & 2.529 & 0.029 & accept $H_0$ & $-2.588$ \\
348: 4 & 0.6831-0.9071 & 150 & $-3.153\pm0.253$ & $-2.588$ & 2.234 & 2.529 & 0.883 & accept $H_0$ & $-3.087$ \\
349: 5 & 0.9112-2.1268 & 123 & $-2.497\pm0.089$ & $-3.087$ & 6.651 & 2.536 & 2.623 & reject $H_0$ & --- \\
350: \multicolumn{10}{c}{MACHO sample} \\
351: 1 & 0.4008-0.4715 & 200 & $-2.391\pm0.958$ & ---    & ---   & ---   &  ---  & ---          & ---    \\
352: 2 & 0.4719-0.5226 & 200 & $-1.843\pm1.189$ & $-2.391$ & 0.462 & 2.601 & 0.178 & accept $H_0$ & $-2.294$ \\
353: 3 & 0.5231-0.5787 & 200 & $-2.623\pm1.127$ & $-2.294$ & 0.292 & 2.601 & 0.112 & accept $H_0$ & $-2.331$ \\
354: 4 & 0.5795-0.6851 & 200 & $-1.851\pm0.809$ & $-2.331$ & 0.594 & 2.601 & 0.228 & accept $H_0$ & $-2.222$ \\
355: 5 & 0.6588-0.7891 & 200 & $-2.948\pm0.524$ & $-2.222$ & 1.385 & 2.601 & 0.533 & accept $H_0$ & $-2.608$ \\
356: 6 & 0.7910-1.4501 & 216 & $-2.123\pm0.122$ & $-2.608$ & 3.991 & 2.599 & 1.536 & reject $H_0$ & --- 
357: \enddata
358: \end{deluxetable*}
359: 
360: The results from using the testimator method to these three LMC Cepheid data sets are summarized in Table \ref{tab2}, with identical layout as Table \ref{tab1}.
361: In the case for the OGLE and OGLE+SEBO data sets, we have tried different sample subset sizes
362: by dividing the
363: samples to $n=100$ and $n=150$, which are referred as Test 1 and Test 2 in the table, respectively. In all cases, the testimator method implies that there 
364: is a change of slope in the last subset of the samples. Similar results found from Test 1 and Test 2 suggest that our results are not affected by the 
365: size of each subset. This indicates the LMC P-L relation becomes non-linear as the period increases through 10 days to longer periods. 
366: The last subset also brackets the fiducial break period at/around 10 days: this is consistent with previous work of \citet{nge05}.
367: 
368: 
369: \begin{figure}
370: %\epsscale{.80}
371: \plotone{f3.eps}
372: \caption{Results of the $SIC(H_0)$ and $SIC(H_A)$ as a function of the choosing break period, $\log(P_0)$, for the three LMC Cepheid data sets. The thick horizontal lines are the results for $SIC(H_0)$, which are independent of the chosen break period. The ``curves'' are the results for $SIC(H_A)$. The horizontal dotted lines represent the chosen break period in the literature \citep[e.g.,][]{tam02,kan04,san04,nge05}. \label{sic_result}}
373: \end{figure}
374: 
375: \begin{figure}
376: %\epsscale{.80}
377: \plotone{f4.eps}
378: \caption{Resulted histograms from the bootstrap re-sampling at the break period given in Table \ref{tab3} for the three LMC Cepheid data sets. See text for details. \label{sic_hist}}
379: \end{figure}
380: 
381: 
382: The results from using the SIC method are presented in Figure \ref{sic_result} and Table \ref{tab3} for the same data sets. In Figure \ref{sic_result}, the values of SIC for both $SIC(H_0)$ and $SIC(H_A)$ are plotted as a function of the chosen break period, $\log(P_0)$. Since $SIC(H_0)$ is independent of $\log(P_0)$, this represents a straight horizontal line in the figure, and the values of $SIC(H_0)$ for these three data sets are given in Table \ref{tab3}. For the case of $SIC(H_A)$ as a function of $\log(P_0)$, the figure bears witness to the fact that there is a range of $\log(P_0)$ at which the values of $SIC(H_A)$ are smaller than $SIC(H_0)$ in all three data sets. This implies that the non-linear P-L relation is preferred within these period ranges. This result also reinforces the findings of Figure \ref{simu_dist} that it is difficult to determine the exact break period of the P-L relation with SIC method (see Section 2.2.1 as well), if it is present. The minimum values for $SIC(H_A)$ found from the figure, and the corresponding $\log(P_0)$ are summarized in Table \ref{tab3} as well. The confidence intervals for the break period can be estimated using bootstrap re-sampling methods. For the model with given $\log(P_0)$ in  Table \ref{tab3}, the errors of the regression, $\epsilon_i=m_i-\hat{m}_i$, are randomly drawn (with replacement) to construct a ``new'' data set, and a new break period is estimated. This is repeated many times to build up the distribution of the break periods. The resulting histograms for these three sets of data are presented in Figure \ref{sic_hist}. From these distributions, the $5^{\mathrm{th}}$-, $25^{\mathrm{th}}$-, $75^{\mathrm{th}}$- and $95^{\mathrm{th}}$-percentile are estimated for each of the data sets. The results are given in the last four columns of Table \ref{tab3}. At first glance the break period found from the MACHO data seems to be inconsistent with the OGLE and OGLE+SEBO results. This is due to the difficulty of accurately estimating the break period with the existence of the instability strip. To demonstrate this, we use the {\it exact} periods in MACHO data as input periods to our simulations, and generate three different sets of simulations: (1) a simulation with intrinsic non-linear P-L relation; (2) a simulation with linear P-L relation; and (3) a simulation with intrinsic non-linear P-L relation but without the intrinsic dispersion. The resulting histograms for these three sets of simulation are displayed in Figure \ref{macho}. From this figure it is clear that our result of the break period for MACHO data does not imply an inconsistency to the OGLE and OGLE+SEBO results. The break period found in the data, $\log(P_0)=0.833$, is within the range of the break periods found from the simulations. This figure also portray the difficulty of estimating the break period from real data when the intrinsic dispersion along the P-L relation is present. Therefore, the break periods given in Table \ref{tab3} are consistent with the results from testimator (Table \ref{tab2}), the result from non-linear estimation procedure applied in \citet[][$\log(P_0)=0.934$ with upper and lower 95\% confidence level of 1.089 and 0.778, respectively]{nge05} and the adopted $\log(P_0)=1.0$ in the literature. Note that in previous studies \citep[e.g.,][]{tam02,kan04,kan06,san04,nge05,nge06c} the break period is conveniently chosen to be at 10 days, which is represented as dotted vertical line in Figure \ref{sic_result}. The SIC results also supported the non-linear P-L relation to be the preferred model at $\log(P_0)=1.0$. 
383: 
384: \begin{figure}
385: %\epsscale{.80}
386: \plotone{f5.eps}
387: \caption{Comparisons of the histograms from three sets of simulations for the MACHO data: (1) a simulation that takes a non-linear P-L relation with a break at $\log(P_0)=1.0$ as input P-L relation and the intrinsic dispersion is included; (2) a simulation with a linear P-L relation as input P-L relation and the intrinsic dispersion is included; and (3) a simulation that takes a non-linear P-L relation with a break at $\log(P_0)=1.0$ as input P-L relation but without the intrinsic dispersion. Unlike other simulations done in this paper, the periods that go into the simulations are from the {\it actual} MACHO data.  \label{macho}}
388: \end{figure}
389: 
390: \begin{deluxetable*}{lccccccc}
391: \tabletypesize{\scriptsize}
392: %\rotate
393: \tablecaption{SIC results for the real data sets.\label{tab3}}
394: \tablewidth{0pt}
395: \tablehead{
396: \colhead{Dataset} & \colhead{$SIC(H_0)$} & \colhead{$SIC(H_A)$} & \colhead{$\log(P_0)$} 
397: & \colhead{$5^{\mathrm{th}}$-percentile} & \colhead{$25^{\mathrm{th}}$-percentile}
398: & \colhead{$75^{\mathrm{th}}$-percentile} & \colhead{$95^{\mathrm{th}}$-percentile}
399: }
400: \startdata
401: OGLE sample                 & $-179.29$ & $-188.86$ & 1.041 & 0.550 & 1.002 & 1.041 & 1.101 \\
402: OGLE+SEBO sample            & $-296.43$ & $-304.28$ & 1.041 & 0.560 & 0.922 & 1.052 & 1.131 \\
403: MACHO sample                & $182.61$  & $156.20$  & 0.833 & 0.806 & 0.826 & 0.838 & 0.936 
404: \enddata
405: \end{deluxetable*}
406: 
407: \section{Conclusion and Discussion}
408: 
409: Using two additional statistical approaches, the method of testimators and SIC, to the OGLE, the OGLE+SEBO and the MACHO Cepheids data, we have found strong statistical 
410: evidence for a change of slope in the Cepheid P-L relation in the LMC at longer period range. These results also strongly support the previous results 
411: obtained from the $F$-test. Therefore, the {\it observed} LMC P-L relation is non-linear based on these rigorous statistical tests. This implies that either
412: the LMC P-L relation is indeed non-linear or there are some hidden factors in the analysis \citep[see][for more discussion on this]{nge06}. Furthermore, the break periods, or the range of permissible break periods found from this study are consistent with the conveniently chosen break period at 10 days in previous studies. However, our study, both with real and fake data, implies that it is difficult to accurately estimate the break period with both the testimator and  SIC methods. This is due to the existence of the intrinsic dispersion along the P-L relation. The confirmation of the break period at/around 10 days has to be done from stellar pulsation modeling studies. 
413: 
414: The implications 
415: of a non-linear LMC P-L relation on the extra-galactic distance scale and the Hubble constant have been discussed in \citet{nge05w} and \citet{nge06c} and will 
416: not be repeated here.
417: A number of authors, including \citet{spe06}, \citet{teg06}, \citet{mac06}, \citet{oll06} and the reference therein, have commented on how an independent estimate of 
418: the Hubble constant accurate to $1\%$ can significantly reduce the error bars on ${\Omega}$, the total density of the universe. Applying the correct form of the 
419: Cepheid P-L relation will help in reducing the systematic error of the Hubble constant in the future studies \citep{nge06c,nge06}. 
420: Over and above this, if the 
421: Cepheid P-L relation does indeed have a change of slope at 10 days, it is important to understand this from a stellar
422: pulsation and evolution point of view 
423: and investigate fully the ramifications of this new feature \citep{kan06,mar05}. 
424: 
425: \citet{nge06} has investigated various factors that may cause the observed non-linear LMC  P-L relation, including the influence of outliers and lack of
426: longer period Cepheids in the sample. However the results from that study suggest that none of these factors are
427: responsible for the observed non-linear LMC P-L relation.
428: We emphasize that the samples we used in the our studies have been cleaned up for obvious outliers. Further, the testimator approach estimates the slope 
429: with a variance which is smaller than the standard formula (property 2 stated in Section 2.1.1) is able to minimize the effect of (additional) outliers by smoothing.
430: Regarding the lacks of longer period Cepheids in the sample, we have use the OGLE+SEBO as an expansion sample to the OGLE sample with the increase 
431: of period coverage. Both of the samples have shown the same testimator ans SIC results. Therefore, we believe this should not be the cause for the observed 
432: non-linear LMC P-L relation. 
433: 
434: \acknowledgements
435: 
436: The authors would like to thank the referee for useful suggestions. This research was supported in part by NASA through the American Astronomical Society's Small Research Grant Program. 
437: 
438: \appendix
439: \section{Proof for the Properties of the Testimator}
440: 
441: We prove the two properties of the testimator as described in Section 2.1 here. To prove that the testimator is an unbiased estimator under 
442: $H_0$, we note that the testimator from equation (1) is:
443: 
444: $$\hat{\beta}_{\omega} = k({\hat{\beta}} - {\beta}_0) + {\beta}_0,$$
445: 
446: \ni where $k$ is defined in equation (3). Therefore the above expression can be re-written as:
447: 
448: \begin{eqnarray}
449: \hat{\beta}_{\omega} = {{|\hat{\beta} - {\beta}_0|(\hat{\beta} - {\beta}_0)}\over{t_{{\alpha}/2,\nu} \sqrt{MSE/S_{XX}}}} + \beta_0.
450: \end{eqnarray}
451: 
452: \ni This implies that,
453: 
454: $$E({\hat{{\beta}_{\omega}}}) = {\sqrt{S_{XX}}\over{t_{{\alpha}/2,\nu}}}E({{1}\over{\sqrt{MSE}}}) E[|{\hat{{\beta}}} - {\beta}_0|({\hat{{\beta}}}-{\beta}_0)] + \beta_0.$$
455: 
456: \ni Since $E(|z|z)=0$ for variable $z=\hat{\beta} - \beta_0$ with standard normal distribution, and from the above expression, we obtain
457: 
458: $$E({\hat{{\beta}_{\omega}}}) = {\beta}_0$$
459: 
460: \ni as desired. The second assertion states that $\mathrm{Var}(\hat{{\beta}_{\omega}}) < \mathrm{Var}(\hat{{\beta}})$. To proof this, we first re-arrange equation (A1) such that:
461: 
462: $$(\hat{\beta}_{\omega} - \beta_0)^2 = \frac{(\hat{\beta} - \beta_0)^4}{t^2_{\alpha/2,\nu}\ MSE} S_{XX}.$$
463: 
464: \ni Assume $\hat{\beta}$ is normally distributed with $N(\beta_0,\sigma_\beta)$ and define $Z=\frac{\hat{\beta} - {\beta}_0}{\sigma_{\hat{\beta}}}$, 
465: then $Z$ has a standard
466: normal distribution with $N(0,1)$. Note that $\sigma^2_{\hat{\beta}}\equiv \mathrm{Var}(\hat{\beta})=\sigma^2/S_{XX}$, where $\sigma^2$ is the variance of the 
467: linear regression $y={\beta} x + a$, the above expression is reduced to:
468: 
469: $$ (\hat{\beta}_{\omega} - \beta_0)^2 = Z^4\frac{\sigma^2}{MSE} \frac{1}{t^2_{\alpha/2,\nu}}\frac{\sigma^2}{S_{XX}}.$$
470: 
471: \ni Hence, we have 
472: 
473: \begin{eqnarray}
474: Var(\hat{\beta}_{\omega}) & = & E(Z^4) E(\frac{\sigma^2}{MSE}) \frac{1}{t^2_{\alpha/2,\nu}} \frac{\sigma^2}{S_{XX}},
475: \end{eqnarray}
476: 
477: \ni as the last two terms are constants and $\mathrm{Var}(\hat{\beta}_{\omega})\equiv E[(\hat{\beta}_{\omega} - \beta_0)^2]$. For $E(Z^4)$, since the
478: forth moment of a standard normal distribution (the Kurtosis) is 3, then $E(Z^4)=3$. For $E(\frac{\sigma^2}{MSE})$, we observe that 
479: $\frac{\sigma^2}{MSE}=\frac{1}{MSE/\sigma^2}=(N-2)/\sum(\frac{y_i-\hat{a}-\hat{\beta}x_i}{\sigma})^2$. Therefore, $(N-2)MSE/\sigma^2$ is $\chi^2$ distributed 
480: with $\nu=N-2$ degree of freedom. It is well-known that if $X$ is $\chi^2$ distributed with $\nu$ degree of freedom, then $E(1/X)=1/(\nu-2)$, hence 
481: $E(\frac{\sigma^2}{MSE})=(N-2)/(N-4)$. Recall that $\sigma^2/S_{XX}=\mathrm{Var}(\hat{\beta})$, then equation (A2) is reduced to:  
482: 
483: $$ \mathrm{Var}(\hat{\beta}_{\omega}) = 3\frac{(N-2)}{N-4}\frac{1}{t^2_{\alpha/2,\nu}} \mathrm{Var}(\hat{\beta}).$$
484: 
485: \ni If $t_{\alpha/2,\nu}>\sqrt{3(N-2)/(N-4)}$, we then have
486: 
487: $$\mathrm{Var}(\hat{{\beta}_{\omega}}) < \mathrm{Var}(\hat{{\beta}})$$
488: 
489: \ni as the assertion states. Due to the Bonferroni testing procedure, condition $t_{\alpha/2n_g,\nu}>\sqrt{3(N-2)/(N-4)}$ is satisfied when $N>5$ and ${\alpha}<0.1$.
490: 
491: \begin{thebibliography}{}
492: \bibitem[Akaike(1974)]{aka74} Akaike, H., 1974, IEEE Transactions on Automatic Control, 19, 716
493:  
494: \bibitem[Arentoft et al.(2001)]{are01} Arentoft, T., Sterken, C. \& Handler, G., 2001, \mnras, 326, 192
495: 
496: \bibitem[Arnold \& Katti(1972)]{arn72} Arnold, J. C. \& Katti, S. K., 1972, Journal of Multivariate Analysis, 2, 236
497: 
498: \bibitem[Bancroft(1944)]{ban44} Bancroft, T. A., 1944, Annals of Mathematical Statistics, 15, 190
499: 
500: \bibitem[Bancroft(1964)]{ban64} Bancroft, T. A., 1964, Biometrics, 20, 427
501: 
502: \bibitem[Bock et al.(1973)]{boc73} Bock, M. E., Yancey, T. A. \& Judge, G. G., 1973, Journal of the American Statistical Association, 68, 109
503: 
504: \bibitem[Ghosh \& Sinha(1988)]{gho88} Ghosh, M. \& Sinha, B. K., 1988, Journal of Multivariate Analysis, 27, 206
505: 
506: \bibitem[Han(1978)]{han78} Han, C. P., 1978, Journal of American Statistical Association, 73, 855
507: 
508: \bibitem[Handler et al.(2000)]{han00} Handler, G., et al., 2000, \mnras, 318, 511
509: 
510: \bibitem[Handler et al.(2002)]{han02} Handler, G., Romero-Colmenero, E. \& Montgomery, M. H., 2002, \mnras, 335, 399
511: 
512: \bibitem[Huntsberger(1955)]{hun55} Huntsberger, D. V., 1955, Annals of Mathematical Statistics, 26, 734
513: 
514: \bibitem[Kanbur \& Ngeow(2004)]{kan04} Kanbur, S. \& Ngeow, C., 2004, \mnras, 350, 962
515: 
516: \bibitem[Kanbur \& Ngeow(2006)]{kan06} Kanbur, S. \& Ngeow, C., 2006, \mnras, 369, 705
517: 
518: \bibitem[Koen(1996)]{koe96} Koen, C., 1996, \mnras, 283, 471
519: 
520: \bibitem[Koen(1999)]{koen99} Koen, C., 1999, \mnras, 309, 769
521: 
522: \bibitem[Koen(2006)]{koe06} Koen, C., 2006, \mnras, 365, 489
523: 
524: \bibitem[Koen \& Lombard(1993)]{koe93} Koen, C. \& Lombard, F., 1993, \mnras, 263, 287
525: 
526: \bibitem[Koen \& Schumann(1999)]{koe99} Koen, C. \& Schumann, R., 1999, \mnras, 310, 618
527: 
528: \bibitem[Koen \& Laney(2000)]{koe00} Koen, C. \& Laney, D., 2000, \mnras, 311, 636
529: 
530: \bibitem[Koen \& Lombard(2003)]{koe03} Koen, C. \& Lombard, F., 2003, \mnras, 343, 241
531: 
532: \bibitem[Liddle(2004)]{lid04} Liddle, A., 2004, \mnras, 351, L49
533: 
534: \bibitem[Liddle(2007)]{lid07} Liddle, A., 2007, astro-ph/0701113
535: 
536: \bibitem[Macri et al.(2006)]{mac06} Macri, L., Stanek, K., Bersier, D., Greenhill, L. \& Reid, M., 2006, \apj, 652, 1133
537: 
538: \bibitem[Marconi et al.(2005)]{mar05} Marconi, M., Musella, I. \& Fiorentino, G., 2005, \apj, 632, 590
539: 
540: \bibitem[Mukherjee et al.(1998)]{muk98} Mukherjee, S., Feigelson, E. D., Jogesh Babu, G., Murtagh, F., Fraley, C. \& Raftery, A., 1998, \apj, 508, 314
541: 
542: \bibitem[Ngeow \& Kanbur(2005)]{nge05w} Ngeow, C. \& Kanbur, S., 2005, \mnras, 360, 1033
543: 
544: \bibitem[Ngeow et al.(2005)]{nge05} Ngeow, C., Kanbur, S., Nikolaev, S., Buonaccorsi, J., Cook, K. \& Welch, D., 2005, \mnras, 363, 831
545: 
546: \bibitem[Ngeow \& Kanbur(2006a)]{nge06a} Ngeow, C. \& Kanbur, S., 2006a, \mnras, 369, 723
547: 
548: \bibitem[Ngeow \& Kanbur(2006b)]{nge06c} Ngeow, C. \&  Kanbur, S., 2006b, \apjl, 642, L29
549: 
550: \bibitem[Ngeow \& Kanbur(2006c)]{nge06} Ngeow, C. \& Kanbur, S., 2006c, \apj, 650, 180
551: 
552: \bibitem[Olling(2006)]{oll06} Olling, R., 2006, \mnras, submitted (astro-ph/0607607)
553: 
554: \bibitem[Pandey \& Malik(1990)]{pan90} Pandey, B. N. \& Malik, H. J., 1990, Communications in Statistics - Theory and Methods, 19, 629
555: 
556: \bibitem[Pandey et al.(1995)]{pan95} Pandey, B. N., Malik, H. J. \& Dubey, P. K., 1995, Communications in Statistics - Theory and Methods, 24, 2261
557: 
558: \bibitem[Pandey(1997)]{pan97} Pandey, B. N., 1997, Communications in Statistics - Theory and Methods, 26, 2191
559: 
560: \bibitem[Pandey \& Srivastava(2001)]{pan01} Pandey, B. N. \& Srivastava, A. K., 2001, IAPQR, 26, 109
561: 
562: \bibitem[Paul(1950)]{pau50} Paul, A. E., 1950, Annals of Mathematical Statistics, 21, 539
563: 
564: \bibitem[Porciani \& Norberg(2006)]{por06} Porciani, C. \& Norberg, P., 2006, \mnras, 371, 1824
565: 
566: \bibitem[Sandage et al.(2004)]{san04} Sandage, A., Tammann, G. A. \& Reindl, B., 2004, \aap, 424, 43
567: 
568: \bibitem[Schwarz(1978)]{sch78} Schwarz, G., 1978, Annals of Statistics, 6, 461
569: 
570: \bibitem[Spergel et al.(2006)]{spe06} Spergel, D., et al., 2006, \apj, in-press (astro-ph/0603449)
571: 
572: \bibitem[Sterken et al.(1999)]{ste99} Sterken, C., Broens, E. \& Koen, C., 1999, \aap, 342, 167
573: 
574: \bibitem[Takeuchi(2000)]{tak00} Takeuchi, T., 2000, Astrophys. \& Space Sci., 271, 213
575: 
576: \bibitem[Tammann \& Reindl(2002)]{tam02} Tammann, G. A. \& Reindl, B., 2002, Astrophys. \& Space Sci., 280, 165
577: 
578: \bibitem[Tegmark et al.(2006)]{teg06} Tegmark, M., et al., 2006, Phys. Rev. D, 74, 123507 
579: 
580: \bibitem[Sebo et al.(2002)]{seb02} Sebo, K., Rawson, D., Mould, J., Madore, B., Putman, M., Graham, J., Freedman, W., Gibson, B. \& Germany, L., 2002, ApJS, 142, 71
581: 
582: \bibitem[Waikar et al.(1984)]{wai84} Waikar, V. B., Schuurmann, F. J. \& Raghunathan, T. E., 1984, Communications in Statistics - Theory and Methods, 13, 1901
583: 
584: \bibitem[Waikar et al.(2002)]{wai01} Waikar, V. B., Ratnaparkhi, M. V. \& Schuurmann, F. J., 2002, {\it Monte Carlo and quasi-Monte Carlo methods 2000: Proceedings of a conference held at Hong Kong Baptist University, Hong Kong SAR, China}, Ed. Fang, K.-T., Hickernell, F. \& Niederreiter, H., Springer, Berlin, pg. 487
585: 
586: \bibitem[Weisberg(1980)]{wei80} Weisberg, S., 1980, {\it Applied Linear Regression}, 1st Ed., Wiley, New York.
587: 
588: \bibitem[Yancey et al.(1989)]{yan89} Yancey, T. A., Judge, G. G. \& Bohrer, R., 1989, Econometrica, 57, 1221
589: 
590: \end{thebibliography}
591: 
592: \end{document}
593: 
594: