c82a565521e4561d.tex
1: \begin{abstract}
2: We develop a model-based empirical Bayes approach to variable selection problems
3: in which the number of predictors is very large, possibly much larger than the number
4: of responses (the so-called “large p, small n” problem). We consider the multiple linear 
5: regression setting, where the
6: response is assumed to be a continuous variable and it is a linear function of
7: the predictors plus error. The explanatory variables in the linear model can have a positive
8: effect on the response, a negative effect, or no effect. We model the effects
9: of the linear predictors as a three-component mixture
10: in which a key assumption is
11: that only a small (unknown) fraction of the candidate predictors have a non-zero effect on
12: the response variable. By treating the coefficients as random effects we develop
13: an approach that is computationally
14: efficient because the number of parameters that have to be estimated is small, and
15: remains constant regardless of the number of explanatory variables. 
16: The model parameters are estimated using the EM algorithm which
17: is scalable and leads to significantly faster convergence, compared with simulation-based methods.
18: \end{abstract}
19: