1: \begin{abstract}
2: We construct a semiparametric estimator in case-control studies where
3: the gene and the environment are assumed to be independent. A discrete or
4: continuous parametric distribution of the genes is assumed in the
5: model. A discrete distribution of the genes can be used to model
6: the mutation or presence of certain group of genes.
7: A continuous distribution allows the distribution
8: of the gene effects to be in a finite-dimensional parametric
9: family and can hence be used to model the gene expression levels.
10: We leave the distribution of the environment totally unspecified.
11: The estimator is derived through calculating the efficiency score
12: function in a hypothetical setting where a close approximation to
13: the samples is random. The resulting estimator is proved to be
14: efficient in the hypothetical situation. The efficiency of the
15: estimator is further demonstrated to hold in the case-control
16: setting as well.
17: %The proposed estimator in the discrete gene distribution model
18: %performs very closely to the method in Chatterjee and Carroll [\textit{Biometrika}
19: %hence a further study of the equivalence of the two methods would be of
20: %interest.
21: \end{abstract}