53e02edc4da08ce5.tex
1: \begin{abstract}
2:   We consider estimation of an optimal individualized treatment rule
3:   from observational and randomized studies when a high-dimensional
4:   vector of baseline variables is available. Our optimality criterion
5:   is with respect to delaying expected time to occurrence of an event
6:   of interest (e.g., death or relapse of cancer). We leverage
7:   semiparametric efficiency theory to construct estimators with
8:   desirable properties such as double robustness. We propose two
9:   estimators of the optimal rule, which arise from considering two
10:   loss functions aimed at (i) directly estimating the conditional
11:   treatment effect (also know as the blip function), and (ii)
12:   recasting the problem as a weighted classification problem that uses
13:   the 0-1 loss function. Our estimated rules are \textit{super
14:     learning} ensembles that minimize the cross-validated risk of a
15:   linear combination in a user-supplied library of candidate
16:   estimators. We prove oracle inequalities bounding the finite sample
17:   excess risk of the estimator. The bounds depend on the excess risk
18:   of the oracle selector and a doubly robust term related to
19:   estimation of the nuisance parameters. We discuss some important
20:   implications of these oracle inequalities such as the convergence
21:   rates of the value of our estimator to that of the oracle
22:   selector. We illustrate our methods in the analysis of a phase III
23:   randomized study testing the efficacy of a new therapy for the
24:   treatment of breast cancer.
25: \end{abstract}
26: