1: \begin{abstract}
2: We consider estimation of an optimal individualized treatment rule
3: from observational and randomized studies when a high-dimensional
4: vector of baseline variables is available. Our optimality criterion
5: is with respect to delaying expected time to occurrence of an event
6: of interest (e.g., death or relapse of cancer). We leverage
7: semiparametric efficiency theory to construct estimators with
8: desirable properties such as double robustness. We propose two
9: estimators of the optimal rule, which arise from considering two
10: loss functions aimed at (i) directly estimating the conditional
11: treatment effect (also know as the blip function), and (ii)
12: recasting the problem as a weighted classification problem that uses
13: the 0-1 loss function. Our estimated rules are \textit{super
14: learning} ensembles that minimize the cross-validated risk of a
15: linear combination in a user-supplied library of candidate
16: estimators. We prove oracle inequalities bounding the finite sample
17: excess risk of the estimator. The bounds depend on the excess risk
18: of the oracle selector and a doubly robust term related to
19: estimation of the nuisance parameters. We discuss some important
20: implications of these oracle inequalities such as the convergence
21: rates of the value of our estimator to that of the oracle
22: selector. We illustrate our methods in the analysis of a phase III
23: randomized study testing the efficacy of a new therapy for the
24: treatment of breast cancer.
25: \end{abstract}
26: