61a8e1446bef5355.tex
1: \begin{abstract}
2: We study optimal variance reduction solutions for 
3: count and ratio metrics in online controlled experiments.
4: Our methods leverage flexible machine learning tools to incorporate covariates that are independent of the treatment but have predictive power for the outcomes, and employ 
5: the cross-fitting technique to remove the bias introduced by complex machine learning models. 
6: We establish CLT-type asymptotic inference based on our estimators under mild convergence conditions. 
7: Our procedures 
8: are optimal (efficient) for the corresponding targets
9: as long as the machine learning estimators are consistent, 
10: without any requirement for their convergence rates. 
11: To complement the general optimal procedure, 
12: we also derive a linear adjustment method for ratio metrics 
13: as a special case that 
14: is computationally efficient and can flexibly incorporate any pre-treatment covariates. 
15: We evaluate the proposed variance reduction procedures with
16: comprehensive simulation studies and provide practical suggestions regarding 
17: commonly adopted assumptions in computing ratio metrics. 
18: When tested on real online experiment data from LinkedIn, 
19: the proposed optimal procedure for ratio metrics reduces variance by 
20: up to 80\% compared with the standard difference-in-means estimator, and by up to 30\% compared with the CUPED approach, by going beyond linearity and incorporating a large number of additional covariates.
21: \end{abstract}
22: