1: \begin{abstract}
2: We consider the variable selection problem for two-sample tests, aiming to select the most informative features to best distinguish samples from two groups.
3: We propose a kernel maximum mean discrepancy~(MMD) framework to solve this problem and further derive its equivalent mixed-integer programming formulations for linear, quadratic, and Gaussian types of kernel functions.
4: Our proposed framework admits advantages of both computational efficiency and nice statistical properties: (i) A closed-form solution is provided for the linear kernel case.
5: Despite NP-hardness, we provide an exact mixed-integer semi-definite programming formulation for the quadratic kernel case, which further motivates the development of exact and approximation algorithms.
6: We propose a convex-concave procedure that finds critical points for the Gaussian kernel case.
7: (ii) We provide non-asymptotic uncertainty quantification of our proposed formulation under null and alternative scenarios.
8: Experimental results demonstrate good performance of our framework.
9: \end{abstract}
10: