c3812ca79d4968bc.tex
1: \begin{abstract}
2: In a lot of real-world data mining and machine learning applications, data are  provided by multiple providers and  each maintains private records of different feature sets about common entities. %, which is also called as vertically partitioned  data.
3: It is challenging  to train these vertically partitioned data effectively and efficiently while keeping  data privacy for traditional data mining and  machine learning algorithms. In this paper,  we focus on nonlinear learning with kernels, and    propose a  federated doubly stochastic kernel learning (FDSKL) algorithm for vertically partitioned data.
4: Specifically, we use random features to approximate the kernel mapping function  and use   doubly stochastic gradients to update the solutions, which  are all computed federatedly without the disclosure of data. %Note that the nonlinear prediction model is stored separately  in different workers.
5: Importantly, we prove that FDSKL has a sublinear convergence rate, and can guarantee the
6: data security under the semi-honest assumption. Extensive experimental results on  a variety of benchmark datasets  show  that FDSKL is significantly faster than state-of-the-art federated learning methods when dealing with kernels,  while  retaining the similar generalization performance.
7: \end{abstract}
8: