
\begin{abstract}

  Annotating data with sensitive labels (e.g., disease, smoking) poses a potential threat to individual privacy in many real-world scenarios. To cope with this problem, we propose a novel setting that protects the privacy of each instance, namely learning from concealed labels for multi-class classification. Concealed labels prevent sensitive labels from appearing in the label set during the label collection stage, as shown in Figure~\ref{motivation}, which specifies ``none'' and some randomly sampled insensitive labels as the concealed label set used to annotate sensitive data. We show that an unbiased estimator can be established from concealed data under mild assumptions, and that the learned multi-class classifier can not only accurately classify instances with insensitive labels but also recognize instances with sensitive labels. Moreover, we bound the estimation error and show that the multi-class classifier achieves the optimal parametric convergence rate. Experiments on synthetic and real-world datasets demonstrate the significance and effectiveness of the proposed method for concealed labels. Source code is available at \href{https://github.com/WilsonMqz/CLF}{https://github.com/WilsonMqz/CLF}.

\end{abstract}
