\begin{abstract}
In many real-world applications such as text mining, it is desirable
to select the most relevant features or variables, either to improve
generalization ability or to make the prediction model easier to
interpret. In this paper, a novel adaptive feature scaling (AFS)
scheme is proposed, which introduces a feature scaling vector
$\d \in [0, 1]^m$ to alleviate the bias caused by the differing
scales of the features. By reformulating the resulting AFS model as a
semi-infinite programming problem, a novel feature generating method
is presented to identify the most relevant features for
classification problems. In contrast to traditional feature selection
methods, the new formulation is able to handle extremely
high-dimensional and large-scale problems. With an exact solution to
the worst-case analysis in the identification of relevant features,
the proposed feature generating scheme converges globally. More
importantly, the proposed scheme facilitates group feature selection,
with or without special structures. Comprehensive experiments on a
wide range of synthetic and real-world datasets demonstrate that the
proposed method achieves better or competitive performance compared
with existing methods for (group) feature selection, in terms of both
generalization performance and training efficiency. The C++ and
MATLAB implementations of our algorithm are available at
\emph{http://c2inet.sce.ntu.edu.sg/Mingkui/robust-FGM.rar}.
\end{abstract}