abstract:fe9da63bfdade8ff.tex

1: \begin{abstract}

2: In many real-world applications such as text mining, it is desirable

3: to select the most relevant features or variables to improve the

4: generalization ability, or to provide a better interpretation of the

5: prediction models.  {In this paper, a novel adaptive feature scaling

6: (AFS) scheme is proposed  by introducing a feature scaling {vector

7: $\d \in [0, 1]^m$} to alleviate the bias problem brought by the

8: scaling bias of the diverse features.} By reformulating the resultant

9: AFS model to semi-infinite programming problem, a novel feature

10: generating method is presented to identify the most relevant features

11: for classification problems. In contrast to the traditional feature

12: selection methods, the new formulation has the advantage of solving

13: extremely high-dimensional and large-scale problems.  With an exact

14: solution to the worst-case analysis in the identification of relevant

15: features, the proposed feature generating scheme converges globally.

16: More importantly, the proposed scheme facilitates the group selection

17: with or without special structures. Comprehensive experiments on a

18: wide range of synthetic and real-world datasets demonstrate that the

19: proposed method {achieves} better or competitive performance compared

20: with the existing methods on (group) feature selection in terms of

21: generalization performance and training efficiency. The C++ and

22: MATLAB implementations of our algorithm  can be available at

23: \emph{http://c2inet.sce.ntu.edu.sg/Mingkui/robust-FGM.rar}.

24:

25: \end{abstract}