
\begin{abstract}

   In neural architecture search (NAS), differentiable architecture search (DARTS) has recently attracted much attention due to its high efficiency.

   DARTS defines an over-parameterized network with mixed edges, each of which represents all operator candidates, and jointly optimizes the weights of the network and its architecture in an alternating manner.

   However, this method tends to find a model whose weights converge faster than those of the other candidates, and such a fastest-converging model often overfits.

   Consequently, the resulting model does not always generalize well.

   To overcome this problem, we propose a method called minimum stable rank DARTS (MSR-DARTS), which finds a model with the \textit{best generalization error} by replacing architecture optimization with a selection process based on the minimum stable rank criterion.

   Specifically, each convolution operator is represented by a matrix, and MSR-DARTS selects the operator whose matrix has the smallest stable rank.
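
   For reference, and assuming the standard definition (it is not stated in this abstract, and the symbols $A$ and $\sigma_i$ are introduced here only for illustration), the stable rank of a nonzero matrix $A$ with singular values $\sigma_i$ is
   \[
     \mathrm{srank}(A) = \frac{\|A\|_F^2}{\|A\|_2^2} = \frac{\sum_i \sigma_i^2}{\sigma_{\max}^2},
   \]
   so that $1 \le \mathrm{srank}(A) \le \mathrm{rank}(A)$.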

   We evaluated MSR-DARTS on the CIFAR-10 and ImageNet datasets.

   It achieved an error rate of $2.54\%$ with $4.0$M parameters within $0.3$ GPU-days on CIFAR-10, and a top-1 error rate of $23.9\%$ on ImageNet.

   The official code is available at \url{https://github.com/mtaecchhi/msrdarts.git}.

\end{abstract}
