abstract:4448e62e28417bdb.tex

1: \begin{abstract}

2: In this work, we introduce a learning model designed to

3: meet the needs of applications in

4: which computational resources are limited, and robustness and

5: interpretability are prioritized.

6: %

7: Learning problems can be formulated as

8: constrained stochastic optimization problems,

9: with the constraints originating mainly

10: from model assumptions that define a trade-off between

11: complexity and performance.

12: %

13: This trade-off is closely related to

14: over-fitting, generalization capacity, and robustness to

15: noise and adversarial attacks,

16: and

17: depends on both the structure and complexity of the model,

18: %(e.g., the number of neurons in a neural network)

19: as well as the properties

20: of the optimization methods used.

21: %

22: We develop an online prototype-based learning algorithm

23: based on annealing optimization

24: that is formulated as an online gradient-free stochastic approximation algorithm.

25: %

26: The learning model can be viewed as an interpretable and

27: progressively growing competitive-learning neural network model

28: to be used for supervised, unsupervised, and reinforcement learning.

29: %

30: The annealing nature of the algorithm contributes to

31: minimal hyper-parameter tuning requirements,

32: poor local minima prevention, and

33: robustness with respect to the initial conditions.

34: %

35: At the same time, it provides online control over the performance-complexity trade-off

36: by progressively increasing the complexity of the learning model as needed,

37: through an intuitive bifurcation phenomenon.

38: %

39: Finally, the use of stochastic approximation enables the study

40: of the convergence of the learning algorithm through

41: mathematical tools from dynamical systems and control,

42: and allows for its integration with reinforcement learning algorithms,

43: constructing an adaptive state-action aggregation scheme.

44: %

45: % We illustrate the properties and evaluate the performance

46: % of the proposed algorithm in

47: % supervised, unsupervised, and reinforcement learning problems.

48: %

49: \end{abstract}

50: