1: \begin{abstract}
2: In this work, we introduce a learning model designed to
3: meet the needs of applications in
4: which computational resources are limited, and robustness and
5: interpretability are prioritized.
6: %
7: Learning problems can be formulated as
8: constrained stochastic optimization problems,
9: with the constraints originating mainly
10: from model assumptions that define a trade-off between
11: complexity and performance.
12: %
13: This trade-off is closely related to
14: over-fitting, generalization capacity, and robustness to
15: noise and adversarial attacks,
16: and
17: depends on both the structure and complexity of the model,
18: %(e.g., the number of neurons in a neural network)
19: as well as the properties
20: of the optimization methods used.
21: %
22: We develop an online prototype-based learning algorithm
23: based on annealing optimization
24: that is formulated as an online gradient-free stochastic approximation algorithm.
25: %
26: The learning model can be viewed as an interpretable and
27: progressively growing competitive-learning neural network model
28: to be used for supervised, unsupervised, and reinforcement learning.
29: %
30: The annealing nature of the algorithm contributes to
31: minimal hyper-parameter tuning requirements,
32: poor local minima prevention, and
33: robustness with respect to the initial conditions.
34: %
35: At the same time, it provides online control over the performance-complexity trade-off
36: by progressively increasing the complexity of the learning model as needed,
37: through an intuitive bifurcation phenomenon.
38: %
39: Finally, the use of stochastic approximation enables the study
40: of the convergence of the learning algorithm through
41: mathematical tools from dynamical systems and control,
42: and allows for its integration with reinforcement learning algorithms,
43: constructing an adaptive state-action aggregation scheme.
44: %
45: % We illustrate the properties and evaluate the performance
46: % of the proposed algorithm in
47: % supervised, unsupervised, and reinforcement learning problems.
48: %
49: \end{abstract}
50: