4448e62e28417bdb.tex
1: \begin{abstract}
2: In this work, we introduce a learning model designed to
3: meet the needs of applications in
4: which computational resources are limited, and robustness and
5: interpretability are prioritized.
6: %
7: Learning problems can be formulated as 
8: constrained stochastic optimization problems, 
9: with the constraints originating mainly 
10: from model assumptions that define a trade-off between 
11: complexity and performance.
12: %
13: This trade-off is closely related to 
14: over-fitting, generalization capacity, and robustness to 
15: noise and adversarial attacks, 
16: and
17: depends on both the structure and complexity of the model 
18: %(e.g., the number of neurons in a neural network)
19: and the properties
20: of the optimization methods used.
21: %
22: We develop an online prototype-based learning algorithm 
23: that builds on annealing optimization
24: and is formulated as a gradient-free stochastic approximation algorithm.
25: %
26: The learning model can be viewed as an interpretable and 
27: progressively growing competitive-learning neural network model
28: to be used for supervised, unsupervised, and reinforcement learning.
29: %
30: The annealing nature of the algorithm contributes to
31: minimal hyper-parameter tuning requirements, 
32: the avoidance of poor local minima, and 
33: robustness with respect to the initial conditions.
34: %
35: At the same time, it provides online control over the performance-complexity trade-off
36: by progressively increasing the complexity of the learning model as needed, 
37: through an intuitive bifurcation phenomenon.
38: %
39: Finally, the use of stochastic approximation enables the study 
40: of the convergence of the learning algorithm through 
41: mathematical tools from dynamical systems and control, 
42: and allows for its integration with reinforcement learning algorithms,
43: yielding an adaptive state-action aggregation scheme.
44: %
45: % We illustrate the properties and evaluate the performance 
46: % of the proposed algorithm in 
47: % supervised, unsupervised, and reinforcement learning problems. 
48: %
49: \end{abstract}
50: