738efcb7b264f016.tex
1: \begin{abstract}
2: This paper presents a compact, matrix-based representation of neural networks in a self-contained tutorial fashion. 
3: % 
4: % Specifically, we develop neural networks as a composition of several vector-valued functions. 
5: % 
6: Although neural networks are well-understood pictorially in terms of interconnected neurons, neural networks are mathematical nonlinear functions constructed by composing several vector-valued functions. 
7: % 
8: Using basic results from linear algebra, we represent a neural network as an alternating sequence of linear maps and scalar nonlinear functions, also known as activation functions. 
9: % , which are parameterized by matrix multiplications, and nonlinear maps. 
10: % 
11: The training of neural networks requires the minimization of a cost function, which in turn requires the computation of a gradient. 
12: % 
13: Using basic multivariable calculus results, the cost gradient is also shown to be a function composed of a sequence of linear maps and nonlinear functions.
14: % also known as backpropagation.   
15: % 
16: In addition to the analytical gradient computation, we consider two gradient-free training methods and compare the three training methods in terms of convergence rate and prediction accuracy.  
17: % Three gradient-free training schemes are implemented and the results of training are compared with the gradient-based training. 
18: % Finally, a novel data-drive, gradient free training algorithm is presented and compared. 
19: 
20: \end{abstract}
21: