\begin{abstract}
 \setlength{\parindent}{0.0cm}
 I analyze an unsupervised learning procedure based on maximizing the mutual
 information between the outputs of two networks receiving different
 but statistically dependent inputs
 (Becker and Hinton, Nature 355, 161 (1992)). For a generic data model,
 I show that in the large-sample limit the structure in the data is
 recognized by mutual information maximization.
 For a more restricted model, where the networks are similar
 to perceptrons, I calculate the learning curves for zero-temperature
 Gibbs learning. These show that convergence can be rather slow, and
 I consider a way of regularizing the procedure.
\end{abstract}
