1: \begin{abstract}
2: \setlength{\parindent}{0.0cm}
3: An unsupervised learning procedure based on maximizing the mutual
4: information between the outputs of two networks receiving different
5: but statistically dependent inputs is analyzed
6: (Becker and Hinton, Nature, 355, 92, 161). For a generic data model,
7: I show that in the large sample limit the structure in the data is
8: recognized by mutual information maximization.
9: For a more restricted model, where the networks are similar
10: to perceptrons, I calculate the learning curves for zero-temperature
11: Gibbs learning. These show that convergence can be rather slow, and
12: a way of regularizing the procedure is considered.
13: \end{abstract}
14: