\begin{abstract}
Cooperative information sharing is important to theories of human learning and has potential implications for machine learning.
Prior work derived conditions for achieving optimal Cooperative Inference under relatively restrictive assumptions.
We demonstrate convergence for any discrete joint distribution, robustness through equivalence classes and stability under perturbation, and effectiveness by deriving bounds from structural properties of the original joint distribution.
We provide geometric interpretations, draw connections to and implications for optimal transport and importance sampling, and conclude by outlining open questions and challenges to realizing the promise of Cooperative Inference.
\end{abstract}
