1: \begin{abstract}
2: Learning to compare two objects are essential in applications,
3: such as digital forensics, face recognition, and brain network analysis,
4: especially when labeled data are scarce and imbalanced.
5: As these applications make high-stake decisions and involve societal values like fairness
6: and transparency,
7: it is critical to explain the learned models.
8: We aim to study post-hoc explanations of Siamese networks (SN) widely used in learning to compare.
9: We characterize the instability of gradient-based explanations due to the additional compared object in SN,
10: in contrast to architectures with a single input instance.
11: We propose an optimization framework that derives global invariance from unlabeled data using self-learning to promote the stability of local explanations tailored for specific query-reference pairs.
12: The optimization problems can be solved using gradient descent-ascent (GDA) for constrained optimization, or SGD for KL-divergence regularized unconstrained optimization, with convergence proofs, especially when
13: the objective functions are nonconvex due to the Siamese architecture.
14: Quantitative results and case studies on tabular and graph data from neuroscience and chemical engineering show that the framework respects the self-learned invariance while robustly optimizing the faithfulness and simplicity of the explanation.
15: We further demonstrate the convergence of GDA experimentally.
16: \end{abstract}
17: