e32c84d9212bf9fb.tex
1: \begin{abstract}
2:   Siamese-network-based self-supervised learning (SSL) suffers from slow convergence
3:   and instability in training.
4:   To alleviate this, we propose a framework to exploit intermediate self-supervisions in each stage of deep nets, called the {\it Ladder Siamese Network}.
5:    Our self-supervised losses encourage the intermediate layers
6:    to be consistent with different data augmentations to single samples, which facilitates training progress and enhances the discriminative ability of the intermediate layers themselves.
7:    While some existing work has already utilized multi-level self supervisions in SSL, ours is different in that 1) we reveal its usefulness with non-contrastive Siamese frameworks in both theoretical and empirical viewpoints, and 2) ours improves image-level classification, instance-level detection, and pixel-level segmentation simultaneously.
8:    Experiments show that the proposed framework can improve BYOL baselines by 1.0\% points in ImageNet linear classification, 1.2\% points in COCO detection,
9:    and 3.1\% points in PASCAL VOC segmentation.
10:    In comparison with the state-of-the-art methods, our Ladder-based model achieves competitive and balanced performances in all tested benchmarks without causing large degradation in one. 
11: \end{abstract}
12: