abstract:798053f1e4fd1022.tex

1: \begin{abstract}

2: In the training of deep learning models, how the model parameters are initialized greatly affects the model performance, sample efficiency, and convergence speed.

3: Representation learning for model initialization has recently been actively studied in the remote sensing field.

4: In particular, the appearance characteristics of the imagery obtained using the a synthetic aperture radar (SAR) sensor are quite different from those of general electro-optical (EO) images, and thus representation learning is even more important in remote sensing domain.

5: Motivated from contrastive multiview coding, we propose multi-modal representation learning for SAR semantic segmentation.

6: Unlike previous studies, our method jointly uses EO imagery, SAR imagery, and a label mask.

7: Several experiments show that our approach is superior to the existing methods in model performance, sample efficiency, and convergence speed.

8: \end{abstract}

9: