\begin{abstract}
In the training of deep learning models, the initialization of the model parameters greatly affects model performance, sample efficiency, and convergence speed.
Representation learning for model initialization has recently been actively studied in the remote sensing field.
In particular, the appearance characteristics of imagery obtained using a synthetic aperture radar (SAR) sensor are quite different from those of general electro-optical (EO) images, and thus representation learning is even more important in the remote sensing domain.
Motivated by contrastive multiview coding, we propose multi-modal representation learning for SAR semantic segmentation.
Unlike previous studies, our method jointly uses EO imagery, SAR imagery, and a label mask.
Several experiments show that our approach is superior to the existing methods in model performance, sample efficiency, and convergence speed.
\end{abstract}