\begin{abstract}
In this work we propose a novel Convolutional Neural Network (CNN)
architecture for matching pairs of image patches acquired by
different sensors. Our approach utilizes two CNN subnetworks: the
first is a Siamese CNN, and the second consists of dual
non-weight-sharing CNNs. This allows simultaneous joint and disjoint
processing of the input pair of multimodal image patches. Training
convergence and test accuracy are improved by introducing auxiliary
losses and a corresponding hard negative mining scheme. The proposed
approach is experimentally shown to compare favorably with contemporary
state-of-the-art schemes when applied to multiple datasets of multimodal
images. The code implementing the proposed scheme has been made publicly
available.
\end{abstract}
