1: \begin{abstract}
2: Cross-view image geolocalization provides an estimate of an agent's global position by matching a local ground image to an overhead satellite image without the need for GPS. It is challenging to reliably match a ground image to the correct satellite image since the images have significant viewpoint differences. Existing works have demonstrated localization in constrained scenarios over small areas but have not demonstrated wider-scale localization. Our approach, called Wide-Area Geolocalization (\sysname{}), combines a neural network with a particle filter to achieve global position estimates for agents moving in GPS-denied environments, scaling efficiently to city-scale regions. \sysname{} introduces a trinomial loss function for a Siamese network to robustly match non-centered image pairs and thus enables the generation of a smaller satellite image database by coarsely discretizing the search area. A modified particle filter weighting scheme is also presented to improve localization accuracy and convergence. Taken together, \sysname{}'s network training and particle filter weighting approach achieves city-scale position estimation accuracies on the order of 20 meters, a 98\% reduction compared to a baseline training and weighting approach. Applied to a smaller-scale testing area, \sysname{} reduces the final position estimation error by 64\% compared to a state-of-the-art baseline from the literature. \sysname{}’s search space discretization additionally significantly reduces storage and processing requirements. A video highlight demonstrating particle filter convergence results for \sysname{} compared to the baseline for the Chicago test area is available at \url{https://youtu.be/06MOR0ozQeI}.
3: %We include in our submission a video demonstrating particle filter convergence results for \sysname{} compared to the baseline for the Chicago test area.
4: \end{abstract}
5: