a59fa8acfc9fb0cf.tex
1: \begin{abstract}
2: 	%In tracking community,  most algorithms focus on developing powerful classifiers/templates to determine the center of object while overlook the accurate bounding box estimation. They usually adopt a multi-scale search scheme to predict the scale change of object assuming that the aspect ratio of target is fixed during tracking. However, this assumption is not always true in real scenario, which limits the precision of predicted bounding box.  What's more, online updating a tracking model to adapt to object appearance variations is also crucial for improving tracking performance. For SGD-based model optimization, using a large learning rate may help to converge the model faster but has the risk of letting the loss wander wildly. Thus traditional optimization methods usually choose a relatively small learning rate and iterate for more steps to converge the model, which is time-consuming.
3: 	In this paper, we design a tracking model consisting of response generation and bounding box regression: the first component produces a heat map indicating the presence of the object at different positions, and the second regresses the bounding box shifts relative to anchors mounted on sliding-window locations. Because both components use resizable convolutional filters that adapt to changes in object shape, our tracking model does not need to enumerate anchors of different sizes, which saves model parameters.
4: 	To adapt the model effectively to appearance variations, we propose to train a recurrent neural optimizer offline, in a meta-learning setting, to update the tracking model; the optimizer can converge the model within a few gradient steps. This speeds up model updating while achieving better performance. We extensively evaluate our trackers, ROAM and ROAM++, on the OTB, VOT, LaSOT, GOT-10K and TrackingNet benchmarks, and our methods perform favorably against state-of-the-art algorithms.
5: \end{abstract}
6: