\begin{abstract}
% Modeling user interests from historical interaction sequences and predicting the next item is one of the most important tasks for recommender systems. Recently, the research focus has shifted from recurrent neural networks to attention-based and graph-based methods. Despite their effectiveness, few existing methods take into account the diversity of behaviors within a sequence, which is extremely common in practical recommendation scenarios. For example, on short-video platforms, different aspects of a user's interests drive four kinds of behavior: view, add-to-favorite, share, and like. Encoding multi-behavior signals in sequences can further enhance recommendation performance, but it poses two challenges: \textbf{a)} multi-behavior patterns and distributions in sequences vary across users because of personalization; \textbf{b)} multi-behavior dependencies are complex and multi-order, and thus hard for sequential models to capture. To address these two challenges, we propose a novel dual-view learning framework with a \textbf{m}ulti-scale attention transformer view and a \textbf{h}ypergraph view for modeling complex relations in multi-behavior sequential \textbf{rec}ommendation (named \textbf{\baby}). \baby is designed to capture the varying distributions of multi-behavior signals in sequences with multi-scale attention and to learn multi-order behavior dependencies with a hypergraph structure. The two views are combined in a \textit{Cloze} task, which makes better use of the training data. Extensive experiments on three real-world datasets demonstrate the effectiveness of \baby, which achieves significantly better performance and faster convergence with only a small increase in time complexity.
% \end{abstract}
