1: \begin{abstract}
2: The on-orbit intelligent planning of satellites swarm has attracted increasing attention from scholars. Especially in tasks such as the pursuit and attachment of non-cooperative satellites, satellites swarm must achieve coordinated cooperation with limited resources.
3: The study proposes a reinforcement learning framework that integrates the transformer and expert networks.
4: Firstly, under the constraints of incomplete information about non-cooperative satellites, an implicit multi-satellites cooperation strategy was designed using a communication sharing mechanism.
5: Subsequently, for the characteristics of the pursuit-attachment tasks,
6: the multi-agent reinforcement learning framework is improved by introducing transformers and expert networks inspired by transfer learning ideas.
7: To address the issue of satellites swarm scalability, sequence modelling based on transformers is utilized to craft memory-augmented policy networks, meanwhile increasing the scalability of the swarm.
8: By comparing the convergence curves with other algorithms, it is shown that the proposed method is qualified for pursuit-attachment tasks of satellites swarm.
9: Additionally, simulations under different maneuvering strategies of non-cooperative satellites respectively demonstrate the robustness of the algorithm and the task efficiency of the swarm system. The success rate of pursuit-attachment tasks is analyzed through Monte Carlo simulations.
10: \end{abstract}
11: