1: \begin{abstract}
2: Control of large-scale networked systems often necessitates the availability of complex models for the interactions amongst the agents.
3: However, in many applications building accurate models of these interactions might be prohibitive due to the curse of dimensionality or their inherent complexity.
4: In the meantime, data-guided control methods can circumvent model complexity by directly synthesizing the controller from the observed data.
5: In this paper, we propose a distributed $Q$-learning algorithm to design a feedback mechanism given an underlying graph structure parameterizing the agents' communication.
6: We assume that the distributed nature of the system arises from a common cost and show that for the particular case of identical dynamically decoupled systems, the learned controller converges to the optimal Linear Quadratic Regulator controller for each subsystem.
7: We provide a convergence analysis and verify the result with an example.
8:
9: \noindent \\ Keywords: \textit{Distributed $Q$-learning, data-guided control, linear quadratic regulator, networked control systems}
10: \end{abstract}
11: