1: \begin{abstract}
2: Federated learning (FL) is a privacy-preserving collaboratively machine learning paradigm. Traditional FL requires all data owners (a.k.a. FL clients) to train the same local model. This design is not well-suited for scenarios involving data and/or system heterogeneity. Model-Heterogeneous Personalized FL (MHPFL) has emerged to address this challenge. Existing MHPFL approaches often rely on having a public dataset with the same nature of the learning task, or incur high computation and communication costs. To address these limitations, we propose the \underline{Fed}erated \underline{S}emantic \underline{S}imilarity \underline{A}ggregation (\methodname{}) approach, which splits each client's model into a heterogeneous (structurally-different) feature extractor and a homogeneous (structurally-same) classification header. It performs local-to-global knowledge transfer via semantic similarity-based header parameter aggregation.
3: %Since classification header parameters from different clients corresponding to the same class are semantically similar, we allow each client to upload the header parameters matching with its seen classes to the server. The server aggregates header parameters by class to integrate knowledge from heterogeneous local models.
4: In addition, global-to-local knowledge transfer is achieved via an adaptive parameter stabilization strategy which fuses the seen-class parameters of historical local headers with that of the latest global header for each client. In this way, \methodname{} does not rely on public datasets, while only requiring partial header parameter transmission (thereby saving costs). Theoretical analysis proves the convergence of \methodname{}. Extensive experiments demonstrate that \methodname{} achieves up to $3.62 \times\%$ higher accuracy, $15.54$ times higher communication efficiency, and $15.52 \times$ higher computational efficiency compared to 7 state-of-the-art MHPFL baselines.
5: \end{abstract}
6: