1: \begin{abstract}
2: %% context and SoA limitation
3: Mesh partitioning is an indispensable tool for efficient parallel numerical simulations.
4: Its goal is to minimize communication between the processes of a simulation while achieving load balance.
5: Established graph-based partitioning tools yield a high solution quality; however, their scalability is limited.
6: Geometric approaches usually scale better, but their solution quality may be unsatisfactory for ``non-trivial'' mesh topologies.
7:
8: %% new techniques
9: In this paper, we present a scalable version of $k$-means that is adapted to yield balanced clusters.
10: Balanced $k$-means constitutes the core of our new partitioning algorithm \bakpa.
11: Bootstrapping of initial centers is performed with space-filling curves, leading to fast convergence of the subsequent balanced k-means algorithm.
12:
13: Our experiments with up to \numprint{16384} MPI processes on numerous benchmark meshes show
14: the following: (i) \bakpa produces partitions with a lower communication volume than state-of-the-art geometric partitioners from the \zoltan package;
15: (ii) \bakpa scales well on large inputs;
16: (iii) a Delaunay mesh with a few billion vertices and edges can be partitioned in a few seconds.
17: \end{abstract}
18: