cc50764d5e0d801f.tex
1: \begin{abstract}
2: %Mainly about our design as of now. Add problem specification.
3: 
4: Distributed machine learning (DML) can be an important capability for modern military to take advantage of data and devices distributed at multiple vantage points to adapt and learn. The existing distributed machine learning frameworks, however, cannot realize the full benefits of DML, because they are all based on the simple linear aggregation framework, but simple, linear aggregation cannot handle military-setting challenges, when learning data at different devices are heterogeneous and the ability for devices to communicate is substantially limited.
5: 
6: The key aspect of our design is that in aggregating steps, instead of taking a linear average of received models like all previous work, we instead take a quasi-arithmetic mean of the received values. We specifically focus on the weighted power mean (WPM). This can be viewed as a form of distributed mirror descent, where the quasi-arithmetic mean aggregation step is a linear averaging in the mirror descent dual space. %design section
7: Our theoretical convergence bounds recover the same big \(O\) bounds as numerous papers in both distributed machine learning and mirror descent. The bound also produces theoretical justification for the improved handling of differing local models: when a weighted power mean with power \(p\) is used under constant learning rate \(\eta\), the bound is \(O(m^{\frac{p+2}{p+1}})\), and can thus be improved by selecting larger \(p\). %theory section
8: Additionally, through extensive experiments on the Extendable Mobile Ad-hoc Network Emulator, we show that when \(p>1\), our design can improve the convergence speed of the model by up to 57\% and it also empirically scales well to more devices in low-communication environments%under time-varying networks
9: , all with little additional computation overhead compared to traditional aggregation.  %experiment section
10: \end{abstract}
11: