012231425310aad4.tex
1: \begin{abstract}
2: 	Increasingly complex data analysis tasks motivate the study of the dependency of  distributions of multivariate continuous random variables on scalar or vector predictors.   Statistical regression models for distributional responses so far have primarily been investigated for the case of one-dimensional response distributions. We investigate  here the  case  of multivariate response distributions while adopting the 2-Wasserstein metric in the distribution space. The challenge is that  unlike the situation in the univariate case, the optimal transports that correspond to geodesics in the space of distributions with the 2-Wasserstein metric do not  have an explicit representation for multivariate distributions.  We show that under some regularity assumptions the conditional Wasserstein barycenters constructed for a geodesic in the Euclidean predictor space form a corresponding geodesic in the Wasserstein distribution space and demonstrate how the notion of conditional barycenters can be harnessed to interpolate as well as extrapolate multivariate distributions. The utility of distributional inter- and extrapolation is explored in simulations and examples.  We study both  global parametric-like and  local smoothing-like models to implement  conditional Wasserstein barycenters and establish asymptotic convergence properties for the corresponding estimates. For algorithmic implementation  we make use of a Sinkhorn entropy-penalized algorithm.   Conditional Wasserstein barycenters and distribution extrapolation are illustrated  with applications in climate science and studies of aging.\\
3: 	
4: 	\no KEY WORDS: Optimal Transport, Fr\'echet Mean, Wasserstein Metric, Geodesics,  Sinkhorn Penalty,  Climate Change, Baltimore Longitudinal Study of Aging.
5: \end{abstract}
6: