1: \begin{abstract}
2: We consider the estimation of densities in multiple subpopulations, where the available sample size in each subpopulation greatly varies.
3: For example, in epidemiology, different diseases may share similar pathogenic mechanism but differ in their prevalence.
4: Without specifying a parametric form, our proposed approach pools information from the population and estimate the density in each subpopulation in a data-driven fashion.
5: Low-dimensional approximating density families in the form of exponential families are constructed from the principal modes of variation in the log-densities, within which subpopulation densities are then fitted based on likelihood principles and shrinkage.
6: The approximating families increase in their flexibility as the number of components increases and can approximate arbitrary infinite-dimensional densities with discrete observations, for which we derived convergence results.
7: The proposed methods are shown to be interpretable and efficient in simulation as well as applications to electronic medical record and rainfall data.
8: \end{abstract}