
\begin{abstract}

This paper presents a framework for deep transfer learning that transfers information from multi-domain upstream data with a large number of samples ($n$) to a single-domain downstream task with far fewer samples ($m$), where $m \ll n$, in order to improve performance on the downstream task.

Our framework has two notable features. First, it allows multi-domain data to have both shared and domain-specific features and provides a mechanism for identifying them automatically, enabling precise transfer and use of information.

Second, our framework explicitly indicates which upstream features contribute to the downstream task, establishing a relationship between the upstream domains and the downstream task and thereby enhancing interpretability. Error analysis shows that transfer under our framework can significantly improve the convergence rate for learning Lipschitz functions in downstream supervised tasks, from $\tlO(m^{-\frac{1}{2(d+2)}}+n^{-\frac{1}{2(d+2)}})$ (``no transfer'') to $\tlO(m^{-\frac{1}{2(d^*+3)}} + n^{-\frac{1}{2(d+2)}})$ (``partial transfer''), and even to $\tlO(m^{-1/2}+n^{-\frac{1}{2(d+2)}})$ (``complete transfer''), where $d^* \ll d$ and $d$ is the dimension of the observed data.

Experiments on image classification datasets and a regression dataset support our theoretical findings.

\end{abstract}
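
To make the rates stated in the abstract concrete, consider a purely illustrative choice of dimensions, $d = 30$ and $d^* = 3$; these values are assumptions for the sake of arithmetic and are not taken from our experiments. Ignoring constants and logarithmic factors, the exponents of the $m$-dependent term in the three regimes are
\[
\underbrace{\frac{1}{2(d+2)} = \frac{1}{64}}_{\text{no transfer}},
\qquad
\underbrace{\frac{1}{2(d^*+3)} = \frac{1}{12}}_{\text{partial transfer}},
\qquad
\underbrace{\frac{1}{2}}_{\text{complete transfer}}.
\]
Since a term of the form $m^{-a}$ is halved when $m$ is multiplied by $2^{1/a}$, halving the $m$-dependent part of the bound would require roughly
\[
2^{64}\ \text{(no transfer)},
\qquad
2^{12} = 4096\ \text{(partial transfer)},
\qquad
2^{2} = 4\ \text{(complete transfer)}
\]
times as many downstream samples. This is a statement about the upper bounds only, not about observed performance, and the $n$-dependent term $n^{-\frac{1}{2(d+2)}}$ is the same in all three regimes.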
