18ae1c9746347074.tex
1: \begin{abstract}
2: %
3: In many statistical  linear inverse problems, one needs to recover classes of similar objects 
4: from their noisy images under an operator that does not have a bounded inverse. 
5: Problems of this kind appear in many areas of application. Routinely, in such problems  
6: clustering is carried out at a pre-processing step and then the inverse problem is solved for 
7: each of the cluster averages separately. As a result, the errors of the procedures are usually 
8: examined for the estimation  step only. The objective of this paper is to examine, both theoretically 
9: and via simulations,  the effect of clustering on the accuracy of the solutions of general ill-posed 
10: linear inverse problems. 
11: In particular, we assume that one observes 
12: $X_m = A f_m + \del \eps_m$, $m=1, \ldots, M$, where the functions $f_m$ can be grouped into $K$ classes,
13: and one needs to recover the vector function $\bof= (f_1,\ldots, f_M)^T$. 
14: %
15: We construct  an estimator  for $\bof$ as a solution of  a penalized optimization problem
16: which corresponds to the clustering-before-estimation setting.  
17: We derive an oracle inequality for its precision and confirm that the estimator is 
18: minimax optimal, or nearly minimax optimal up to a factor logarithmic in the number of observations. 
19: One of the advantages of our approach  is that we do not assume that the number of clusters is 
20: known in advance. Subsequently, we compare the accuracy of the above procedure with that of estimation 
21: without clustering and with that of clustering performed after each of the unknown functions has been recovered separately.
22: 
23: We conclude that clustering at the pre-processing step is beneficial when the problem is moderately ill-posed.
24: It should be applied with extreme care when the problem is severely ill-posed. 
25: \\
26:  
27: 
28: \noindent
29: {\bf  Keywords: } ill-posed linear inverse problem, clustering, oracle inequality, minimax convergence rates  \\ 
30: %
31: {\bf  AMS  classification:}  Primary: 65R32, 62H30; Secondary: 62C20, 62G05   
32: 
33: \end{abstract}
34: