cond-mat0206286/PRE.tex
1: %\documentclass[twocolumn,showpacs,preprintnumbers,amsmath,amssymb]{revtex4}
2: \documentclass[preprint,showpacs,preprintnumbers,amsmath,amssymb]{revtex4}
3: %%%%%%% Some other (several out of many) possibilities
4: %%%%%%%\documentclass[preprint,aps]{revtex4}
5: %%%%%%%\documentclass[preprint,aps,draft]{revtex4}
6: %%%%%%%\documentclass[prb]{revtex4}% Physical Review B
7: \usepackage{graphicx}
8: \usepackage{dcolumn}
9: \usepackage{bm}
10: \newcommand{\mbx}[1]{\mbox{\boldmath $#1$}}
11: %\nofiles
12: \begin{document}
13: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
14: \preprint{cond-mat/0206286}
15: \title{Statistical Mechanics of \\
16: the Bayesian Image Restoration under Spatially Correlated Noise}
17: %%%%%%%%%%%%%%%%%%%%%%%
18: \author{Jun  Tsuzurugi${}^{\dagger}$${}^{\ddagger}$}
19: \email{juntuzu@brain.riken.go.jp}
20: %%%%%%%%%%%%%%%%%%%%%%%
21: \author{Masato  Okada${}^{\ddagger}$}%
22: \affiliation{
23: ${}^{\dagger}$Nara Institute of Science and Technology,
24: Takayama-cho 8916-5, Ikoma-shi, Nara, 630-0101 Japan\\
25: ${}^{\ddagger}$RIKEN Brain Science Institute
26: Hirosawa 2-1, Wakou-shi, Saitama, 351-0456 Japan
27: }
28: %%%%%%%%%%%%%%%%%%%%%%%
29: \date{\today}% It is always \today, today,
30:              %  but any date may be explicitly specified
31: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
32: \begin{abstract}
33: We investigated the use of the Bayesian inference to restore noise-degraded images  under conditions of spatially correlated noise. The generative statistical models used for the original image and the noise were assumed to obey multi-dimensional Gaussian distributions whose covariance matrices are translational invariant. We derived an exact description to be used as the expectation for the restored image by the Fourier transformation and restored an image distorted by spatially correlated noise by using a spatially uncorrelated noise model. 
34: We found that the resulting hyperparameter estimations for the minimum error and maximal posterior marginal criteria did not coincide when the generative probabilistic model and the model used for restoration were in different classes, while they did coincide when they were in the same class. 
35: \end{abstract}
36: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
37: \pacs{07.05.Mh, 87.80.Xa, 87.18.Sn}% PACS
38: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
39: \keywords{Image restoration, Bayesian inference, Hyper-parameter estimation, 
40: Fourier transformation}
41: \maketitle
42: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
43: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
44: \section{Introduction}
45: Recent research has shown that the Bayesian inference is a useful approach to the image restoration problem \cite{morina1}-\cite{inoue4}. However, previous research has usually assumed that the superimposed noise is not correlated between pixels \cite{morina1}-\cite{tanaka1}. Statistical mechanics provides a useful method for analyzing the image restoration problem \cite{nisi1},\cite{nisi2}. Still, when we take optical effects into consideration, it seems natural to take spatially correlated noise into account. 
46: We have investigated image restoration under the condition of spatially correlated noise. We considered a case in which both the original image and noise obey a Gaussian distribution composed of translational-symmetric matrices. 
47: We were able to diagonalize the Gaussian distribution by using Fourier transformation. 
48: While previous research was done using spatially uncorrelated noise, we considered 
49: the availability of a spatially uncorrelated noise model against spatially correlated noise. 
50: 
51: We considered tow methods for estimating the hyperparameters. 
52: The first is minimizing the mean squared error - the hyperparameters are set so that the mean squared error is minimized. However, minimizing the mean squared error is not applicable to the image restoration problem because we need the original image to practice this method.
53: The second method is maximizing the marginal likelihood 
54: \cite{tanaka2} - the hyperparameters are estimated by maximizing the marginal likelihood acquired from only the distorted image. When the noise probabilities for generating and restoring coincidence are the same, the results given by the two methods are the same. When the probabilities are different, we found that they are not. 
55: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
56: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
57: \section{Model}
58: The images we see in our daily life are always two-dimensional, although we assume a $d$-dimensional image for mathematical generalization, where the length of each side is $L$ and the total number of pixels, $N$, is $L^d$. 
59: The $\mbx{i}$-th pixel value, $\xi_{\mbx{i}}$, of an original image can be written in 
60: a multiple Gaussian distribution as
61: %%%%%%%%%%%%%%
62: \begin{equation}
63: P({\mbx{\xi}})=\frac{1}{Z_{\rm{prior}}(\beta,h)}\exp
64: 		\left[
65: 			-H(\mbx{\xi},\beta,h)
66: 		\right], \label{prior}
67: \end{equation}
68: %%%%%%%%%%%%%%
69: \begin{equation}\label{hamil}
70: 	H(\mbx{\xi},\beta,h)=\mbx{\xi}^{T}(\beta G + hI)\mbx{\xi},
71: \end{equation}
72: %%%%%%%%%%%%%%
73: where $\beta$ and~$h$ are positive scalar values, $G$ is a matrix, and $I$ is the unit matrix. The partition function, $Z_{\rm{prior}}(\beta,h)$, is given by
74: 
75: %%%%%%%%%%%%%%
76: \begin{equation}
77: Z_{\rm{prior}}(\beta,h)
78: 	=	(2\pi)^{\frac{N}{2}}
79: 		\left|
80: 			2(\beta G + h I)
81: 		\right|^{-\frac{1}{2}}.\label{Zprior}
82: \end{equation}
83: %%%%%%%%%%%%%%
84: Note that Eq. (\ref{hamil}) is called the Hamiltonian in statistical mechanics.
85: In this paper, we argue that matrix $G$ is translational invariant and that element $G_{{\mbx{i}},{\mbx{j}}}$ is given by \cite{nisi2}
86: 
87: %%%%%%%%%%%%%%
88: \begin{equation}
89: 	G_{{\mbx{i}},{\mbx{j}}}
90: 		= 2d \delta_{({\mbx{i}}-{\mbx{j}}),{\mbx{0}}}
91: 		-\sum_{\mbx{\delta}}
92: 			\delta_{({\mbx{i}}-{\mbx{j}}),-{\mbx{\delta}}}
93: 		-\sum_{\mbx{\delta}}
94: 			\delta_{({\mbx{i}}-{\mbx{j}}),{\mbx{\delta}}},\label{greenG}
95: \end{equation}
96: %%%%%%%%%%%%%%
97: where the $d$-dimensional vector ${\mbx{\delta}}$ is
98: 
99: %%%%%%%%%%%%%%
100: \begin{equation}
101: 	{\mbx{\delta}} = (1,0,\cdots,0),(0,1,0,\cdots,0),\cdots,(0,\cdots,0,1).\label{pots}
102: \end{equation}
103: %%%%%%%%%%%%%%
104: The $\delta_{\mbx{i},\mbx{j}}$ in Eq. (\ref{greenG}) is the Kronecker delta, which is assumed to be
105: 
106: %%%%%%%%%%%%%%
107: \begin{equation}
108: 	\delta_{\mbx{i},\mbx{j}}=
109: 	\left\{
110: 		\begin{array}{rl}
111: 		1 & \mbox{(\(\mbx{i}=\mbx{j}\))} \\
112: 		0 & \mbox{(\(\mbx{i}\neq\mbx{j}\)).}
113: 		\end{array}
114: 	\right.
115: \end{equation}
116: %%%%%%%%%%%%%%
117: Equation (\ref{greenG}) means that interactions occur only between nearest-neighbor pixels.
118: If $\beta$ takes a large value, i.e., the interaction of the first term has a large effect, the neighboring pixels tend to take the same value. When $h$ is large, the absolute value of the pixels tends to be small. Furthermore, we use a periodic boundary condition, 
119: 
120: %%%%%%%%%%%%%%
121: \begin{equation}
122: 	\mbx{\xi}_{\mbx{i}} = \mbx{\xi}_{\mbx{i}+L\mbx{\delta}}.
123: \end{equation}
124: %%%%%%%%%%%%%%
125: The distorted image, ${\mbx{\tau}}= \{\tau_{\mbx{i}}\}$, is generated according to the following conditional probability, which obeys a Gaussian distribution: 
126: %%%%%%%%%%%%%%
127: \begin{equation}
128: 	P_{\rm{out}}({\mbx{\tau}}|{\mbx{\xi}})=
129: 		\frac{1}{Z_{\rm{noise}}}
130: 		\exp\left[
131: 			-\frac{1}{2}
132: 				({\mbx{\tau}}-{\mbx{\xi}})^T
133: 				R^{-1}
134: 				({\mbx{\tau}}-{\mbx{\xi}})
135: 		\right],\label{noize}
136: \end{equation}
137: %%%%%%%%%%%%%%
138: %%%%%%%%%%%%%%
139: \begin{equation}
140: Z_{\rm{noise}}(R)=(2\pi)^{\frac{N}{2}}|R|^{\frac{1}{2}},\label{Znoize}
141: \end{equation}
142: %%%%%%%%%%%%%%
143: where $R$ is a translational invariant covariance matrix.
144: In this paper, we consider the following spatially correlated noise:
145: 
146: %%%%%%%%%%%%%%
147: \begin{equation}\label{soukan}
148: 	R_{{\mbx{i}},{\mbx{j}}}
149: 		= (1-a)b^2 \delta_{{\mbx{i}},{\mbx{j}}}
150: 			+a b^2 \exp\left[-\frac{({\mbx{i}}-{\mbx{j}})^2}
151: 						{\kappa^2}
152: 			\right],
153: \end{equation}
154: %%%%%%%%%%%%%%
155: $0\leq a \leq 1$.
156: The noise obeys an identical independent Gaussian distribution with mean $0$ and variance $b^2$, when $a=0$. 
157: 
158: A general strategy commonly used in image restoration is to apply the Bayes formulation to the posterior probability. Here, $\mbx{\sigma}$ is a restoration image  based on the Bayes formulation. When a distorted image, $\mbx{\tau}$, is given, one can use the formulation to calculate the restored image, $\mbx{\sigma}$:
159: 
160: %%%%%%%%%%%%%%
161: 	\begin{equation}
162: 	P(\mbx{\sigma}|\mbx{\tau})
163: 	=
164: 		\frac{
165: 			P_{out}(\mbx{\tau}|\mbx{\sigma})P(\mbx{\sigma})
166: 		}
167: 		{
168: 			\int d\sigma
169: 				P_{out}(\mbx{\tau}|\mbx{\sigma})P(\mbx{\sigma})
170: 		}
171: 	=\frac
172: 		{
173: 			\exp \left[ -H_{eff} \right]
174: 		}
175: 		{
176: 			\int d \sigma
177: 			\exp{\left[ -H_{eff} \right]}
178: 		},\label{Bayes}
179: \end{equation}
180: %%%%%%%%%%%%%%
181: where
182: 
183: %%%%%%%%%%%%%%
184: \begin{equation}
185: H_{eff} = 
186: 			\mbx{\sigma}^{T}(\beta G + hI)\mbx{\sigma}
187: 			+\frac{1}{2}
188: 				({\mbx{\tau}}-{\mbx{\sigma}})^T
189: 				R^{-1}
190: 				({\mbx{\tau}}-{\mbx{\sigma}}),\label{yuukouA}
191: \end{equation}
192: %%%%%%%%%%%%%%
193: is an effective Hamiltonian.
194: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
195: \section{Theory}
196: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
197: \subsection{Expectation of the restored image}
198: Since the covariance matrices given by Eqs. (\ref{greenG}) and (\ref{soukan}) are translational invariant, they can be diagonalized using the discrete Fourier transformation.
199: 
200: The discrete Fourier transformation is
201: %%%%%%%%%%%%%%
202: \begin{equation}\label{Fourier}
203: 	\tilde{\xi}_{\mbx{k}} = \frac{1}{\sqrt{N}}
204: 					\sum_{\mbx{j}} \xi_{\mbx{j}}
205: 						e^{-i{\mbx{k\cdot j}}} ,
206: \end{equation}
207: %%%%%%%%%%%%%%
208: and the inverse Fourier transformation is
209: 
210: %%%%%%%%%%%%%%
211: \begin{equation}\label{gaku}
212: 	\xi_{\mbx{j}} = \frac{1}{\sqrt{N}}
213: 					\sum_{\mbx{k}} \tilde{\xi}_{\mbx{k}}
214: 						e^{i{\mbx{k\cdot j}}} ,
215: \end{equation}
216: %%%%%%%%%%%%%%
217: where $i$ is the imaginary unit, and $\mbx{k}$ is a $d$-dimensional vector with the same as $\mbx{j}$, and the degree of freedom is $L^d$.
218: %%%%%%%%%%%%%%
219: \begin{equation}
220: 	\sum_{\mbx{j}}=L^d=N,~~~\sum_{\mbx{k}}=L^d=N.
221: \end{equation}
222: %%%%%%%%%%%%%%
223: Moreover, each component of ${\mbx{k}}$ takes the value
224: %%%%%%%%%%%%%%
225: \begin{equation}
226: 	0,\frac{2}{L}\pi, \frac{4}{L}\pi,\cdots,\frac{2(L-1)}{L}\pi.
227: \end{equation}
228: %%%%%%%%%%%%%%
229: We can diagonalize Eq. (\ref{prior}) by using the Fourier representation:
230: %%%%%%%%%%%%%%
231: \begin{equation}
232: 	P({\mbx{\xi}})=
233: 	\frac{1}{Z_{\rm{prior}}(\beta,h)}\exp
234: 		\left[
235: 				-\sum_{\mbx{k}}
236: 				(\beta \tilde{G}_{\mbx{k}} + h)
237: 				\tilde{\xi}_{\mbx{k}}\tilde{\xi}_{-\mbx{k}}
238: 		\right],
239: 	\label{priorF}
240: \end{equation}
241: %%%%%%%%%%%%%%
242: %%%%%%%%%%%%%%
243: \begin{equation}\label{green}
244: 	\tilde{G}_{\mbx{k}}
245: 		= \sum_{\mbx{\delta}}
246: 			[2-2\cos({\mbx{k}}\cdot{\mbx{\delta}})].
247: \end{equation}
248: %%%%%%%%%%%%%%
249: Furthermore, we diagonalize $R_{\mbx{i},\mbx{j}}$ in Eq. (\ref{soukan}):
250: 
251: %%%%%%%%%%%%%%
252: \begin{equation}\label{soukanF}
253: 	\tilde{R}_{\mbx{k}}
254: 	=(1-a)b^2+ab^2\sum_{\mbx{l}}
255: 		e^{-\frac{{\mbx{l}}^2}
256: 						{\kappa^2}
257: 		}
258: 				\cos({\mbx{k}}\cdot{\mbx{l}}),
259: \end{equation}
260: %%%%%%%%%%%%%%
261: where the range of $l$, which is a component of vector $\mbx{l}$, is given by
262: 
263: %%%%%%%%%%%%%%
264: \begin{equation}
265: -(L-1)\leq l \leq L-1.
266: \end{equation}
267: %%%%%%%%%%%%%%
268: We can execute the Fourier transformation on the inside of $\exp$ in Eq. (\ref{noize}) 
269: in the same way as in Eq. (\ref{prior}).
270: 
271: %%%%%%%%%%%%%%
272: \begin{eqnarray}
273: 	& &P_{out}({\mbx{\tau}}|{\mbx{\xi}})
274: 	=\frac{1}{Z_{\rm{noise}}} \nonumber \\
275: 	& &\times \exp\left[
276: 			-\frac{1}{2}\sum_{\mbx{k}}
277: 				\tilde{R}_{\mbx{k}}^{-1}
278: 				(\tilde{\tau}_{\mbx{k}}-\tilde{\xi}_{\mbx{k}})
279: 				(\tilde{\tau}_{-{\mbx{k}}}-\tilde{\xi}_{-{\mbx{k}}})
280: 		\right]. \label{Pout}
281: \end{eqnarray}
282: %%%%%%%%%%%%%%
283: We define $\left< \cdot \right>$ as denoting the thermal average 
284: based on the Bayes formula for the posterior probability $P(\mbx{\sigma}|\mbx{\tau})$ in Eq. (\ref{Bayes}). The expectation of the restored image, $\sigma_{\mbx{j}}$, is thus assumed to be
285: %%%%%%%%%%%%%%
286: \begin{equation}\label{repair}
287: 	\left<\sigma_{\mbx{j}}\right>
288: 		=
289: 		\frac{1}{\sqrt{N}}
290: 			\sum_{\mbx{k}}
291: 			\left<
292: 				\tilde{\sigma}_{\mbx{k}}
293: 			\right>
294: 			e^{i{\mbx{k}}\cdot{\mbx{j}}}.
295: \end{equation}
296: %%%%%%%%%%%%%%
297: 
298: Using Eq. (\ref{Bayes}), the expectation of the Fourier component, $\sigma_{\mbx{k}}$, can be written as
299: %%%%%%%%%%%%%%
300: \begin{equation}
301: 		\left< \tilde{\sigma}_{\mbx{k}} \right>
302: 	=\int \prod_{\mbx{k}'} d\tilde{\sigma}_{\mbx{k}'}
303: 		\tilde{\sigma}_{\mbx{k}}
304: 		P(\mbx{\sigma}|\mbx{\tau})
305: 	=\frac
306: 		{
307: 			\int \prod_{\mbx{k}'} d\tilde{\sigma}_{\mbx{k}'}
308: 				\tilde{\sigma}_{\mbx{k}} e^{-\hat{\tilde{H}}_{eff}}
309: 		}
310: 		{
311: 			\int \prod_{\mbx{k}'} d\tilde{\sigma}_{\mbx{k}'}
312: 				e^{-\hat{\tilde{H}}_{eff}}
313: 		},\label{kakkonai}
314: \end{equation}
315: %%%%%%%%%%%%%%
316: where $\hat{\tilde{H}}_{eff}$ is obtained by applying the Fourier transformation to Eq. (\ref{yuukouA}) as follows.
317: %%%%%%%%%%%%%%
318: \begin{eqnarray}
319: 	\hat{\tilde{H}}_{eff} &=&
320: 		\sum_{\mbx{k}}
321: 			({\hat{\beta}} \tilde{G}_{\mbx{k}} + \hat{h})
322: 				\tilde{\sigma}_{\mbx{k}} \tilde{\sigma}_{\mbx{-k}}
323: 				\nonumber \\
324: 		&+&\sum_{\mbx{k}}\frac{1}{2\hat{\tilde{R}}_{\mbx{k}}}
325: 				(\tilde{\tau}_{\mbx{k}}-\tilde{\sigma}_{\mbx{k}})
326: 				(\tilde{\tau}_{-{\mbx{k}}}-\tilde{\sigma}_{-{\mbx{k}}}).
327: 					\label{yuukou}
328: \end{eqnarray}
329: %%%%%%%%%%%%%%
330: $\beta$, $h$, and $\tilde{R}_{\mbx{k}}$ are unknown adjustable parameters that determine the properties of the original image and noise. One of our major focus here is how to estimate these hyperparameters precisely. 
331: 
332: To proceed further, we have to assume some explicit form of the source prior and noise posterior formations, which are used when we restore the image. In this paper, we define both of these probability formations as the same formation except that they have different hyperparameters. We define the hyperparameters that correspond to $\beta, h,$, and $\tilde{R}_{\mbx{k}}$ as $\hat{\beta}$, $\hat{h}$, and $\hat{\tilde{R}}_{\mbx{k}}$, respectively. If $\hat{\tilde{R}}_{\mbx{k}}$ is independent of $\mbx{k}$, the noise is spatially uncorrelated.
333: 
334: Substituting
335: %%%%%%%%%%%%%%
336: \begin{equation}
337: 	\hat{\tilde{A}}_{\mbx{k}}
338: 	=\hat{\beta} \tilde{G}_{\mbx{k}}
339: 		+\hat{h} +\frac{1}{2\hat{\tilde{R}}_{\mbx{k}}},~~~~~~~~~~~
340: 	\hat{\tilde{B}}_{\mbx{k}}
341: 	=\frac{1}{2\hat{\tilde{R}}_{\mbx{k}}}
342: \end{equation}
343: \label{okikae}
344: %%%%%%%%%%%%%%
345: into Eqs. (\ref{kakkonai}) and (\ref{yuukou}), we get
346: %%%%%%%%%%%%%%
347: \begin{equation}
348: 	\left< \tilde{\sigma}_{\mbx{k}} \right>
349: 	=\frac{\hat{\tilde{B}}_{\mbx{k}}}{\hat{\tilde{A}}_{\mbx{k}}}
350: 	\tilde{\tau}_{\mbx{k}}\label{rep}.
351: \end{equation}
352: %%%%%%%%%%%%%%
353: Consequently, $\left< \sigma_{\mbx{j}} \right>$ in Eq. (\ref{repair}) is given by
354: %%%%%%%%%%%%%%
355: \begin{equation}
356: 	\left< \sigma_{\mbx{j}} \right>=
357: 		\frac{1}{N}
358: 		\sum_{\mbx{k}}\sum_{\mbx{i}}
359: 		\frac{
360: 			\tau_{\mbx{i}}
361: 			\frac{1}{2\hat{\tilde{R}}_{\mbx{k}}}
362: 			\cos[
363: 				{{\mbx{k}}\cdot({\mbx{j}}-{\mbx{i}})}
364: 			]
365: 		}
366: 		{
367: 			\hat{\beta} \tilde{G}_{\mbx{k}}+\hat{h}
368: 				+\frac{1}{2\hat{\tilde{R}}_{\mbx{k}}}
369: 		}.\label{E1kaiseki}
370: \end{equation}
371: %%%%%%%%%%%%%%
372: In this paper, we regard Eq. (\ref{E1kaiseki}) as representing the restored image.
373: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
374: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
375: \subsection{Minimization of mean squared error}
376: In this subsection, we estimate hyperparameters $\hat{\beta}, \hat{h}$, and $\hat{\tilde{R}}_{\mbx{k}}$ by using the criterion defined by Eq. (\ref{E1kaiseki}) for minimizing the mean squared error between the original and restored images. 
377: The expectation of this mean squared error, $E_1$, is represented by
378: %%%%%%%%%%%%%%
379: \begin{equation}
380: 	E_1=
381: 		\left\|
382: 		\frac{1}{N}\sum_{\mbx{j}}
383: 			\left(
384: 				\xi_{\mbx{j}}
385: 				-\left<
386: 					\sigma_{\mbx{j}}
387: 				\right>
388: 			\right)^2
389: 	\right\|,\label{E1}
390: \end{equation}
391: %%%%%%%%%%%%%%
392: where $\left\| \cdot \right\|$
393: denotes the data average of a simultaneous distribution\\
394: 	$P(\mbx{\tau},\mbx{\xi})=
395: 	P_{out}({\mbx{\tau}}|{\mbx{\xi}})
396: 	P_s({\mbx{\xi}})$.
397: 
398: Applying the Fourier transformation to Eq. (\ref{E1}), we can derive the next representation.
399: %%%%%%%%%%%%%%
400: \begin{equation}
401: 		E_1=
402: 		\frac{1}{N}\sum_{\mbx{k}}
403: 		\left\|
404: 			\Big(
405: 				\tilde{\xi}_{\mbx{k}}
406: 				-\left<\tilde{\sigma}_{\mbx{k}}\right>
407: 			\Big)
408: 			\Big(
409: 				\tilde{\xi}_{-{\mbx{k}}}
410: 				-\left<\tilde{\sigma}_{-{\mbx{k}}}\right>
411: 			\Big)
412: 		\right\|. \label{EE1}
413: \end{equation}
414: %%%%%%%%%%%%%%
415: Since $P(\mbx{\tau},\mbx{\xi})$ is diagonalized by the Fourier transformation, we can easily calculate each ${\mbx{k}}$ as follows,
416: %%%%%%%%%%%%%%
417: \begin{eqnarray}
418: 	& &\left\|
419: 		\Big(
420: 			\tilde{\xi}_{\mbx{k}}
421: 			-\left<
422: 				\tilde{\sigma}_{\mbx{k}}
423: 			\right>
424: 		\Big)
425: 		\Big(
426: 			\tilde{\xi}_{-\mbx{k}}
427: 			-\left<
428: 				\tilde{\sigma}_{-\mbx{k}}
429: 			\right>
430: 		\Big)
431: 		\right\| \\
432: 	&=&	\frac{1}{Z}
433: 		\int\int
434: 			d\tilde{\xi}_{\mbx{k}}
435: 			d\tilde{\tau}_{\mbx{k}}
436: 			\left|
437: 				\tilde{\xi}_{\mbx{k}}
438: 				-\frac{\hat{\tilde{B}}_{\mbx{k}}}{\hat{\tilde{A}}_{\mbx{k}}}
439: 				\tilde{\tau}_{\mbx{k}}
440: 			\right|^2\nonumber \\
441: 	& &\times\exp
442: 		\Bigg[
443: 			-A_{\mbx{k}}
444: 				\left|
445: 					\tilde{\xi}_{\mbx{k}}
446: 					-\frac{\tilde{B}_{\mbx{k}}}{\tilde{A}_{\mbx{k}}}
447: 					\tilde{\tau}_{\mbx{k}}
448: 				\right|^2\nonumber \\
449: 	& &~~~~~~~~~~-
450: 				\left(
451: 					\tilde{B}_{\mbx{k}}
452: 					-\frac{\tilde{B}_{\mbx{k}}^2}{\tilde{A}_{\mbx{k}}}
453: 				\right)
454: 				\left|
455: 					\tilde{\tau}_{\mbx{k}}
456: 				\right|^2
457: 		\Bigg]\\
458: 	&=&\frac{1}{2\tilde{A}_{\mbx{k}}}+
459: 		\left(
460: 			\frac{\tilde{B}_{\mbx{k}}}{\tilde{A}_{\mbx{k}}}
461: 			-\frac{\hat{\tilde{B}}_{\mbx{k}}}{\hat{\tilde{A}}_{\mbx{k}}}
462: 		\right)^2
463: 		\frac{1}{
464: 				2\tilde{B}_{\mbx{k}}
465: 				-\frac{2\tilde{B}_{\mbx{k}}^2}{\tilde{A}_{\mbx{k}}}
466: 		},\label{11}
467: \end{eqnarray}
468: %%%%%%%%%%%%%%
469: where $\tilde{A}_{\mbx{k}}$, $\tilde{B}_{\mbx{k}}$, and $Z$ substitute for
470: %%%%%%%%%%%%%%
471: \begin{equation}
472: 	\tilde{A}_{\mbx{k}}=\beta \tilde{G}_{\mbx{k}} + h
473: 		+\frac{1}{2\tilde{R}_{\mbx{k}}},~~~~~
474: 	\tilde{B}_{\mbx{k}} = \frac{1}{2\tilde{R}_{\mbx{k}}},
475: \end{equation}
476: %%%%%%%%%%%%%%
477: %%%%%%%%%%%%%%
478: \begin{eqnarray}
479: 	Z
480: 	&=&	\int\int
481: 			d\tilde{\xi}_{\mbx{k}}
482: 			d\tilde{\tau}_{\mbx{k}}
483: 		\exp
484: 		\Bigg[
485: 			-\tilde{A}_{\mbx{k}}
486: 				\left|
487: 					\tilde{\xi}_{\mbx{k}}
488: 					-\frac{\tilde{B}_{\mbx{k}}}{\tilde{A}_{\mbx{k}}}
489: 					\tilde{\tau}_{\mbx{k}}
490: 				\right|^2\nonumber \\
491: 	& &~~~~~~~~~~~~~~~~~~~~~~~~~-
492: 				\left(
493: 					\tilde{B}_{\mbx{k}}
494: 					-\frac{\tilde{B}_{\mbx{k}}^2}{\tilde{A}_{\mbx{k}}}
495: 				\right)
496: 				\left|
497: 					\tilde{\tau}_{\mbx{k}}
498: 				\right|^2
499: 		\Bigg].
500: \end{eqnarray}
501: %%%%%%%%%%%%%%
502: Therefore, the mean squared error is represented by
503: %%%%%%%%%%%%%%
504: \begin{eqnarray}
505: 	& &E_1= 
506: 		\frac{1}{2N}\sum_{\mbx{k}}
507: 	\Biggl[
508: 		\frac{1}{\beta \tilde{G}_{\mbx{k}}+h +\frac{1}{2\tilde{R}_{\mbx{k}}}}
509: 		\nonumber \\
510: 	& &+\left(
511: 			\frac{\frac{1}{2\tilde{R}_{\mbx{k}}}}
512: 				{\beta \tilde{G}_{\mbx{k}} +h +\frac{1}{2\tilde{R}_{\mbx{k}}}}
513: 			-\frac{\frac{1}{2\hat{\tilde{R}}_{\mbx{k}}}}
514: 				{\hat{\beta} \tilde{G}_{\mbx{k}}+\hat{h}
515: 				+\frac{1}{2\hat{\tilde{R}}_{\mbx{k}}}}
516: 		\right)^2\nonumber \\
517: 	& &
518: 		~~~~~~~~~~~~~~\times\frac{
519: 				\beta \tilde{G}_{\mbx{k}}
520: 				+h +\frac{1}{2\tilde{R}_{\mbx{k}}}
521: 		}
522: 		{
523: 			\frac{1}{2\tilde{R}_{\mbx{k}}}
524: 			\left(
525: 				\beta \tilde{G}_{\mbx{k}}+h
526: 			\right)
527: 		}
528: 		\Biggr].\label{Eend}
529: \end{eqnarray}
530: %%%%%%%%%%%%%%
531: When the conditions 
532: %%%%%%%%%%%%%%
533: \begin{equation}
534: \hat{\beta}=\beta, ~\hat{h}=h, {\rm and} ~\hat{{\tilde{R}}}_{\mbx{k}} = {\tilde{R}}_{\mbx{k}},
535: ~\label{saitekipara}
536: \end{equation}
537: %%%%%%%%%%%%%%
538: hold, the mean squared error takes the following minimum:
539: %%%%%%%%%%%%%%
540: \begin{equation}
541: 	E_{1_{min}}
542: 	= \frac{1}{2N}\sum_{\mbx{k}}
543: 	\left[
544: 		\frac{1}{\beta \tilde{G}_{\mbx{k}}+h +\frac{1}{2\tilde{R}_{\mbx{k}}}}
545: 	\right]\label{saitei},
546: \end{equation}
547: %%%%%%%%%%%%%%
548: More precisely, 
549: when the ratio $\hat{\beta}: \hat{h}:\hat{\tilde{R}}_{\mbx{k}}$
550: is equal to the ratio ${\beta}:{h}:{\tilde{R}}_{\mbx{k}}$, 
551: $E_1$ is the minimum given by Eq. (\ref{saitei}). 
552: Note that this equation represents the limit of the restoration.
553: 
554: Likewise, we can derive the mean squared error, $E_2$, between the original image and the distorted image as being represented by
555: %%%%%%%%%%%%%%
556: \begin{eqnarray}
557: 	E_2&=&\left\|
558: 		\frac{1}{N}\sum_{\mbx{j}}
559: 			\left(
560: 				\tilde{\xi}_{\mbx{j}}
561: 				-\tilde{\tau}_{\mbx{j}}
562: 			\right)^2
563: 	\right\|\label{E2} \\
564: 	&=&\frac{1}{N}
565: 		\sum_{\mbx{k}}{\tilde{R}_{\mbx{k}}}
566: 	=\frac{1}{N}\sum_{\mbx{i}}\tilde{R}_{\mbx{i},\mbx{i}}=b^2
567: 	\label{kaisekiE2}
568: \end{eqnarray}
569: %%%%%%%%%%%%%%
570: $E_2$ depends only on the diagonal element of the noise covariance matrix.
571: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
572: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
573: \subsection{Maximization of the marginal likelihood}
574: The minimized mean squared error criterion generally cannot be used since the unknown original image itself is needed to evaluate the squared error. If we already had the original image, there would be no need to restore the distorted image. Hence, in this paper, we argue for using the maximization of the marginal likelihood. 
575: 
576: As shown in Eqs. (\ref{prior}) and (\ref{noize}), we already know the source prior and noise probabilities. Using both probabilities, we can derive $P({\mbx{\tau}})$:
577: 
578: %%%%%%%%%%%%%%
579: \begin{equation}
580: P({\mbx{\tau}})=\int d{\mbx{S}}
581: 	P_{\rm{out}} (\mbx{\tau}|\mbx{S};\hat{R})P(\mbx{S};\hat{\beta},\hat{h}).\label{syuhen1}
582: \end{equation}
583: %%%%%%%%%%%%%%
584: $P({\mbx{\tau}})$ is called the marginal likelihood.
585: 
586: Maximization of the marginal likelihood is used to set the hyperparameters, $\hat{\beta}, \hat{h}$, and $\hat{R}$, to use to obtain the maximum value of $P(\mbx{\tau})$ for distorted image $\mbx{\tau}$.
587: 
588: 
589: With Eqs. (\ref{Zprior}) and (\ref{Znoize}), the marginal likelihood is given by
590: %%%%%%%%%%%%%%
591: \begin{equation}
592: P({\mbx{\tau}})=\frac{Z_{\rm{posterior}}(\hat{\beta},\hat{h},\hat{R})}
593: 			{Z_{\rm{noise}}(\hat{R})Z_{\rm{prior}}(\hat{\beta},\hat{h})},
594: \end{equation}
595: %%%%%%%%%%%%%%
596: where
597: %%%%%%%%%%%%%%
598: \begin{eqnarray}
599: 	& &Z_{\rm{posterior}}(\hat{\beta},\hat{h},\hat{R})
600: 	=\prod_{\mbx{k}}\int d\tilde{S}_{\mbx{k}}\exp(-\hat{\tilde{H}}_{eff})\\
601: 	&=&\pi^{\frac{N}{2}}
602: 			\prod_{\mbx{k}}{\frac{1}{\sqrt{\hat{\tilde{A}}_{\mbx{k}}}}}
603: 		\exp
604: 				\left[
605: 					-(\hat{\tilde{B}}_{\mbx{k}}
606: 					 - \frac{\hat{\tilde{B}}^2_{\mbx{k}}}{\hat{\tilde{A}}_{\mbx{k}}})
607: 						\left|\tilde{\tau}_{\mbx{k}}\right|^2
608: 				\right].\label{Zpost}
609: \end{eqnarray}
610: %%%%%%%%%%%%%%
611: 
612: Because $\ln$ is a monotonically increasing function that maximizes the logarithmic marginal likelihood, maximizing $\ln P(\mbx{\tau})$ is equivalent to maximizing $P(\mbx{\tau})$.
613: 
614: The logarithmic marginal likelihood $\ln P(\mbx{\tau})$ can be written as,
615: %%%%%%%%%%%%%%
616: \begin{eqnarray}
617: 	& &\ln\left(P({\mbx{\tau}})\right)
618: 	=\ln
619: 		\left(
620: 			\frac{Z_{\rm{posterior}}(\hat{\beta},\hat{h},\hat{R})}
621: 			{Z_{\rm{noise}}(\hat{R})Z_{\rm{prior}}(\hat{\beta},\hat{h})}
622: 		\right)\nonumber \\
623: 	&=&-\frac{1}{2}\sum_{\mbx{k}}\ln
624: 		\left(
625: 			\hat{\beta} \tilde{G}_{\mbx{k}}+\hat{h}+\frac{1}{2\hat{\tilde{R}}_{\mbx{k}}}
626: 		\right)
627: 		-\frac{N}{2}\ln(2\pi)\nonumber \\
628: 	& &-\frac{1}{2}\sum_{\mbx{k}}\ln 
629: 	\left(\hat{\tilde{R}}_{\mbx{k}}\right)
630: 	+\frac{1}{2}\sum_{\mbx{k}}\ln
631: 		\left(
632: 			\hat{\beta} G_{\mbx{k}} + \hat{h}
633: 		\right)\nonumber \\
634: 	& &-\sum_{\mbx{k}}
635: 		\frac{
636: 			\left(\hat{\beta} \tilde{G}_{\mbx{k}}
637: 			 + \hat{h}\right)\frac{1}{2\hat{\tilde{R}}_{\mbx{k}}}
638: 		}
639: 		{
640: 			\hat{\beta} \tilde{G}_{\mbx{k}} + \hat{h} +\frac{1}{2\hat{\tilde{R}}_{\mbx{k}}}
641: 		}
642: 		\left|\tilde{\tau}_{\mbx{k}}\right|^2.\label{syuhen2}
643: \end{eqnarray}
644: %%%%%%%%%%%%%%
645: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
646: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
647: \section{result}
648: In this section, we consider a method to restore an image distorted by spatially correlated noise by means of the spatially uncorrelated noise model. To do this, we replace $R$ as one of the hyperparameters:
649: %%%%%%%%%%%%%%
650: \begin{equation}
651: \hat{R}_{\mbx{i},\mbx{j}}=\hat{r}\delta_{\mbx{i},\mbx{j}},
652: ~~~~~~~~~~~\hat{\tilde{R}}_{\mbx{k}}=\hat{r}.\label{kinjiF}
653: \end{equation}
654: %%%%%%%%%%%%%%
655: We can now write Eq. (\ref{E1kaiseki}) as
656: %%%%%%%%%%%%%%
657: \begin{equation}
658: 		\left< \sigma_{\mbx{j}}^{app} \right>
659: 	=\frac{1}{N}
660: 		\sum_{\mbx{k}}\sum_{\mbx{i}}
661: 		\frac{
662: 			\tau_{\mbx{i}}
663: 			\frac{1}{2r}
664: 			\cos[
665: 				{{\mbx{k}}\cdot({\mbx{j}}-{\mbx{i}})}
666: 			]
667: 		}
668: 		{
669: 			\hat{\beta} \tilde{G}_{\mbx{k}}+\hat{h}
670: 				+\frac{1}{2\hat{r}}
671: 		}.\label{junsaiteki}
672: \end{equation}
673: %%%%%%%%%%%%%%
674: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
675: %%%%%%%%%
676: \begin{figure}[ht]
677: 	\centering
678: 	\includegraphics[width=8.5cm,clip]{graf1_3.eps}
679: \caption{
680: Mean squared error between original and each restored image. Horizontal axis denotes $a$ from Eq. (\ref{soukan}); vertical axis denotes mean squared error. (A) Optimum decode given by Eq. (\ref{saitei}). (B) Restored using criterion of minimum restored error. (C) Restored using $\hat{r}$ estimated by maximizing marginal likelihood. (D) Restored using $\hat{\beta}, \hat{h}$, and $\hat{r}$, all estimated by maximizing marginal likelihood. (E) Error before decoding given by Eq. (\ref{kaisekiE2}).
681: }
682: \label{kekkagraf}
683: \end{figure}
684: %%%%%%%%%
685: \begin{figure}[ht]
686: 	\centering
687: 	\includegraphics[width=8cm,clip]{r.eps}
688: \caption{
689: 	Estimated $\hat{r}$. 
690: 	Horizontal axis denotes $a$ from Eq. (\ref{soukan}); 
691: 	vertical axis denotes average estimated 
692: 	$\hat{r}$ 
693: 	obtained by maximizing marginal likelihood 
694: 	or optimal $\hat{r}$ 
695: 	obtained using criterion of minimum restored error. 
696: 	Solid line (B) denotes $\hat{r}$ used in Fig. \ref{kekkagraf}(B), 
697: 	dashed line (C) denotes $\hat{r}$ used in Fig. \ref{kekkagraf}(C), 
698: 	and dotted line (D) denotes $\hat{r}$ used in Fig. \ref{kekkagraf}(D).
699: }
700: \label{reps}
701: \end{figure}
702: %%%%%%%%%
703: %%%%%%%%%
704: \begin{figure}[ht]
705: 	\centering
706: 	\includegraphics[width=8cm,clip]{hb.eps}
707: \caption{
708: 	Estimated $\hat{h}$ (dotted line) and $\hat{\beta}$ (dashed line). 
709: 	Both values were averaged with respect to 100 artificially generated images. 
710: 	Horizontal axis denotes $a$ from Eq. (\ref{soukan}); 
711: 	vertical axis denotes average estimated $\hat{h}$ or $\hat{\beta}$
712: 	or actual $h$ or $\beta$. 
713: 	Solid line denotes actual $h (=\beta=1.0$).
714: }
715: \label{hbeps}
716: \end{figure}
717: %%%%%%%%%
718: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
719: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
720: Figure \ref{kekkagraf} shows the analytical 
721: and simulated mean squared errors between the original image and each restored image, 
722: Fig. \ref{reps} shows the estimated $\hat{r}$, 
723: and Fig. \ref{hbeps} show the estimated parameters $\hat{h}$ and $\hat{\beta}$. 
724: We used 100 artificially generated two-dimensional $N=16^2$ images with hyperparameters $\beta=1.0$, $h=1.0$, $b=0.75$, and $\kappa=3.0$. 
725: The horizontal axes of these three figures denote $a$ from Eq. (\ref{soukan}); 
726: $a$ represents the magnitude of the spatial correlation of the noise ($a=0$ implies no spatial correlation). 
727: 
728: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
729: \subsection{Minimization of mean squared error}
730: %%%%%%%%% E %%%%%%%%%
731: Curve (E) in Fig. \ref{kekkagraf} represents error $E_2$ given by Eq. (\ref{kaisekiE2}), that is, the mean squared error between the original and distorted images. (The dashed line represents the analytical error, and the solid line represents the simulated results.) Because the numbers of pixels ($N=16^2$) and samples ($100$) were finite, the simulated results have some fluctuation. This fluctuation tends to asymptotically disappear as the numbers are increased.
732: 
733: %%%%%%%%% A %%%%%%%%%
734: Curve (A) in Fig. \ref{kekkagraf} represents minimum error $E_{1_{min}}$ given by Eq. (\ref{saitei}), that is, the mean squared error between the original and distorted images restored using the spatially correlated noise model with the best parameters. Hence, $E_{1_{min}}$ implies the limits of image restoration-no further error reduction is possible.
735: %%%%%%%%% B %%%%%%%%%
736: Curve (B) in Fig. \ref{kekkagraf} represents the error in an image restored using the uncorrelated noise model. The restoration was done using hyperparameter $\hat{r}$ from Eq. (\ref{E1}), which was acquired minimizing the mean squared error. (The dashed line represents the analytical results, and the solid line represents the simulated results.)
737: Curve (B) in Fig. \ref{reps} shows the optimal values of $\hat{r}$ 
738: by minimizing the mean squared error.
739: 
740: Note that when $a=0.0$, curve (A) in Fig. \ref{kekkagraf} is equal to curve (B).
741: %%%%%%%%% B %%%%%%%%%
742: Curve (B) in Fig. \ref{kekkagraf} shows the optimum performance 
743: when using the minimization of the mean squared error as the criterion. 
744: However, as mentioned, the hyperparameters cannot be estimated in practice by minimizing the mean squared error. 
745: Therefore, we will discuss how the error can be reduced 
746: by maximizing the logarithmic marginal likelihood in the next subsection.
747: 
748: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
749: \subsection{Maximization of the marginal likelihood}
750: Substituting Eq. (\ref{syuhen2}) for Eq. (\ref{kinjiF}), we get
751: %%%%%%%%%%%%%%
752: \begin{eqnarray}
753: 	\ln\left(P_{app}({\mbx{\tau}})\right)
754: 		&=&-\frac{1}{2}\sum_{\mbx{k}}\ln
755: 		\left(
756: 			\hat{\beta} \tilde{G}_{\mbx{k}}+\hat{h}+\frac{1}{2\hat{r}}
757: 		\right)-\frac{N}{2}\ln(2\pi)\nonumber \\
758: 	& &	-\frac{N}{2}\ln \left(\hat{r}\right)
759: 		+\frac{1}{2}\sum_{\mbx{k}}\ln
760: 		\left(
761: 			\hat{\beta} \tilde{G}_{\mbx{k}} + \hat{h}
762: 		\right)\nonumber \\
763: 	& &-\sum_{\mbx{k}}
764: 		\frac{
765: 			\left(\hat{\beta} \tilde{G}_{\mbx{k}} + \hat{h}\right)\frac{1}{2\hat{r}}
766: 		}
767: 		{
768: 			\hat{\beta} \tilde{G}_{\mbx{k}} + \hat{h} +\frac{1}{2\hat{r}}
769: 		}
770: 		\left|\tilde{\tau}_{\mbx{k}}\right|^2.\label{syuhenapp}
771: \end{eqnarray}
772: %%%%%%%%%%%%%%
773: We can derive the conditions necessary for the extremal $P_{app}({\mbx{\tau}})$, and we can represent them using the following nonlinear simultaneous equations.
774: %%%%%%%%%%%%%%
775: \begin{eqnarray}
776: 	\frac{1}{\hat{\beta}}
777: 	&=&	\left(
778: 			\sum_{\mbx{k}}
779: 				\frac{
780: 					\tilde{G}_{\mbx{k}}
781: 				}
782: 				{
783: 					\tilde{G}_{\mbx{k}}+\frac{\hat{h}}{\hat{\beta}}
784: 				}
785: 		\right)^{-1}
786: 		\Bigg[
787: 			\sum_{\mbx{k}}\frac{\tilde{G}_{\mbx{k}}}
788: 						{
789: 							\hat{\beta} \tilde{G}_{\mbx{k}} + \hat{h} + \frac{1}{2\hat{r}}
790: 						}\nonumber \\
791: 	& &	~~~~~~+\sum_{\mbx{k}}\left|\tau_{\mbx{k}}\right|^2
792: 			\frac{\tilde{G}_{\mbx{k}}}{2\hat{r}^2
793: 				\left(
794: 					\hat{\beta} \tilde{G}_{\mbx{k}} + \hat{h} + \frac{1}{2\hat{r}}
795: 				\right)^2}
796: 		\Bigg],\label{b}
797: \end{eqnarray}
798: %%%%%%%%%%%%%%
799: %%%%%%%%%%%%%%
800: \begin{eqnarray}
801: 	\hat{r}
802: 	&=&	\frac{1}{2N}
803: 			\sum_{\mbx{k}}\frac{\tilde{G}_{\mbx{k}}}
804: 						{
805: 							\hat{\beta} \tilde{G}_{\mbx{k}} + \hat{h} + \frac{1}{2\hat{r}}
806: 						}\nonumber \\
807: 	& &~~~~~~~~~+\frac{1}{N}
808: 		\sum_{\mbx{k}}\left|\tilde{\tau}_{\mbx{k}}\right|^2
809: 			\frac{(\hat{\beta} \tilde{G}_{\mbx{k}} + \hat{h})^2}
810: 				{
811: 				\left(
812: 					\hat{\beta} \tilde{G}_{\mbx{k}} + \hat{h} + \frac{1}{2\hat{r}}
813: 				\right)^2},\label{q}
814: \end{eqnarray}
815: %%%%%%%%%%%%%%
816: %%%%%%%%%%%%%%
817: \begin{eqnarray}
818: 	\frac{1}{\hat{h}}
819: 	&=&	\left(
820: 			\sum_{\mbx{k}}
821: 				\frac{
822: 						1
823: 				}
824: 				{
825: 					1+\frac{\hat{\beta} \tilde{G}_{\mbx{k}}}{h}
826: 				}
827: 		\right)^{-1}
828: 		\Bigg[
829: 			\sum_{\mbx{k}}\frac{1}
830: 						{
831: 							\hat{\beta} \tilde{G}_{\mbx{k}} + \hat{h} + \frac{1}{2\hat{r}}
832: 						}\nonumber \\
833: 	& &~~~~~~+\sum_{\mbx{k}}\left|\tau_{\mbx{k}}\right|^2
834: 			\frac{1}{2\hat{r}^2
835: 				\left(
836: 					\hat{\beta} \tilde{G}_{\mbx{k}} + \hat{h} + \frac{1}{2\hat{r}}
837: 				\right)^2}
838: 		\Bigg].\label{h}
839: \end{eqnarray}
840: %%%%%%%%%%%%%%
841: We can then determine the convergence point by means of an iterative calculation.
842: 
843: %%%%%%%%% C %%%%%%%%%
844: We next determined the unknown hyperparameter, $\hat{r}$, by maximizing the marginal likelihood, assuming that $\hat{\beta} = \beta$ and $\hat{h}=h$. Thus, we restricted ourselves to the case where the generation model of the image was already known, as was $\beta$ and $h$.
845: Curve (C) in Fig. \ref{kekkagraf} shows the mean squared error when the image was 
846: restored using the estimated $\hat{r}$. 
847: Curve (C) in Fig. \ref{reps} shows the estimated $\hat{r}$.
848: 
849: Note that the difference between
850: curve (B) in Fig. \ref{reps}, the limit of the restoration,
851: and curve (C), obtained by minimizing the mean squared error 
852: increased with the noise correlation parameter, $a$. 
853: Therefore, the error obtained by maximizing the marginal likelihood 
854: (curve (C) in Fig. \ref{kekkagraf}) increases with the noise correlation parameter, $a$.
855: This implies that without knowing the noise model, estimating the hyperparameters by maximizing the marginal likelihood does not work well. That is, the conventional uncorrelated noise model cannot cope with spatially correlated noise.
856: 
857: %%%%%%%%% D %%%%%%%%%
858: To confirm this, we used maximization of the marginal likelihood to estimate 
859: $\hat{\beta}$ and $\hat{h}$-the hyperparameters of the image-generation probability-as well as $\hat{r}$. 
860: Curve (D) in Fig. \ref{kekkagraf} shows the mean squared error estimated using hyperparameters $\hat{\beta}$, $\hat{h}$, and $\hat{r}$. 
861: Curve (D) in Fig. \ref{reps} shows the obtained $\hat{r}$, and Fig. \ref{hbeps} shows the 
862: obtained $\hat{\beta}$ and $\hat{h}$. 
863: The error shown by curve (D) in Fig. \ref{kekkagraf} is much greater than that shown by curve (C). 
864: 
865: %%%%%%%%%%%%%%%%%%
866: \begin{figure}[ht]
867: 	\centering
868: 	\includegraphics[width=8cm,clip]{1yuudo.eps}
869: \caption{
870: 	Mean squared error and marginal likelihood.
871: 	Horizontal axis denotes hyperparameter $\hat{r}$;
872: 	left vertical axis denotes mean squared error (B) given by Eq.(\ref{Eend}); 
873: 	right vertical axis denotes logarithmic marginal likelihood (C)
874: 	given by Eq.(\ref{syuhenapp}).
875: 	The parameters were 
876: 	$a=1.0$, $\beta=1.0$, $h = 1.0$, and $\kappa = 3.0$.
877: 	Those used for (B) correspond to those for curve (B) in 
878: 	Fig. \ref{kekkagraf} when $a=1.0$.
879: 	Likewise, those used for (C) correspond to those for curve (C) in 
880: 	Fig. \ref{kekkagraf}(C) when $a=1.0$.
881: }
882: \label{yuudo}
883: \end{figure}
884: %%%%%%%%%%%%%%%%%%
885: %%%%%%%%% END %%%%%%%%%
886: 
887: Next, we demonstrate that the logarithmic marginal likelihood, curve (C) in
888: Fig. \ref{kekkagraf}, does not have a local maximum and that
889: the iteratively obtained solution of Eq. (\ref{q}) corresponds to the global maximum. 
890: Curve (B) in Fig. \ref{yuudo} shows the mean squared error, 
891: $E_1$, given by Eq.(\ref{Eend}), and curve (C) 
892: shows the logarithmic marginal likelihood, $\ln(P_{app} (\mbx{\tau}))$,
893: given by Eq. (\ref{syuhenapp}). 
894: The horizontal axis denotes $\hat{r}$. 
895: Curve (B) shows that $E_1$ takes a minimum value, $E_{1}=0.12$, at $\hat{r}=3.2$. 
896: The parameters used for curve (B) correspond to those for  
897: curve (B) in Fig. \ref{kekkagraf} when $a=1.0$. 
898: Curve (C) shows that that the logarithmic marginal likelihood 
899: takes a maximum value, $E_{1}=0.28$, at $\hat{r}=0.2$. 
900: The parameters used for curve (C) correspond to those for 
901: curve (C) in Fig. \ref{kekkagraf} when $a=1.0$. 
902: The important point is that the logarithmic marginal likelihood, 
903: curve (C) in Fig. \ref{yuudo}, does not have a local maximum. 
904: %%%%%%%%%
905: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
906: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
907: \subsection{Sample of images}
908: Next, we show the practical significance of these differences
909: for a typical set of artificial images. 
910: %%%%%%%%%%%%%%
911: \begin{figure}[ht]
912: 	\centering
913: 	\includegraphics[width=7.5cm,clip]{art64.eps}
914: 	\caption{Artificial images. $N=64^2$.
915: 	}
916: 	\label{art}
917: \end{figure}
918: %%%%%%%%%%%%%%
919: Figure \ref{art}(a) shows an original image generated using hyperparameters $\beta=0.5$ and $h=1.0^{-4}$. Figure \ref{art}(c) shows the same image after it was distorted by the noise shown in Fig. \ref{art}(b) ($a=1.0, b=0.75$, and $\kappa=3.0$). 
920: The mean squared error, $E_2$, between the original and distorted images was 0.57. Figure \ref{art}(d) shows the image after it was restored using the complete restoration model: the generation model and noise model were consistent with the original models, and optimum values were used for the hyperparameters. 
921: When $a=1.0$, curve (A) in Fig. \ref{kekkagraf} corresponds to Fig. \ref{art}(d),
922: and the restored mean squared error, $E_1$, is 0.27. 
923: Figure \ref{art}(e) shows the image after it was restored using an uncorrelated noise model with hyperparameters obtained by maximizing the marginal likelihood. When $a=1.0$, curve (D) in Fig. \ref{kekkagraf} corresponds to Fig. \ref{art}(e), and the restored mean squared error is 0.45. 
924: The image in Fig. \ref{art}(d) resembles the original one, Fig. \ref{art}(a), while the one in Fig. \ref{art}(e) is similar to the distorted one in Fig. \ref{art}(c) and looks slightly out of focus.
925: 
926: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
927: Furthermore, we show the results when natural images were used.
928: %%%%%%%%%%%%%%
929: \begin{figure}[ht]
930: 	\centering
931: 	\includegraphics[width=6.74cm,clip]{natural2.eps}
932: 	\caption{Natural images. $N=64^2$.
933: 	}
934: 	\label{natural}
935: \end{figure}
936: %%%%%%%%%%%%%%
937: Figure \ref{natural} shows the natural images and a table of the estimated parameters. 
938: We used $b = 0.75$ and $\kappa = 3.0$ for the noise parameters and three values for $a$: 0.0, 0.5, and 1.0. 
939: The three images on the left are distorted, 
940: and those on the right are the same images restored using 
941: $\hat{\beta}$, $\hat{h}$, and $\hat{r}$ estimated using 
942: the maximized marginal likelihood. 
943: In this simulation as well, $E_1$ increased with the noise correlation parameter, $a$. 
944: This tendency is reflected in the three restored images. For $a=1.0$, the lower right image, the noise looks like a stain that was not completely removed. 
945: 
946: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
947: \subsection{Discussion}
948: Here, we consider why line (A) in Fig. \ref{kekkagraf} tends to decrease 
949: as $a$ approaches $1.0$. This phenomenon appears to be universal, and also occurs with another parameter set as shown in Fig. \ref{a_eps}(ii). 
950: For reference, we also show line (i) which is based on the parameter set that was used in Fig. \ref{kekkagraf}(A) with only $N$ changed to $N=64^2$.
951: (Note that line (A) is independent of $N$). 
952: %%%%%%%%%
953: \begin{figure}[ht]
954: 	\centering
955: 	\includegraphics[width=8cm,clip]{a.eps}
956: \caption{
957: 	Mean squared error between original image and restored image. 
958: 	The horizontal axis denotes $a$ from Eq. (\ref{soukan}), and the	vertical axis denotes the mean squared error. 
959: 	As a criterion, the optimal decode given by Eq. (\ref{saitei}) is used. 
960: 	The parameters $b=0.75$, and $N=64^2$ are fixed. 
961: 	For the solid line (i), 
962: 	the other parameters were set as $\beta=1.0, h=1.0$, and $\kappa=3.0$. 
963: 	For the dashed line (ii), 
964: 	the other parameters were set as $\beta=0.5, h=10^{-4}$, and $\kappa=7.0$. 
965: }
966: \label{a_eps}
967: \end{figure}
968: %%%%%%%%%
969: 
970: For the original image, we used the Gaussian model represented by Eq. (\ref{prior}), 
971: which has neighboring interactions on each pixel. For the noise image, we used the Gaussian model represented by Eq. (\ref{noize}), 
972: in which the interaction is Gaussian functionally decreased. 
973: We expected the difference 
974: between these two models to become conspicuously large near $a=1.0$, and this difference between the models would enable successful restoration near $a=1.0$. 
975: Figure \ref{samp_eps} shows the typical images about Fig. \ref{a_eps}(ii). 
976: %%%%%%%%%
977: \begin{figure}[ht]
978: 	\centering
979: 	\includegraphics[width=8cm,clip]{samp.eps}
980: \caption{
981: 	Typical images about Fig. \ref{a_eps} (ii). $N=64^2$. 
982: 	Parameters for the original images were $\beta=0.5$ and $h=10^{-4}$. The three original prepared images were the same. 
983: 	Parameters for noise images were $\beta=0.5, h=10^{-4}, \kappa=7.0$, and $b=0.75$. Only $a$ was changed. 
984: 	Note that when $a=0.55$, the original and noise images were visually similar, and the restoration quality was lower in this case. 
985: }
986: \label{samp_eps}
987: \end{figure}
988: %%%%%%%%%
989: The distorted image was most difficult to restore in Fig. \ref{samp_eps} when $a=0.55$, as shown in Fig. \ref{a_eps}(ii).
990: In this case, the original and the noise images were visually similar; 
991: more precisely, the correlation lengths of the two images were similar. 
992: Thus, to successfully restore a distorted image, 
993: it is desirable that the statistical properties of the original image 
994: should be different from those of the noise image. 
995: 
996: Here, we summarize our explanation of why line (A) in Fig. \ref{kekkagraf}
997: decreases near $a=1.0$. 
998: As $a$ approaches $1.0$, the mean squared error decreases because the difference 
999: of the statistical properties, especially the correlation length, between the two images enables easier restoration.
1000: Conversely, the mean squared error is greater
1001: at $a=0.55$ on line (ii) in Fig. \ref{a_eps} 
1002: because 
1003: the correlation-length similarity between the images makes it difficult to distinguish between them, 
1004: and that also makes it difficult to restore the distorted image. 
1005: To ensure successful restoration, 
1006: the correlation length of the original image should be different from that of the noise image, which is the case when $a=0.00$ and $a=1.00$ in Fig. \ref{samp_eps}.
1007: 
1008: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1009: \section{Conclusion}
1010: We investigated the use of the Bayesian inference to restore images under conditions of spatially correlated noise. We assumed that both the original image and the noise obeyed a Gaussian distribution composed of translational symmetric matrices. We used the Fourier transformation to diagonalize the covariance matrices, which enabled us to apply various forms of statistical analysis. 
1011: We obtained the expected value of a restored image and obtained 
1012: the optimal analytical hyperparameters by minimizing the mean squared error and used 
1013:  these hyperparameters to determine the generation and noise models. 
1014: Furthermore, we discussed whether 
1015: the conventional spatially uncorrelated noise model could cope with the spatially correlated noise or not. 
1016: We used two methods to estimate the hyperparameters: minimizing the mean squared error and maximizing the marginal likelihood. The difference between the errors obtained using the two methods increased with the noise correlation parameter. The restoration error was larger when we used the hyperparameters obtained using the maximization of the marginal likelihood method. 
1017: Thus, the conventional spatially uncorrelated noise model could not cope with the spatially correlated noise. 
1018: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1019: \begin{thebibliography}{99}
1020: %%%%%%%%%%%%%%
1021: \bibitem{morina1}
1022: \newblock R. Morina,
1023: \newblock {IEEE Trans. Image Process.}, {\bf 8}, 231 (1999).
1024: %%%%%%%%%%%%%%
1025: \bibitem{morina2}
1026: \newblock R. Morina and K. Katsaggelos,
1027: \newblock {IEEE Trans. Image Process.}, {\bf 8}, 231 (1999).
1028: %%%%%%%%%%%%%%
1029: \bibitem{inoue4}
1030: \newblock J. Inoue and K. Tanaka,
1031: \newblock {Phys. Rev. E}, {\bf 65},016125-1 (2002).
1032: %%%%%%%%%%%%%
1033: \bibitem{nisi1}
1034: \newblock H. Nishimori and K. Y. M .Wong,
1035: \newblock {Phys. Rev. E}, {\bf 60}, 132 (1999).
1036: %%%%%%%%%%%%%
1037: \bibitem{nisi2}
1038: \newblock H. Nishimori,
1039: \newblock {BUSSEI KENKYU. Jpn.}, {\bf 73}, 850 (2000).
1040: %%%%%%%%%%%%%%
1041: \bibitem{tanaka2}
1042: \newblock K. Tanaka and J. Inoue,
1043: \newblock {IEICE Trans. Inf. \& Syst}, E{\bf 84}-D, 546 (2002).
1044: %%%%%%%%%%%%%
1045: \bibitem{Geman1}
1046: \newblock H. Derin, H. Elliott, R. Cristi, and D. Geman,
1047: \newblock {IEEE Trans. Pattern Anal. \& Machine Intell.}, {\bf 6}, 707 (1984).
1048: %%%%%%%%%%%%%
1049: \bibitem{Geman2}
1050: \newblock S. Geman and D. Geman,
1051: \newblock {IEEE Trans. Pattern Anal. \& Machine Intell.}, {\bf 6}, 721 (1984).
1052: %%%%%%%%%%%%%%%
1053: \bibitem{inoue3}
1054: \newblock J. Inoue,
1055: \newblock {Phys. Rev. E}, {\bf 60}, 2547 (1999).
1056: %%%%%%%%%%%%%
1057: \bibitem{inoue2}
1058: \newblock J. Inoue,
1059: \newblock {Phys. Rev. E}, {\bf 63} 046114 (2001).
1060: %%%%%%%%%%%%%
1061: \bibitem{inoue1}
1062: \newblock J. Inoue,
1063: \newblock {Phys. Rev. E}, {\bf 64}, 036121 (2001).
1064: %%%%%%%%%%%%%%
1065: \bibitem{tanaka1}
1066: \newblock J. Inoue and K. Tanaka,
1067: \newblock {Neural Netw. Signal Process. XI}, 383 (2001).
1068: %%%%%%%%%%%%%%
1069: \end{thebibliography}
1070: %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
1071: \end{document}
1072: