0610:math0610221/FLRD.tex

1: \documentclass[a4paper,12pt]{article}%

2: \usepackage{amsmath}

3: \usepackage{amsfonts}

4: \usepackage{amssymb}

5: \usepackage[dvips]{graphicx}%

6: \setcounter{MaxMatrixCols}{30}

7: %TCIDATA{OutputFilter=latex2.dll}

8: %TCIDATA{Version=5.00.0.2570}

9: %TCIDATA{CSTFile=40 LaTeX article.cst}

10: %TCIDATA{Created=Saturday, February 04, 2006 16:47:20}

11: %TCIDATA{LastRevised=Monday, October 02, 2006 09:44:39}

12: %TCIDATA{<META NAME="GraphicsSave" CONTENT="32">}

13: %TCIDATA{<META NAME="SaveForMode" CONTENT="1">}

14: %TCIDATA{<META NAME="DocumentShell" CONTENT="Standard LaTeX\Blank - Standard LaTeX Article">}

15: \newtheorem{theorem}{Theorem}

16: \newtheorem{acknowledgement}[theorem]{Acknowledgement}

17: \newtheorem{algorithm}[theorem]{Algorithm}

18: \newtheorem{axiom}[theorem]{Axiom}

19: \newtheorem{case}[theorem]{Case}

20: \newtheorem{claim}[theorem]{Claim}

21: \newtheorem{conclusion}[theorem]{Conclusion}

22: \newtheorem{condition}[theorem]{Condition}

23: \newtheorem{conjecture}[theorem]{Conjecture}

24: \newtheorem{corollary}[theorem]{Corollary}

25: \newtheorem{criterion}[theorem]{Criterion}

26: \newtheorem{definition}[theorem]{Definition}

27: \newtheorem{example}[theorem]{Example}

28: \newtheorem{exercise}[theorem]{Exercise}

29: \newtheorem{lemma}[theorem]{Lemma}

30: \newtheorem{notation}[theorem]{Notation}

31: \newtheorem{problem}[theorem]{Problem}

32: \newtheorem{proposition}[theorem]{Proposition}

33: \newtheorem{remark}[theorem]{Remark}

34: \newtheorem{solution}[theorem]{Solution}

35: \newtheorem{summary}[theorem]{Summary}

36: \newenvironment{proof}[1][Proof]{\noindent\textbf{#1.} }{\ \rule{0.5em}{0.5em}}

37: \begin{document}

38:

39: \author{Andr\'{e} MAS$^{\ast}$, Besnik PUMO$^{\dag}\medskip$\\$\left(  ^{\ast}\right)  $ {\small Institut de Mod\'{e}lisation

40: Math\'{e}matique de Montpellier,}\\{\small cc 051, Place Eug\`{e}ne Bataillon,}\\{\small 34095, Montpellier Cedex 5, France,}\\{\small mas@math.univ-montp2.fr\medskip\ }\\$\left(  ^{\dag}\right)  $ {\small Unit\'{e} de Statistiques,}\\{\small UMR A 462 SAGAH,}\\{\small INH, Angers, France}}

41: \title{Functional linear regression with derivatives}

42: \date{}

43: \maketitle

44:

45: \begin{abstract}

46: We introduce a new model of linear regression for random functional inputs

47: taking into account the first order derivative of the data. We propose an

48: estimation method which comes down to solving a special linear inverse

49: problem. Our procedure tackles the problem through a double and synchronized

50: penalization. An asymptotic expansion of the mean square prevision error is

51: given. The model and the method are applied to a benchmark dataset of

52: spectrometric curves and compared with other functional models.

53:

54: \end{abstract}

55:

56: \textbf{Keywords} : Functional data, Linear regression model, Differential

57: operator, Penalization, Spectrometric curves.\bigskip

58:

59: \section{Introduction}

60:

61: Functional Data Analysis is a well-known area of modern statistics. Advances

62: in computer sciences make it now possible to collect data from an underlying

63: continuous-time processe, say $\left(  \xi_{t}\right)  _{t\geq0}$, at high

64: frequencies. The traditional point of view consisting in discretizing $\left(

65: \xi_{t}\right)  $ at $t_{1},...,t_{p}$ and studying it by classical

66: multidimensional tools is outperformed by interpolation methods (such as

67: splines or wavelets). These techniques provide the statistician with a

68: reconstructed curve on which inference may be carried out through what we may

69: call "functional models" i.e. versions of the classical multidimensional

70: models designed and suited for data that are curves. Thus, functional PCA,

71: ANOVA or Canonical Analysis -even density estimation for curves or processes

72: have been investigated. We refer to Ramsay, Silverman (1997, 2002), Bosq

73: (2000), Ferraty Vieu (2006) for monographs on functional data analysis.

74: Recently many authors focused on various versions of the regression model

75: introduced by Ramsay and Dalzell (1991) :%

76: \begin{equation}

77: y_{i}=\int_{0}^{T}X_{i}\left(  t\right)  \rho\left(  t\right)  dt+\varepsilon

78: _{i} \label{mod1}%

79: \end{equation}

80: where we assume that the sample $\left(  \left(  y_{1,}X_{1}\right)

81: ,...,\left(  y_{n},X_{n}\right)  \right)  $ is made of independent copies from

82: $\left(  y_{,}X\right)  .$ Each $X_{i}=\left(  X_{i}\left(  t\right)  \right)

83: _{t\in\left[  0,T\right]  }$ is a curve defined on the set $\left[

84: 0,T\right]  ,$ $T>0$, $y_{i}$ is a real number, $\varepsilon_{i}$ is a white

85: noise and $\rho$ is an unknown function to be estimated. In other words the

86: $X_{i}$'s are random elements defined on an abstract probability space and

87: taking values in a function space, say $\mathcal{F}.$ The vector space

88: $\mathcal{F}$ endowed with norm $\left\Vert \cdot\right\Vert _{\mathcal{F}}$

89: will be described soon.We refer for instance to Cardot, Mas, Sarda (2006) or

90: Cai, Hall (2006) for recent results.

91:

92: In this article we study a new (linear) regression model defined below derived

93: from (\ref{mod1}) and echoing the recent paper of Mas and Pumo (2006). The key

94: idea relies on the fact that most statisticians dealing with functional data

95: do not fully enjoy their functional properties. For instance in several models

96: integrals such as%

97: \[

98: \int X_{i}\left(  s\right)  X_{j}\left(  s\right)  ds

99: \]

100: are computed. The integral above is nothing but a scalar product. Nevertheless

101: derivatives were not given the same interest. Explicit calculations of

102: derivatives sometimes appear indirectly in kernel methods (when estimating the

103: derivatives of the density or the regression function) or through seminorms or

104: norms on $\mathcal{F}$. But surprisingly $X_{i}^{\prime}$ (or $X_{i}^{\left(

105: m\right)  }$) never appear in the models themselves whereas people dealing

106: with functional data often say that "derivatives contain much information,

107: sometimes more than the initial curves themselves". Our starting idea is the

108: following. Since in a functional data framework, the curve-data are

109: explicitely known and not just discretized, their derivatives may also be

110: explicitely computed. As a consequence these derivatives may be "injected" in

111: the model, which may enhance its prediction power. The reader is referred to

112: the forthcoming display (\ref{modele}) for an immediate illustration and to

113: Mas, Pumo (2006) for a first article dealing with a functional autoregressive

114: model including derivatives.

115:

116: The paper is rather theoretic even if it is illustrated by a real case study.

117: It is organized as follows. The next section provides the mathematical

118: material, dealing with Hilbert spaces and linear operators, then the model is

119: introduced. The next section is devoted to presenting the estimation method

120: and its stumbling stones. The main results are given before we focus on a real

121: case application to food industry. The last section contains the derivation of

122: the theorems.

123:

124: \section{About Hilbert spaces and linear operators}

125:

126: Silverman (1996) provided a theoretical framework for a smoothed PCA. Jim

127: Ramsay (2000) enlightened the very wide scope of differential equations in

128: statistical modelling. Our work is in a way based on this mathematically

129: involved article. We are aiming at proving that derivatives may be handled in

130: statistical models quite easily when the space $\mathcal{F}$ is well-chosen.

131:

132: The choice of the space $\mathcal{F}$ is crucial. We have to think that if

133: $X\in\mathcal{F}$, $X^{\prime}$ does not necessarily belong to $\mathcal{F}$

134: but to another space $\mathcal{F}^{\prime}$ that may be tremendously different

135: (larger) than $\mathcal{F}$. We decide to take $\mathcal{F}=W^{2,1},$ the

136: Sobolev space of order $\left(  2,1\right)  $ defined by

137: \[

138: W^{2,1}=\left\{  u\in L^{2}\left[  0,1\right]  ,u^{\prime}\in L^{2}\left[

139: 0,1\right]  \right\}

140: \]

141: for at least three reasons :

142:

143: \begin{itemize}

144: \item If $X\in\mathcal{F}$, $X^{\prime}\in L^{2}\left[  0,1\right]  $ which is

145: a well known space.

146:

147: \item Both spaces are Hilbert spaces as well as%

148: \[

149: W^{2,p}=\left\{  u\in L^{2}\left[  0,1\right]  ,u^{\left(  p\right)  }\in

150: L^{2}\left[  0,1\right]  \right\}  .

151: \]

152: This is of great interest for mathematical reasons : bases are denumerable,

153: projections operators are easy to handle, covariance operators admit spectral

154: representations, etc.

155:

156: \item The classical interpolation methods mentioned above (splines and

157: wavelets) provide estimates belonging to Sobolev spaces. So from a practical

158: point of view $W^{2,1}$ -and in general $W^{m,p},$ $\left(  m,p\right)

159: \in\mathbb{N}^{2},$ (see Adams and Fournier (2003) for definitions)- is a

160: natural space in which our curves should be imbedded.

161: \end{itemize}

162:

163: In the sequel $W^{2,1}$ will be denoted $W$ and $W^{2,0}=L^{2}$ will be

164: denoted $L$ for the sake of simplicity. We keep in mind that $W$ (resp. $L$)

165: could be replaced by a space of higher smoothness index : $W^{2,p}$ where

166: $p>1$ (resp. $W^{2,p-1}$). The spaces $W$ and $L$ are separable Hilbert spaces

167: endowed with scalar product :%

168: \begin{align*}

169: \left\langle u,v\right\rangle _{W}  &  =\int_{0}^{1}u\left(  t\right)

170: v\left(  t\right)  dt+\int_{0}^{1}u^{\prime}\left(  t\right)  v^{\prime

171: }\left(  t\right)  dt.\\

172: \left\langle u,v\right\rangle _{L}  &  =\int_{0}^{1}u\left(  t\right)

173: v\left(  t\right)  dt

174: \end{align*}

175: and with associated norms $\left\Vert \cdot\right\Vert _{W}$ and $\left\Vert

176: \cdot\right\Vert _{L}$. We refer to Ziemer (1989) or to Adams and Fournier

177: (2003) for monographs dedicated to Sobolev spaces. Obviously if we set

178: $Du=u^{\prime}$ then $D$ maps $W$ onto $L$ ($D$ is the ordinary differential

179: operator)$.$ Furthermore Sobolev's imbedding theorem ensures that (see Adams

180: and Fournier (2003) Theorem 4.12 p.85) that%

181: \[

182: \left\Vert Du\right\Vert _{L}\leq C\left\Vert u\right\Vert _{W}%

183: \]

184: (where $C$ is some constant which does not depend on $u$) i.e. $D$\textbf{ }is

185: a bounded operator from\textbf{ }$W$\textbf{ }to\textbf{ }$L.$ This is a

186: crucial point to keep in mind and the fourth reason why the functional space

187: was chosen to be $W^{2,1}$ : the differential operator $D$ may be viewed as a

188: continuous linear mapping from $W$ to $L$.

189:

190: Within all the paper and especially all along the proofs we will need basic

191: notions about operator theory. We recall a few important facts. A linear

192: mapping $T$ from a Hilbert space $H$ to another Hilbert space $H^{\prime}$ is

193: continuous whenever%

194: \begin{equation}

195: \left\Vert T\right\Vert _{\infty}=\sup_{x\in H}\frac{\left\Vert Tx\right\Vert

196: _{H^{\prime}}}{\left\Vert x\right\Vert _{H}}<+\infty. \label{normsup}%

197: \end{equation}

198: The adjoint of operator $T$ will be classically denoted $T^{\ast}$. Some

199: finite rank operators are defined by means of the tensor product : if $u$ and

200: $v$ belong to $H$ and $H^{\prime}$ respectively $u\otimes_{H}v$ is the

201: operator defined on $H$ by, for all $h\in H$~:%

202: \[

203: \left(  u\otimes_{H}v\right)  \left(  h\right)  =\left\langle u,h\right\rangle

204: _{H}v.

205: \]

206:

207:

208: \textbf{Compact operators} : Amongst linear operators the class of compact

209: operators is one of the best known. Compact operators generalize matrix to the

210: infinite-dimensional setting and feature nice properties. The general

211: definition of compact operators may be found in Dunford Schwartz (1988) or

212: Gohberg, Goldberg and Kaashoek (1991) for instance. By $\mathcal{C}_{H}$

213: (resp. $\mathcal{C}_{HH^{\prime}}$) we denote the space of compact operators

214: on the Hilbert space $H$ (resp. mapping the Hilbert space $H$ onto $H^{\prime

215: }$). If $T$ is a compact operator from a Hilbert space $H_{1}$ to another

216: Hilbert space $H_{2},$ $T$ admits the Schmidt decomposition :%

217: \begin{equation}

218: T=\sum_{k\in\mathbb{N}}\mu_{k}\left(  u_{k}\otimes v_{k}\right)

219: \label{Decomp.Schmidt}%

220: \end{equation}

221: where $u_{k}$ (resp. $v_{k}$) is a complete orthonormal system in $H_{1}$

222: (resp. in $H_{2}$) and $\mu_{k}$ are the characteristic numbers of $T$ (i.e.

223: the square root of the eigenvalues of $T^{\ast}T$) and

224: \[

225: \lim_{k\rightarrow+\infty}\mu_{k}=0.

226: \]

227: From (\ref{normsup}) we obtain%

228: \[

229: \left\Vert T\right\Vert _{\infty}=\sup_{k}\left\{  \mu_{k}\right\}  .

230: \]

231: When $T$ is symmetric $\mu_{k}$ is the $k^{th}$ eigenvalue of $T$ (then

232: $u_{k}=v_{k}$). In this situation and from (\ref{Decomp.Schmidt}) one may

233: define the square root of $T$ whenever $T$ maps $H$ ont $H$ and is positive :

234: $T^{1/2}$ is still a linear operator defined by :%

235: \begin{equation}

236: T^{1/2}=\sum_{k\in\mathbb{N}}\sqrt{\mu_{k}}\left(  u_{k}\otimes u_{k}\right)

237: . \label{racop}%

238: \end{equation}

239: Note that finite rank operators are always compact.\newline%

240: \textbf{Hilbert-Schmidt operators} : We also mention the celebrated space of

241: Hilbert-Schmidt operators $\mathcal{HS}\left(  H_{1},H_{2}\right)  $ - a

242: subspace of $\mathcal{C}_{H_{1}H_{2}}.$ Let $\left(  u_{i}\right)  _{i\geq0}$

243: be a basis of $H_{1}$ then $T\in\mathcal{HS}\left(  H_{1},H_{2}\right)  $

244: whenever%

245: \[

246: \sum_{i=1}^{+\infty}\left\Vert T\left(  u_{i}\right)  \right\Vert _{H_{2}}%

247: ^{2}<+\infty.

248: \]

249: The space $\mathcal{HS}$ is itself a separable Hilbert space endowed with

250: scalar product%

251: \[

252: \left\langle T,S\right\rangle _{\mathcal{HS}}=\sum_{i=1}^{+\infty}\left\langle

253: T\left(  u_{i}\right)  ,S\left(  u_{i}\right)  \right\rangle _{H_{2}}%

254: \]

255: and $\left\langle T,S\right\rangle _{\mathcal{HS}}$ does not depend on the

256: choice of the basis $\left(  u_{i}\right)  _{i\geq0}.$ Finally the following

257: bound is valid for all $T\in\mathcal{HS}$ :%

258: \[

259: \left\Vert T\right\Vert _{\infty}\leq\left\Vert T\right\Vert _{\mathcal{HS}}.

260: \]

261: \newline\textbf{Unbounded operators} : If $T$ is a one to one (injective)

262: selfadjoint compact operator mapping a Hilbert space $H$ onto $H$, $T$ admits

263: an inverse $T^{-1}$. The operator $T^{-1}$ is defined on a dense (and

264: distinct) subspace of $H$ :%

265: \[

266: \mathcal{D}\left(  T^{-1}\right)  =\left\{  x=\sum_{p\in\mathbb{N}}x_{p}%

267: u_{p}:\sum_{p\in\mathbb{N}}\dfrac{x_{p}^{2}}{\mu_{p}^{2}}<+\infty\right\}  .

268: \]

269: It is unbounded which also means that $T^{-1}$ is continuous at no point for

270: which it is defined and $\left\Vert T^{-1}\right\Vert _{\infty}=+\infty$.

271:

272: \section{The model}

273:

274: We are now in position to introduce this (random input - linear) regression

275: model :%

276:

277: \begin{equation}

278: y_{i}=\left\langle \phi,X_{i}\right\rangle _{W}+\left\langle \psi

279: ,X_{i}^{\prime}\right\rangle _{L}+\varepsilon_{i} \label{modele}%

280: \end{equation}

281: where all random variables are assumed to be centered. The main result of the

282: paper (see next section) gives an asymptotic expansion for the mean square

283: prediction error in (\ref{modele}).

284:

285: The unknown functions $\phi$ and $\psi$ belong to $W$ and $L$ respectively.

286:

287: Obviously we are going to face two issues :

288:

289: \begin{itemize}

290: \item Studying the identifiability of $\phi$ and $\psi$ in the model above.

291:

292: \item Providing a consistent estimation procedure for $\phi$ and $\psi$.

293: \end{itemize}

294:

295: From now on we suppose that :%

296: \[

297: \mathbf{A1}:\left\Vert X\right\Vert _{W}<M\quad a.s.

298: \]

299:

300:

301: This assumption could be relaxed for milder moment assumptions. We claim that

302: our main result holds whenever%

303: \[

304: \mathbf{A}^{\prime}\mathbf{1}:\mathbb{E}\left\Vert X\right\Vert _{W}^{8}<M.

305: \]

306: is true. But considering $\mathbf{A}^{\prime}\mathbf{1}$ would lead us to

307: longer and more intricate methods of proof.

308:

309: \section{Estimation procedure\label{section.estimation.procedure}}

310:

311: \subsection{The moment method}

312:

313: Inference is based on moment formulas. From (\ref{modele}) we derive the two

314: following normal equation -multiply with $\left\langle X_{i},\cdot

315: \right\rangle $ and $\left\langle X_{i}^{\prime},\cdot\right\rangle $

316: successively then take expectation :%

317: \begin{equation}

318: \left\{

319: \begin{tabular}

320: [c]{l}%

321: $\delta=\Gamma\phi+\Gamma^{\prime}\psi,$\\

322: $\delta^{\prime}=\Gamma^{\prime\ast}\phi+\Gamma^{\prime\prime}\psi.$%

323: \end{tabular}

324: \ \ \ \ \ \ \ \ \ \ \right.  \label{syst}%

325: \end{equation}

326: where $\Gamma,$ $\Gamma^{\prime},$ $\Gamma^{\prime\ast},$ $\Gamma

327: ^{\prime\prime}$ are the covariance and cross-covariance of the couple

328: $\left(  X_{i},X_{i}^{\prime}\right)  _{1\leq i\leq n}$ defined by :%

329: \begin{align*}

330: \Gamma &  =\mathbb{E}\left(  X\otimes_{W}X\right)  ,\;\Gamma^{\prime\ast

331: }=\mathbb{E}\left(  X\otimes_{W}X^{\prime}\right)  ,\\

332: \Gamma^{\prime}  &  =\mathbb{E}\left(  X^{\prime}\otimes_{L}X\right)

333: ,\;\Gamma^{\prime\prime}=\mathbb{E}\left(  X^{\prime}\otimes_{L}X^{\prime

334: }\right)  ,

335: \end{align*}

336: and%

337: \[

338: \delta=\mathbb{E}\left(  yX\right)  \in W,\quad\delta^{\prime}=\mathbb{E}%

339: \left(  yX^{\prime}\right)  \in L.

340: \]

341:

342:

343: Under assumption $\mathbf{A1}$ or $\mathbf{A}^{\prime}\mathbf{1}$ the

344: covariance operators belong to $\mathcal{HS}\left(  W\right)  $,

345: $\mathcal{HS}\left(  W,L\right)  $, $\mathcal{HS}\left(  L,W\right)  $ or to

346: $\mathcal{HS}\left(  L\right)  $. Besides the covariance and cross-covariance

347: mentioned above are linked through the relation%

348: \[

349: \Gamma^{\prime\ast}=D\Gamma,\quad\Gamma^{\prime\prime}=D\Gamma^{\prime}.

350: \]

351:

352:

353: Resolving the system (\ref{syst}) is apparently easy but we should be aware of

354: two facts :

355:

356: \begin{itemize}

357: \item Operators (here, $\Gamma,\Gamma^{\prime}..$.) do not commute !

358:

359: \item The inverse operators of $\Gamma,$ and $\Gamma^{\prime\prime}$ do not

360: necessarily exist and when they do, they are unbounded, i.e. not continuous

361: (recall that $\Gamma,$ and $\Gamma^{\prime\prime}$ are compact operators and

362: that compact operators have no bounded inverses).

363: \end{itemize}

364:

365: Before trying to solve (\ref{syst}) we will first study identifiability of the

366: unknown infinite dimensional parameter $\left(  \phi,\psi\right)  \in W\times

367: L$ in the next subsection. We complete our definitions and notations first.

368:

369: We start from a sample $\left(  X_{i},X_{i}^{\prime}\right)  _{1\leq i\leq n}%

370: $. By $\Gamma_{n},\Gamma_{n}^{\prime},\Gamma_{n}^{\prime\ast},\Gamma

371: _{n}^{\prime\prime},\delta_{n}$ and $\delta_{n}^{\prime}$ we denote the

372: empirical counterparts of the operators and vectors introduced above and based

373: on the sample $\left(  y_{i},X_{i},X_{i}^{\prime}\right)  _{1\leq i\leq n}$.

374: For example :%

375: \begin{align}

376: \Gamma_{n}  &  =\dfrac{1}{n}\sum_{k=1}^{n}X_{k}\otimes_{W}X_{k},\label{esti}\\

377: \Gamma_{n}^{\prime}  &  =\dfrac{1}{n}\sum_{k=1}^{n}X_{k}^{\prime}\otimes

378: _{L}X_{k},\nonumber\\

379: \delta_{n}  &  =\dfrac{1}{n-1}\sum_{k=1}^{n-1}y_{k}X_{k}.\nonumber

380: \end{align}

381:

382:

383: \subsection{Identifiability}

384:

385: Both equations in (\ref{syst}) are the starting point of the estimation

386: procedure. We should make sure that solutions to these equations are well and

387: uniquely defined. Suppose for instance that \textrm{Ker}$\Gamma\neq\left\{

388: 0\right\}  $ and take $h$ in it. Now set $\widetilde{\phi}=\phi+h.$ Then

389: \[

390: \Gamma\widetilde{\phi}=\Gamma\phi+\Gamma h=\Gamma\phi.

391: \]

392: So $\Gamma\widetilde{\phi}=\Gamma\phi$ and since $\Gamma^{\prime\ast}=D\Gamma$

393: it is plain that $\Gamma^{\prime\ast}\widetilde{\phi}=\Gamma^{\prime\ast}%

394: \phi.$ Consequently $\widetilde{\phi}$ is another solution to (\ref{syst}).

395: There are indeed even infinitely many solutions in the space $\phi

396: +$\textrm{Ker}$\Gamma$. For similar reasons about $\psi$ we should impose

397: $\mathrm{Ker}T=\left\{  0\right\}  $ for $T=\left\{  \Gamma,\Gamma^{\prime

398: },\Gamma^{\prime\ast},\Gamma^{\prime\prime}\right\}  .$ It turns out that the

399: only necessary assumption is%

400: \[

401: \mathbf{A2}:\mathrm{Ker}\Gamma=\mathrm{Ker}\Gamma^{\prime\prime}=\left\{

402: 0\right\}  \mathrm{.}%

403: \]

404: It is easily seen that $\mathbf{A2}$ implies $\mathrm{Ker}\Gamma^{\prime

405: }=\mathrm{Ker}\Gamma^{\prime\ast}=\left\{  0\right\}  .$ With other words we

406: suppose that both operators $\Gamma$ and $\Gamma^{\prime\prime}$ above are one

407: to one.

408:

409: We are now ready to solve the identification problem.

410:

411: \begin{proposition}

412: \label{Ident}The couple $\left(  \phi,\psi\right)  \in W\times L$ is

413: identifiable for the moment method proposed in (\ref{syst}) if and only if

414: $\mathbf{A2}$ holds and $\left(  \phi,\psi\right)  \notin\mathcal{N}$ where

415: $\mathcal{N}$ is the vector subspace of $W\times L$ defined by :%

416: \begin{equation}

417: \mathcal{N=}\left\{  \left(  \phi,\psi\right)  :\phi+D^{\ast}\psi=0\right\}  .

418: \label{fluke}%

419: \end{equation}

420:

421: \end{proposition}

422:

423: The above Proposition is slightly abstract but (\ref{fluke}) may be simply

424: rewritten: $\left(  \phi,\psi\right)  \in\mathcal{N}$ whenever for all

425: function $f$ in $W,$%

426: \[

427: \int\left(  f\phi+f^{\prime}\phi^{\prime}+f^{\prime}\phi\right)  =0

428: \]

429:

430:

431: Note that $\mathcal{N}$ is a closed set in $W\times L$. From now on we will

432: assume that :%

433: \[

434: \mathbf{A3}:\left(  \phi,\psi\right)  \notin\mathcal{N}\text{.}%

435: \]

436:

437:

438: \section{Definition of the estimates}

439:

440: The estimates stem from (\ref{syst}) which is a non invertible system. Under

441: assumption $\mathbf{A2}$ the solution exists and is unique :%

442: \begin{equation}

443: \left\{

444: \begin{tabular}

445: [c]{l}%

446: $\phi=\left(  \Gamma-\Gamma^{\prime}\Gamma^{\prime\prime-1}\Gamma^{\prime\ast

447: }\right)  ^{-1}\left[  \delta-\Gamma^{\prime}\Gamma^{\prime\prime-1}%

448: \delta^{\prime}\right]  ,$\\

449: $\psi=\left(  \Gamma^{\prime\prime}-\Gamma^{\prime\ast}\Gamma^{-1}%

450: \Gamma^{\prime}\right)  ^{-1}\left[  \delta^{\prime}-\Gamma^{\prime\ast}%

451: \Gamma^{-1}\delta\right]  .$%

452: \end{tabular}

453: \ \ \ \ \ \ \ \ \right.  \label{soluce}%

454: \end{equation}

455: Let us denote

456: \begin{align*}

457: S_{\phi}  &  =\Gamma-\Gamma^{\prime}\Gamma^{\prime\prime-1}\Gamma^{\prime\ast

458: },\\

459: S_{\psi}  &  =\Gamma^{\prime\prime}-\Gamma^{\prime\ast}\Gamma^{-1}%

460: \Gamma^{\prime}.

461: \end{align*}

462: The reader should note two crucial facts. On the one hand $\Gamma^{-1}$ and

463: $\Gamma^{\prime\prime-1}$ are unbouded operators but closed graphs argument

464: ensure that $\Gamma^{\prime}\Gamma^{\prime\prime-1}\delta^{\prime}$ and

465: $\Gamma^{\prime\ast}\Gamma^{-1}\delta$ exist in $W$ and $L$ respectively. On

466: the other hand $\delta-\Gamma^{\prime}\Gamma^{\prime\prime-1}\delta^{\prime}$

467: (resp. $\delta^{\prime}-\Gamma^{\prime\ast}\Gamma^{-1}\delta$) belong to the

468: domain of the unbounded operator $S_{\phi}^{-1}$ (resp. $S_{\psi}^{-1}$) which

469: also ensures the finiteness of both solutions given in the display above.

470:

471: Finding approximations to the solutions of (\ref{soluce}) is known in the

472: mathematical literature as "solving a linear inverse problem". The book by

473: Tikhonov and Arsenin (1977) -as many other references therein- is devoted to

474: this theory well-known in image reconstruction. The unboundedness of $S_{\phi

475: }^{-1}$ may cause large variation of $S_{\phi}^{-1}x$ even for small

476: variations of $x$. This lack of stability turns out to damage, as well as the

477: traditional "curse of dimensionality", the rates of convergence of our estimates.

478:

479: Unfortunately we cannot simply replace "theoretical" operators and vectors by

480: their empirical estimates because $\Gamma_{n}$ and $\Gamma_{n}^{\prime\prime}$

481: are not invertible. Indeed they are finite-rank operators (for example the

482: image of $\Gamma$ is \textrm{span}$\left(  X_{1},...,X_{n}\right)  $) hence

483: not even injective. We are classically going to add a small perturbation to

484: regularize $\Gamma_{n}$ and $\Gamma_{n}^{\prime\prime}$ (see Tikhonov and

485: Arsenin (1977)) and another one for $S_{\phi}^{-1}$ and make them invertible.

486: At last $\Gamma^{-1}$ is approximated by $\Gamma_{n}^{\dag}=\left(  \Gamma

487: _{n}+\alpha_{n}I\right)  ^{-1},$ $\Gamma^{\prime\prime-1}$ by $\Gamma

488: _{n}^{\prime\prime\dag}=\left(  \Gamma_{n}^{\prime\prime}+\alpha_{n}I\right)

489: ^{-1}$ and $S_{\phi}^{-1}$ by $\left(  S_{n,\phi}+\beta_{n}I\right)  ^{-1}$

490: where%

491: \[

492: S_{n,\phi}=\Gamma_{n}-\Gamma_{n}^{\prime}\left(  \Gamma_{n}^{\prime

493: \prime\dagger}\right)  \Gamma_{n}^{\prime\ast}.

494: \]

495: and $\alpha_{n}>0,$ $\beta_{n}>0.$ We also set :%

496: \begin{align}

497: S_{n,\psi}  &  =\Gamma_{n}^{\prime\prime}-\Gamma_{n}^{\prime\ast}\left(

498: \Gamma_{n}^{\dagger}\right)  \Gamma_{n}^{\prime},\label{Snpsi}\\

499: u_{n,\phi}  &  =\delta_{n}-\Gamma_{n}^{\prime}\left(  \Gamma_{n}^{\prime

500: \prime\dagger}\right)  \delta_{n}^{\prime},\label{Tnphi}\\

501: u_{n,\psi}  &  =\delta_{n}^{\prime}-\Gamma_{n}^{\prime\ast}\left(  \Gamma

502: _{n}^{\dagger}\right)  \delta_{n}. \label{Tnpsi}%

503: \end{align}

504:

505:

506: In the sequel we will assume that both strictly positive sequences $\alpha

507: _{n}$ and $\beta_{n}$ decay to zero in order to get the asymptotic convergence

508: of the estimates.

509:

510: \begin{definition}

511: The estimate of the couple $\left(  \phi,\psi\right)  $ is $\left(

512: \widehat{\phi}_{n},\widehat{\psi}_{n}\right)  $ based on (\ref{soluce}) and

513: defined by :%

514: \begin{equation}

515: \left\{

516: \begin{tabular}

517: [c]{l}%

518: $\widehat{\phi}_{n}=\left(  S_{n,\phi}+\beta_{n}I\right)  ^{-1}u_{n,\phi

519: },\smallskip$\\

520: $\widehat{\psi}_{n}=\left(  S_{n,\psi}+\beta_{n}I\right)  ^{-1}u_{n,\psi}.$%

521: \end{tabular}

522: \ \right.  \label{estim}%

523: \end{equation}

524: The predictor is defined as%

525: \[

526: \widehat{y}_{n+1}=\left\langle \widehat{\phi}_{n},X_{n+1}\right\rangle

527: _{W}+\left\langle \widehat{\psi}_{n},X_{n+1}^{\prime}\right\rangle _{L}.

528: \]

529:

530: \end{definition}

531:

532: \section{Main results and comments}

533:

534: In Mas, Pumo (2006) the authors obtained convergence in probability for their

535: estimates in a quite different model. We are now in position to assess deeper

536: results. Mean square prediction error is indeed given an asymptotic

537: development depending on both smoothing sequences $\alpha_{n}$ and $\beta_{n}$.

538:

539: Before stating the main result of this article, we give and comment the next

540: and last assumption :%

541: \begin{equation}

542: \mathbf{A4}:\left\{

543: \begin{tabular}

544: [c]{l}%

545: $\left\Vert \Gamma^{-1/2}\phi\right\Vert _{W}<+\infty$\\

546: $\left\Vert \left(  \Gamma^{\prime\prime}\right)  ^{-1/2}\psi\right\Vert

547: _{L}<+\infty$%

548: \end{tabular}

549: \ \ \ \ \ \right.  \label{A4}%

550: \end{equation}

551:

552:

553: For the definition of $\Gamma^{-1/2}$ and $\Gamma^{\prime\prime-1/2}$ we refer

554: to (\ref{racop}). Let us explain briefly what both conditions in (\ref{A4})

555: mean. To that aim we rewrite the first by developing $\Gamma^{-1/2}\phi$ in a

556: basis of eigenvectors of $\Gamma,$ say $u_{p}$%

557: \[

558: \Gamma^{-1/2}\phi=\sum_{p=1}^{+\infty}\dfrac{\left\langle \phi,u_{p}%

559: \right\rangle }{\sqrt{\lambda_{p}}}u_{p}%

560: \]

561: hence%

562: \[

563: \left\Vert \Gamma^{-1/2}\phi\right\Vert _{W}^{2}=\sum_{p=1}^{+\infty}%

564: \dfrac{\left\langle \phi,u_{p}\right\rangle ^{2}}{\lambda_{p}}%

565: \]

566: The first part of assumption $\mathbf{A4}$ tells us that "$\left\langle

567: \phi,u_{p}\right\rangle $ should tend to zero quickly enough with respect to

568: $\lambda_{p}$". In other words $\phi$ should belong to an ellipso\"{\i}d of

569: $W$ which may be more or less "flat" depending on the rate of decay of the

570: $\lambda_{p}$'s to zero. Assumption $\mathbf{A4}$ is in fact \textbf{a

571: regularity condition} on functions $\phi$ and $\psi$ : function $\phi$ (resp.

572: $\psi$) should be smoother than $X$ (resp. $X^{\prime}$).\bigskip

573:

574: We could try and state convergence results for $\widehat{\phi}_{n}$ and

575: $\widehat{\psi}_{n}$ separatedly but it turns out that :

576:

577: \begin{itemize}

578: \item The real statistical interest of the model relies on its predictive

579: power. The statistician is mainly interested in $\widehat{y}_{n+1},$ not in

580: $\widehat{\phi}_{n}$ and $\widehat{\psi}_{n}$ in a first attempt. The issue of

581: goodness of fit tests (involving $\phi$ and $\psi$ alone) is beyond the scope

582: of this article.

583:

584: \item Considering the mean square norm of $\left\langle \widehat{\phi}%

585: _{n},X_{n+1}\right\rangle _{W}$ (instead of $\widehat{\phi}_{n}$ or even of

586: $\left\langle \widehat{\phi}_{n},x\right\rangle _{W}$ for a nonrandom $x$) has

587: a smoothing effect on our estimates and partially counterbalance the side

588: effects of the underlying inverse problem as will be seen within the proofs

589: (especially along Lemma \ref{placebo}).

590: \end{itemize}

591:

592: Turning to $\widehat{y}_{n+1},$ the next question is : what should we compare

593: $\widehat{y}_{n+1}$ with~? The right answer is not $y_{n+1}.$ Obviously we

594: could, but it is also plain that, due to the random $\varepsilon_{n+1}$ the

595: best possible prediction for $y_{n+1}$ knowing $X_{n+1}$ (or even the "past"

596: i.e. $X_{1},...,X_{n}$) is the conditional expectation :%

597: \[

598: y_{n+1}^{\ast}=\mathbb{E}\left(  y_{n+1}|X_{1},...,X_{n+1}\right)

599: =\left\langle \phi,X_{n+1}\right\rangle _{W}+\left\langle \psi,X_{n+1}%

600: ^{\prime}\right\rangle .

601: \]

602: We are now ready to state the main theoretical result of this article.

603:

604: \begin{theorem}

605: \label{conv}When assumptions $\mathbf{A1-A4}$ hold the following expansion is

606: valid for the prediction mean square error :%

607: \[

608: \mathbb{E}\left(  \widehat{y}_{n+1}-y_{n+1}^{\ast}\right)  ^{2}=O\left(

609: \frac{\beta^{2}}{\alpha^{2}}\right)  +O\left(  \frac{1}{\alpha^{2}\beta^{2}%

610: n}\right)

611: \]

612:

613: \end{theorem}

614:

615: \begin{remark}

616: Replacing $y_{n+1}^{\ast}$ with $y_{n+1}$ is still possible. We may easily

617: prove that~:%

618: \[

619: \mathbb{E}\left(  \widehat{y}_{n+1}-y_{n+1}\right)  ^{2}=\mathbb{E}\left(

620: \widehat{y}_{n+1}-y_{n+1}^{\ast}\right)  ^{2}+\sigma_{\varepsilon}^{2}.

621: \]

622:

623: \end{remark}

624:

625: \begin{corollary}

626: From Theorem \ref{conv} above an optimal choice for $\beta$ is $\beta^{\ast

627: }\asymp n^{-1/4},$ then the convergence rate is :%

628: \[

629: \mathbb{E}\left(  \widehat{y}_{i}-y_{i}^{\ast}\right)  ^{2}=O\left(  \frac

630: {1}{\alpha^{2}n^{1/2}}\right)

631: \]

632: and may be quite close from $1/n^{1/2}$.

633: \end{corollary}

634:

635: The proof of the Corollary will be omitted. Studying the optimality of this

636: rate of convergence over the classes of functions defined by $\mathbf{A4}$ is

637: beyond the scope of this article but could deserve more attention.

638:

639: \begin{remark}

640: Originally the linear model (\ref{modele}) is subject to serious

641: multicolinearity troubles since $X_{n}^{\prime}=DX_{n}.$ Even if the curve

642: $X_{n}^{\prime}$ usually looks quite different from $X_{n},$ there is a total

643: stochastic dependence between them. The method used in this article to tackle

644: this problem (as well as the intrinsic "inverse problem" aspects related to

645: the inversion of the covariance operators $\Gamma$ and $\Gamma^{\prime\prime}%

646: $) is new up to the authors' knowledge. As it can be seen through above at

647: display (\ref{estim}) or in the proofs below, it relies on a double

648: penalization technique first by the index $\alpha_{n}$ then by $\beta_{n}$

649: linking both indexes in order to suppress the bias terms asymptotically .

650: \end{remark}

651:

652: \section{An application to spectrometric data}

653:

654: In this section we will present an application of the Functional Linear

655: Regression with Derivatives (FLRD) introduced in this paper to a spectroscopic

656: calibration problem. Quantitative NIR (near-infrared) spectroscopy is used to

657: analyze food and agricultural materials. The NIR spectrum of a sample is a

658: continuous curve giving the absorption, that is $\log_{10}1/R$ where $R$ is

659: the reflection of the sample, against wavelength measured in nanometers (nm).

660:

661: In the \texttt{cookie} example considered here the aim is to predict the

662: percentage of each ingredient $y$ given the NIR spectrum $x$ of the sample

663: (see Osborne et al. (1984) for a full description of the experiment). The

664: constituents under investigation are: fat, sucrose, dry flour, and water.

665: There were 39 samples in the calibration set, sample number 23 having been

666: excluded from the original 40 as an outlier, and a further validation set with

667: 31 samples, again after the exclusion of one outlier.

668:

669: An NIR reflectance spectrum is available for each dough. The original spectral

670: data consists of 700 points measured from 1100 to 1498 nm in steps of 2 nm.

671: Following Brown et al. (2001) we reduced the number of spectral points to 256

672: by considering only the spectral range 1380-2400 nm in step of 4 nm. Samples

673: of centered spectra are plotted in Figure \ref{Cookie_spectres}.\bigskip\

674: %TCIMACRO{\FRAME{ftbpFU}{4.9035in}{3.1099in}{0pt}{\Qcb{Centered original

675: %spectra of four samples (256 measures)}}{\Qlb{Cookie_spectres}}%

676: %{cookie_spectres.eps}{\special{ language "Scientific Word";  type "GRAPHIC";

677: %maintain-aspect-ratio TRUE;  display "USEDEF";  valid_file "F";

678: %width 4.9035in;  height 3.1099in;  depth 0pt;  original-width 7.6268in;

679: %original-height 4.8274in;  cropleft "0";  croptop "1";  cropright "1";

680: %cropbottom "0";  filename 'cookie_spectres.eps';file-properties "XNPEU";}}}%

681: %BeginExpansion

682: \begin{figure}

683: [ptb]

684: \begin{center}

685: \includegraphics[

686: height=3.1099in,

687: width=4.9035in

688: ]%

689: {cookie_spectres.eps}%

690: \caption{Centered original spectra of four samples (256 measures)}%

691: \label{Cookie_spectres}%

692: \end{center}

693: \end{figure}

694: %EndExpansion

695:

696:

697: %\begin{figure}[h]

698: %\begin{center}

699: %\includegraphics*[scale=0.5 ]{Figures/Cookie-spectres.eps}

700: %\end{center}

701: %\caption{The spectrometric curves.}

702: %\label{Cookie_spectres}

703: %\end{figure}

704:

705:

706: A classical tool employed in the chemiometric literature for the prediction of

707: $y$ knowing the associated NIR spectra $(x_{j},j=1,\ldots,256)$ is the linear

708: model:%

709: \begin{equation}

710: y=\sum_{j=1,256}\theta_{j}x_{j}+\epsilon\label{Mult.Lin.Mod}%

711: \end{equation}

712: The problem then is to use the calibration data to estimate the unknown

713: parameters $\theta_{j}$. Clearly in this application since $39\ll256$ the

714: ordinary least squares fails and many authors proposed to use alternative

715: methods to tackle the problem: principal component regression (PCR) or partial

716: least squares regression (PLS). We invite the reader to look at the paper of

717: Frank and Friedman (1993) for a statistical view of some chemiometrics

718: regression tools.

719:

720: Following an idea of Hastie and Mallows, in their discussion of Frank and

721: Friedman's paper, we consider a spectrum as a functional observation. The

722: functional Linear Regression (FLR) corresponding to the model

723: \ref{Mult.Lin.Mod} defined above is:%

724: \[

725: y=\int_{\delta}x(t)\theta(t)dt+\varepsilon

726: \]

727: where $y$ is a scalar random variable, $x$ a real function defined on

728: $\delta=[1100,2400]$\ and $\theta(t)$ the unknown parameter function. Brown et

729: al. (2001), Ferraty and Vieu (2003), Marx and Eilers (2002) or Amato et al.

730: (2006) used such a model for a prediction problem with spectrometric data.

731:

732: The model FLRD introduced in this paper can be written as:%

733: \[

734: y=\int_{\delta}x(t)\phi(t)dt+\int_{\delta}x^{\prime}(t)\psi(t)dt+\varepsilon

735: \]

736: where $\phi(t)$ and $\psi(t)$ are unknown functions (see display

737: (\ref{modele}) for an equivalent definition). In this paragraph we compare the

738: performance of PCR, PLS, FLR, FLRD, Spline Smoothing model proposed by Cardot,

739: Ferraty and Sarda (2006) and Bayes wavelet predictions proposed by Brown et

740: al. (2001).

741:

742: We used the calibration data set for the estimation of parameter functions

743: $\phi(t)$ and $\psi(t)$ and validation data for calculation of the MSEP (Mean

744: Squared Error of Predictions):%

745: \[

746: MSEP=\frac{1}{31}\sum_{j=1}^{31}(y_{j}-\hat{y}_{j})^{2}%

747: \]

748: where $\hat{y}_{j}$\ is the prediction of $y_{j}$ obtained by the model with

749: estimated parameters. The choice of the parameters $\alpha$ and $\beta$ is

750: crucial for the prediction model. We used a cross-validation approach based on

751: the evaluation of the standard error of prediction $CVMSEP$:

752: \[

753: CVMSEP(\alpha,\beta)=\frac{1}{39}\sum_{i=1}^{39}[\frac{1}{38}\sum

754: \nolimits_{j=1}^{38}(y_{i}^{c}-\hat{y}_{j}^{c}(i;\alpha,\beta))^{2}],

755: \]

756: where $\hat{y}_{j}^{c}(i;\alpha,\beta)$ denotes the prediction of $y_{j}^{c}%

757: $\ in the calibration set without sample $i$. Results for different methods of

758: prediction of four ingredients are displayed in Table \ref{Table.MSEP}. We

759: used B-spline basis ($k=100$) for obtaining predictions with Spline Smoothing,

760: Spline Ridge RLF and Spline RLFD methods. For each of those methods we give

761: the values of the smoothing or penalty parameters based on an analogous

762: cross-validation approach.

763:

764: \begin{table}[h]

765: \begin{center}%

766: \begin{tabular}

767: [c]{|l|cccc|}\hline

768: & \multicolumn{4}{|c|}{MSE Validation}\\

769: Method and parameters & Fat & Sugar & Flour & Water\\\hline

770: PLS & 0.151 & 0.583 & 0.375 & 0.105\\

771: PCR & 0.160 & 0.614 & 0.388 & 0.106\\

772: Spline Smoothing ($k_{n}=8$) & 0.546 & 0.471 & 2.226 & 0.183\\

773: Spline Ridge FLR ($\beta=0.00002$) & 0.044 & 0.494 & 0.318 & 0.087\\

774: Spline FLRD ($\alpha=0.07,\beta=0.15$) & 0.092 & 0.450 & 0.332 & 0.069\\

775: Bayes Wavelet & 0.063 & 0.449 & 0.348 & 0.050\\\hline

776: \end{tabular}

777: \end{center}

778: \caption{$MSEP$ criterion for all models (see Brown et al. for results of PLS,

779: PCR and Bayes wavelet methods).}%

780: \label{Table.MSEP}%

781: \end{table}

782:

783: We note that functional approaches work better then PLS or PCR methods for the

784: four predicted variables with respect to $MSEP$ criterion. Our simulation, as

785: noted also by Marx and Eilers (2002), show that functional methods lead to

786: more stable prediction. The Spline FLRD method produces in general equivalent

787: results in terms of predictions with the best methods presented in table

788: \ref{Table.MSEP}.

789:

790: \bigskip

791:

792: \section{Proofs}

793:

794: In the sequel $M$ and $M^{\prime}$ will stand for constants.

795:

796: Let $S$ and $T$ be two selfadjoint linear operators on a Hilbert space $H,$ we

797: denote $T\ll S$ whenever for all $x$ in $H,$ $\left\langle Tx,x\right\rangle

798: \leq\left\langle Sx,x\right\rangle $ then $\left\Vert T\right\Vert _{\infty

799: }\leq\left\Vert S\right\Vert _{\infty}$.

800:

801: The norm in the space $L^{2}\left(  B\right)  $ where $\left(  B,\left\Vert

802: \cdot\right\Vert _{B}\right)  $ is a Banach space is defined the following way

803: : let $X$ be a random element in the Banach space $B,$ then

804: \[

805: \left\Vert X\right\Vert _{L^{2}\left(  B\right)  }=\left(  E\left\Vert

806: X\right\Vert _{B}^{2}\right)  ^{1/2}%

807: \]

808: When the notation is not ambiguous we systematically drop the index $B$ i.e :

809: $\left\Vert X\right\Vert _{L^{2}}=\left(  E\left\Vert X\right\Vert _{B}%

810: ^{2}\right)  ^{1/2}$.

811:

812: \subsection{Preliminary facts :}

813:

814: In order to gain some clarity in the proofs and to alleviate them we first

815: list a few results stemming from operator or probabillity theory.

816:

817: \textbf{Fact 1}: If $T$ is a positive operator (either random or not),

818: $T+\gamma I$ is invertible for all $\gamma>0$ with bounded inverse and

819: $\left\Vert \left(  T+\gamma I\right)  ^{-1}\right\Vert _{\infty}\leq

820: \gamma^{-1}$. Hence%

821: \begin{equation}

822: \left\Vert \Gamma_{n}^{\dag}\right\Vert _{\infty}=\left\Vert \Gamma^{\dag

823: }\right\Vert _{\infty}=\left\Vert \Gamma_{n}^{\prime\prime\dag}\right\Vert

824: _{\infty}=\left\Vert \Gamma^{\prime\prime\dag}\right\Vert _{\infty}%

825: =\alpha^{-1} \label{norm.res}%

826: \end{equation}

827:

828:

829: \textbf{Fact 2}: As a consquence of assumption $\mathbf{A1}$ and of the strong

830: law of large numbers for Hilbert valued random elements (see Ledoux, Talagrand

831: (1991) Chapter 7),%

832: \[

833: T_{n}\underset{n\rightarrow+\infty}{\rightarrow}T\quad a.s.

834: \]

835: whenever $T_{n}=\Gamma_{n},\Gamma_{n}^{\prime},\Gamma_{n}^{\prime\ast}%

836: ,\Gamma_{n}^{\prime\prime}$ (resp. $T=\Gamma,\Gamma^{\prime},\Gamma

837: ^{\prime\ast},\Gamma^{\prime\prime}$) since all theses random operators may be

838: rewritten as sums of i.i.d. random variables. These sequences of random

839: operators are almost surely bounded%

840: \begin{equation}

841: \sup_{n}\left\Vert T_{n}\right\Vert _{\infty}\leq M\quad a.s.

842: \label{norm.delta}%

843: \end{equation}

844: which also means that

845: \begin{equation}

846: \max\left(  \sup_{n}\left\Vert \delta_{n}\right\Vert _{W},\sup_{n}\left\Vert

847: \delta_{n}^{\prime}\right\Vert _{L}\right)  \leq M^{\prime} \label{asd}%

848: \end{equation}

849: since (for instance) $\delta_{n}=\Gamma_{n}\phi+\Gamma_{n}^{\prime}\psi+e_{n}$

850: where $e_{n}$ is again a sum of i.i.d random elements :%

851: \[

852: e_{n}=\frac{1}{n}\sum_{k=1}^{n}X_{k}\varepsilon_{k}%

853: \]

854: We also set%

855: \[

856: e_{n}^{\prime}=\frac{1}{n}\sum_{k=1}^{n}X_{k}^{\prime}\varepsilon_{k}%

857: \]

858: (see below for details).

859:

860: \textbf{Fact 3}: The Central Limit Thorem in Hilbert spaces (or standards

861: results on rates of convergence for Hilbert valued random elements in square

862: norm) provide a rate in the $L^{2}$ convergence of several random variables of

863: interest in the proofs. See for instance Ledoux, Talagrand (1991) or Bosq

864: (2000) . Whenever $T_{n}=\Gamma_{n},\Gamma_{n}^{\prime},\Gamma_{n}^{\prime

865: \ast},\Gamma_{n}^{\prime\prime}$ (resp. $T=\Gamma,\Gamma^{\prime}%

866: ,\Gamma^{\prime\ast},\Gamma^{\prime\prime}$) we have $\mathbb{E}\left\Vert

867: T_{n}-T\right\Vert _{\mathcal{HS}}^{2}=O\left(  \frac{1}{n}\right)  $ hence

868: \begin{equation}

869: \left\Vert T_{n}-T\right\Vert _{L^{2}\left(  \mathcal{HS}\right)  }=O\left(

870: \dfrac{1}{\sqrt{n}}\right)  \label{L2}%

871: \end{equation}

872: since all theses random operators may be rewritten as sums of i.i.d. random variables.

873:

874: We begin with proving Proposition \ref{Ident}.

875:

876: \textbf{Proof of Proposition \ref{Ident} :}

877:

878: The method of the proof may be adapted from the model studied in Mas, Pumo

879: (2006). The couple $\left(  \phi,\psi\right)  $ will be identified whenever,

880: for any other couple $\left(  \phi_{a},\psi_{a}\right)  $, if

881: \[

882: \left\{

883: \begin{tabular}

884: [c]{l}%

885: $\delta=\Gamma\phi+\Gamma^{\prime}\psi=\Gamma\phi_{a}+\Gamma^{\prime}\psi

886: _{a},$\\

887: $\delta^{\prime}=\Gamma^{\prime\ast}\phi+\Gamma^{\prime\prime}\psi

888: =\Gamma^{\prime\ast}\phi_{a}+\Gamma^{\prime\prime}\psi_{a}.$%

889: \end{tabular}

890: \right.

891: \]

892: $\left(  \phi_{a},\psi_{a}\right)  =\left(  \phi,\psi\right)  $. This will be

893: true if%

894: \[

895: \left\{

896: \begin{tabular}

897: [c]{l}%

898: $\Gamma\left(  \phi-\phi_{a}\right)  +\Gamma^{\prime}\left(  \psi-\psi

899: _{a}\right)  =0,$\\

900: $\Gamma^{\prime\ast}\left(  \phi-\phi_{a}\right)  +\Gamma^{\prime\prime

901: }\left(  \psi-\psi_{a}\right)  =0.$%

902: \end{tabular}

903: \right.

904: \]

905:

906:

907: This means that the couple $\left(  \phi-\phi_{a},\psi-\psi_{a}\right)  $

908: belongs to the kernel of the linear operator defined blockwise on $W\times L$

909: by :%

910: \[

911: \left(

912: \begin{array}

913: [c]{cc}%

914: \Gamma & \Gamma^{\prime}\\

915: \Gamma^{\prime\ast} & \Gamma^{\prime\prime}%

916: \end{array}

917: \right)  .

918: \]

919: As $\Gamma^{\prime\ast}=D\Gamma$ and $\Gamma^{\prime\prime}=D\Gamma^{\prime}$,

920: the Proposition will be proved if the blockwise operator defined on $W\times

921: L$ and with values in $W$ :%

922: \[

923: \left(

924: \begin{array}

925: [c]{cc}%

926: \Gamma & \Gamma^{\prime}%

927: \end{array}

928: \right)  =\left(

929: \begin{array}

930: [c]{cc}%

931: \Gamma & \Gamma D^{\ast}%

932: \end{array}

933: \right)

934: \]

935: is one to one. It is plain that the kernel of this operator is precisely the

936: space $\mathcal{N}$ that appears at display (\ref{fluke}).

937:

938: This finishes the proof of the Proposition.

939:

940: The next two general Propositions are proved for further purpose.

941:

942: \begin{proposition}

943: \label{lodge}%

944: \begin{align*}

945: \sup_{n}\left\Vert \left(  \Gamma_{n}^{\prime\prime\dag}\right)  ^{1/2}%

946: \Gamma_{n}^{\prime\ast}\right\Vert _{\infty}  &  <M\quad a.s.,\\

947: \sup_{n}\left\Vert \Gamma_{n}^{\prime}\left(  \Gamma_{n}^{\prime\prime\dag

948: }\right)  ^{1/2}\right\Vert _{\infty}  &  <M\quad a.s.,\\

949: \sup_{n}\left\Vert \left(  \Gamma^{\prime\prime\dag}\right)  ^{1/2}%

950: \Gamma^{\prime\ast}\right\Vert _{\infty}  &  <M,\\

951: \sup_{n}\left\Vert \Gamma^{\prime}\left(  \Gamma^{\prime\prime\dag}\right)

952: ^{1/2}\right\Vert _{\infty}  &  <M.

953: \end{align*}

954:

955: \end{proposition}

956:

957: \begin{proof}

958: We prove only the first bound since the method may be copied for the other

959: ones. Set $R_{n}=D\Gamma_{n}^{1/2}$ then :%

960: \begin{align*}

961: \Gamma_{n}^{\prime\prime}  &  =R_{n}R_{n}^{\ast},\\

962: \left(  \Gamma_{n}^{\prime\prime\dag}\right)  ^{1/2}\Gamma_{n}^{\prime\ast}

963: &  =\left(  R_{n}R_{n}^{\ast}+\alpha I\right)  ^{-1/2}R_{n}\Gamma_{n}^{1/2}.

964: \end{align*}

965: At last,%

966: \[

967: \left\Vert \left(  \Gamma_{n}^{\prime\prime\dag}\right)  ^{1/2}\Gamma

968: _{n}^{\prime\ast}\right\Vert _{\infty}\leq\left\Vert \left(  R_{n}R_{n}^{\ast

969: }+\alpha I\right)  ^{-1/2}R_{n}\right\Vert _{\infty}\left\Vert \Gamma

970: _{n}^{1/2}\right\Vert _{\infty}.

971: \]

972: It is plain that%

973: \[

974: \sup_{n}\left\Vert \Gamma_{n}^{1/2}\right\Vert _{\infty}\leq M\quad a.s.

975: \]

976: If the Schmidt decomposition of $R_{n}$ is :%

977: \[

978: R_{n}=\sum_{k\in\mathbb{N}}\mu_{k,n}\left(  u_{k,n}\otimes v_{k,n}\right)  ,

979: \]

980: $(u_{k,n}\in W,\ v_{k,n}\in L)$ it is simple algebra to get :%

981: \begin{equation}

982: \left(  R_{n}R_{n}^{\ast}+\alpha I\right)  ^{-1/2}R_{n}=\sum_{k\in\mathbb{N}%

983: }\frac{\mu_{k,n}}{\sqrt{\mu_{k,n}^{2}+\alpha}}\left(  u_{k,n}\otimes

984: v_{k,n}\right)  \label{mano}%

985: \end{equation}

986: which yields $\left\Vert \left(  R_{n}R_{n}^{\ast}+\alpha I\right)

987: ^{-1/2}R_{n}\right\Vert _{\infty}=\sup_{k}\left\{  \frac{\mu_{k,n}}{\sqrt

988: {\mu_{k,n}^{2}+\alpha}}\right\}  \leq1.$

989: \end{proof}

990:

991: \begin{proposition}

992: \label{KT}%

993: \[

994: \left\Vert \left(  S_{n}+\beta I\right)  ^{-1}\right\Vert _{\infty}\leq

995: \dfrac{1}{\beta}.

996: \]

997:

998: \end{proposition}

999:

1000: \begin{proof}

1001: The proof of this Lemma is similar to Lemma 7.4 in Mas, Pumo (2006). It was

1002: then proved for $S$ instead of $S_{n}$ and all operators should be changed to

1003: their empirical counterparts (e.g : $\Gamma_{n}$ insted of $\Gamma$). We give

1004: a sketch of it. The proof relies on the Schmidt decomposition of $S_{n}.$ One

1005: would get%

1006: \[

1007: S_{n}=\Gamma_{n}^{1/2}\Lambda_{n}\left(  \alpha\right)  \Gamma_{n}^{1/2}%

1008: \]

1009: where $\Lambda_{n}\left(  \alpha\right)  $ and $\Gamma_{n}^{1/2}$ are

1010: symmetric positive operators, which implies that $S_{n}$ itself is positive.

1011: It suffices then to apply \textbf{Fact 2 }(see the "Preliminary facts"

1012: subsection) to get the desired result.

1013: \end{proof}

1014:

1015: \subsection{Outline of the proof of Theorem \ref{conv} :}

1016:

1017: The following bound is valid :%

1018: \begin{align*}

1019: &  \left[  \widehat{y}_{n+1}-\left(  \left\langle \phi,X_{n+1}\right\rangle

1020: _{W}+\left\langle \psi,X_{n+1}^{\prime}\right\rangle _{L}\right)  \right]

1021: ^{2}\\

1022: &  =\left(  \left\langle \phi-\widehat{\phi},X_{n+1}\right\rangle

1023: _{W}+\left\langle \psi-\widehat{\psi},X_{n+1}^{\prime}\right\rangle

1024: _{L}\right)  ^{2}\\

1025: &  \leq2\left[  \left\langle \phi-\widehat{\phi},X_{n+1}\right\rangle _{W}%

1026: ^{2}+\left\langle \psi-\widehat{\psi},X_{n+1}^{\prime}\right\rangle _{L}%

1027: ^{2}\right]  .

1028: \end{align*}

1029: Then

1030: \begin{align*}

1031: \mathbb{E}\left\langle \phi-\widehat{\phi},X_{n+1}\right\rangle _{W}^{2}  &

1032: =\mathbb{E}\left[  \mathbb{E}\left\langle \phi-\widehat{\phi},X_{n+1}%

1033: \right\rangle _{W}^{2}|X_{1},...,X_{n}\right] \\

1034: &  =\mathbb{E}\left[  \mathbb{E}\left\langle \phi-\widehat{\phi}%

1035: ,X_{n+1}\right\rangle _{W}^{2}|\widehat{\phi}\right] \\

1036: &  =\mathbb{E}\left[  \left\Vert \Gamma^{1/2}\left(  \phi-\widehat{\phi

1037: }\right)  \right\Vert _{W}^{2}\right]

1038: \end{align*}

1039: Similarly,%

1040: \[

1041: \mathbb{E}\left\langle \psi-\widehat{\psi},X_{n+1}^{\prime}\right\rangle

1042: _{L}^{2}=\mathbb{E}\left[  \left\Vert \Gamma^{\prime\prime1/2}\left(

1043: \psi-\widehat{\psi}\right)  \right\Vert _{L}^{2}\right]

1044: \]

1045: Both preceding equations feature similar expressions. We focus on the term

1046: involving $\phi$ ; we will prove that :%

1047: \[

1048: \mathbb{E}\left[  \left\Vert \Gamma^{1/2}\left(  \phi-\widehat{\phi}\right)

1049: \right\Vert _{W}^{2}\right]  =O\left(  \frac{\beta^{2}}{\alpha^{2}}\right)

1050: +O\left(  \frac{1}{\alpha^{2}\beta^{2}n}\right)  .

1051: \]

1052: Within the proof the reader will easily be convinced that the method would

1053: lead to an analogous result for the term with $\psi.$ From now in order to

1054: alleviate notations we drop the index $\phi$ in $S_{n,\phi}$ and $u_{n,\phi}$.

1055: The sequences $\left(  \alpha_{n}\right)  _{n\in\mathbb{N}}$ and $\left(

1056: \beta_{n}\right)  _{n\in\mathbb{N}}$ will be denoted $\alpha$ and $\beta$

1057: respectively and for short.

1058:

1059: We start from%

1060: \begin{align*}

1061: \widehat{\phi}_{n}  &  =\left(  \Gamma_{n}-\Gamma_{n}^{\prime}\Gamma

1062: _{n}^{\prime\prime\dag}\Gamma_{n}^{\prime\ast}+\beta I\right)  ^{-1}\left(

1063: \delta_{n}-\Gamma_{n}^{\prime}\Gamma_{n}^{\prime\prime\dag}\delta_{n}^{\prime

1064: }\right) \\

1065: &  =\left(  S_{n}+\beta I\right)  ^{-1}u_{n}\\

1066: \phi &  =\left(  \Gamma-\Gamma^{\prime}\Gamma^{\prime\prime\dag}\Gamma

1067: ^{\prime\ast}\right)  ^{-1}\left(  \delta-\Gamma^{\prime}\Gamma^{\prime

1068: \prime\dag}\delta^{\prime}\right) \\

1069: &  =S^{-1}u

1070: \end{align*}

1071:

1072:

1073: where we recall that :%

1074: \begin{align*}

1075: u  &  =\delta-\Gamma^{\prime}\Gamma^{\prime\prime\dag}\delta^{\prime},\\

1076: u_{n}  &  =\delta_{n}-\Gamma_{n}^{\prime}\Gamma_{n}^{\prime\prime\dag}%

1077: \delta_{n}^{\prime},\\

1078: S  &  =\Gamma-\Gamma^{\prime}\Gamma^{\prime\prime\dag}\Gamma^{\prime\ast},\\

1079: S_{n}  &  =\Gamma_{n}-\Gamma_{n}^{\prime}\Gamma_{n}^{\prime\prime\dag}%

1080: \Gamma_{n}^{\prime\ast}.

1081: \end{align*}

1082:

1083:

1084: The proof relies on the following decomposition :%

1085: \begin{align}

1086: \widehat{\phi}_{n}-\phi &  =\left(  S_{n}+\beta I\right)  ^{-1}\left(

1087: u_{n}-u\right)  +\left(  \left(  S_{n}+\beta I\right)  ^{-1}-S^{-1}\right)

1088: u\nonumber\\

1089: &  =\left(  S_{n}+\beta I\right)  ^{-1}\left(  u_{n}-u\right)  +\left(

1090: S_{n}+\beta I\right)  ^{-1}\left(  S-S_{n}-\beta I\right)  S^{-1}u\nonumber\\

1091: &  =A_{n}+B_{n}+C_{n} \label{decomp}%

1092: \end{align}

1093: where%

1094: \begin{align}

1095: A_{n}  &  =\left(  S_{n}+\beta I\right)  ^{-1}\left(  u_{n}-u\right)

1096: \label{T1}\\

1097: B_{n}  &  =\left(  S_{n}+\beta I\right)  ^{-1}\left(  S-S_{n}\right)

1098: \phi\label{T2}\\

1099: C_{n}  &  =\beta\left(  S_{n}+\beta I\right)  ^{-1}\phi\label{T3}%

1100: \end{align}

1101: Along the forthcoming Lemmas we determine rates of convergence for these three

1102: terms. We will prove that the rate of decrease to zero in $L^{2}$ norm is

1103: $\left(  \alpha\beta\sqrt{n}\right)  ^{-1}$ for $A_{n}$ and $B_{n}.$ The rest

1104: of the proof of the main Theorem is postponed to the end of the next and last subsection.

1105:

1106: \subsection{Proof of the main Theorem}

1107:

1108: The first Lemma gives a reta of convergence for $S_{n}-S$.

1109:

1110: \begin{lemma}

1111: \label{cran}The following holds :%

1112: \[

1113: S_{n}-S=\Gamma_{n}-\Gamma_{n}^{\prime}\Gamma_{n}^{\prime\prime\dag}\Gamma

1114: _{n}^{\prime\ast}-\Gamma+\Gamma^{\prime}\Gamma^{\prime\prime\dag}%

1115: \Gamma^{\prime\ast}=O_{L^{2}}\left(  \dfrac{1}{\alpha\sqrt{n}}\right)

1116: \]

1117:

1118: \end{lemma}

1119:

1120: \begin{proof}

1121: First of all by (\ref{L2}) :%

1122: \[

1123: \left\Vert \Gamma_{n}-\Gamma\right\Vert _{L^{2}\left(  \mathcal{HS}\right)

1124: }=O\left(  \dfrac{1}{\sqrt{n}}\right)

1125: \]

1126: We focus on%

1127: \begin{align*}

1128: &  \Gamma_{n}^{\prime}\Gamma_{n}^{\prime\prime\dag}\Gamma_{n}^{\prime\ast

1129: }-\Gamma^{\prime}\Gamma^{\prime\prime\dag}\Gamma^{\prime\ast}\\

1130: &  =\Gamma_{n}^{\prime}\Gamma_{n}^{\prime\prime\dag}\Gamma_{n}^{\prime\ast

1131: }-\Gamma^{\prime}\Gamma_{n}^{\prime\prime\dag}\Gamma_{n}^{\prime\ast}%

1132: +\Gamma^{\prime}\Gamma_{n}^{\prime\prime\dag}\Gamma_{n}^{\prime\ast}%

1133: -\Gamma^{\prime}\Gamma_{n}^{\prime\prime\dag}\Gamma^{\prime\ast}\\

1134: &  +\Gamma^{\prime}\Gamma_{n}^{\prime\prime\dag}\Gamma^{\prime\ast}%

1135: -\Gamma^{\prime}\Gamma^{\prime\prime\dag}\Gamma^{\prime\ast}.

1136: \end{align*}

1137: Then dealing with each of these three terms separatedly we get%

1138: \begin{align*}

1139: \left\Vert \Gamma_{n}^{\prime}\Gamma_{n}^{\prime\prime\dag}\Gamma_{n}%

1140: ^{\prime\ast}-\Gamma^{\prime}\Gamma_{n}^{\prime\prime\dag}\Gamma_{n}%

1141: ^{\prime\ast}\right\Vert _{\infty}  &  \leq\left\Vert \Gamma_{n}^{\prime

1142: }-\Gamma^{\prime}\right\Vert _{\infty}\left\Vert \Gamma_{n}^{\prime\prime\dag

1143: }\Gamma_{n}^{\prime\ast}\right\Vert _{\infty}\\

1144: &  \leq\left\Vert \Gamma_{n}^{\prime}-\Gamma^{\prime}\right\Vert _{\infty

1145: }\left\Vert \left(  \Gamma_{n}^{\prime\prime\dag}\right)  \right\Vert

1146: _{\infty}\left\Vert \Gamma_{n}^{\prime\ast}\right\Vert _{\infty}\\

1147: &  \leq C\dfrac{\left\Vert \Gamma_{n}^{\prime}-\Gamma^{\prime}\right\Vert

1148: _{\infty}}{\alpha}\quad a.s.

1149: \end{align*}

1150: The last bound was derived from (\ref{norm.delta}) and (\ref{norm.res}).%

1151: \begin{align*}

1152: &  \left\Vert \Gamma^{\prime}\Gamma_{n}^{\prime\prime\dag}\left(  \Gamma

1153: _{n}^{\prime\ast}-\Gamma^{\prime\ast}\right)  \right\Vert _{\infty}\\

1154: &  \leq C\dfrac{\left\Vert \Gamma_{n}^{\prime\ast}-\Gamma^{\prime\ast

1155: }\right\Vert _{\infty}}{\alpha}\quad a.s.

1156: \end{align*}

1157: At last,%

1158: \begin{align*}

1159: \Gamma^{\prime}\left(  \Gamma_{n}^{\prime\prime\dag}-\Gamma^{\prime\prime\dag

1160: }\right)  \Gamma^{\prime\ast}  &  =\Gamma^{\prime}\Gamma_{n}^{\prime\prime

1161: \dag}\left(  \Gamma^{\prime\prime}-\Gamma_{n}^{\prime\prime}\right)

1162: \Gamma^{\prime\prime\dag}\Gamma^{\prime\ast}\\

1163: &  =\Gamma^{\prime}\Gamma_{n}^{\prime\prime\dag}\left(  \Gamma^{\prime\prime

1164: }-\Gamma_{n}^{\prime\prime}\right)  \Gamma^{\prime\prime\dag}\Gamma

1165: ^{\prime\ast}\\

1166: &  =\left(  \Gamma^{\prime}-\Gamma_{n}^{\prime}\right)  \Gamma_{n}%

1167: ^{\prime\prime\dag}\left(  \Gamma^{\prime\prime}-\Gamma_{n}^{\prime\prime

1168: }\right)  \Gamma^{\prime\prime\dag}\Gamma^{\prime\ast}\\

1169: &  +\Gamma_{n}^{\prime}\Gamma_{n}^{\prime\prime\dag}\left(  \Gamma

1170: ^{\prime\prime}-\Gamma_{n}^{\prime\prime}\right)  \Gamma^{\prime\prime\dag

1171: }\Gamma^{\prime\ast}%

1172: \end{align*}

1173: Then,%

1174: \begin{align*}

1175: &  \left\Vert \Gamma^{\prime}\left(  \Gamma_{n}^{\prime\prime\dag}%

1176: -\Gamma^{\prime\prime\dag}\right)  \Gamma^{\prime\ast}\right\Vert _{\infty}\\

1177: &  \leq\left\Vert \left(  \Gamma^{\prime}-\Gamma_{n}^{\prime}\right)

1178: \Gamma_{n}^{\prime\prime\dag}\left(  \Gamma^{\prime\prime}-\Gamma_{n}%

1179: ^{\prime\prime}\right)  \Gamma^{\prime\prime\dag}\Gamma^{\prime\ast

1180: }\right\Vert _{\infty}\\

1181: &  +\left\Vert \Gamma_{n}^{\prime}\left(  \Gamma_{n}^{\prime\prime\dag

1182: }\right)  ^{1/2}\right\Vert _{\infty}\left\Vert \left(  \Gamma_{n}%

1183: ^{\prime\prime\dag}\right)  ^{1/2}\left(  \Gamma^{\prime\prime}-\Gamma

1184: _{n}^{\prime\prime}\right)  \left(  \Gamma^{\prime\prime\dag}\right)

1185: ^{1/2}\right\Vert _{\infty}\left\Vert \left(  \Gamma^{\prime\prime\dag

1186: }\right)  ^{1/2}\Gamma^{\prime\ast}\right\Vert _{\infty}%

1187: \end{align*}

1188: By Proposition \ref{lodge} the second term may be bounded by%

1189: \[

1190: C\left\Vert \left(  \Gamma_{n}^{\prime\prime\dag}\right)  ^{1/2}\left(

1191: \Gamma^{\prime\prime}-\Gamma_{n}^{\prime\prime}\right)  \left(  \Gamma

1192: ^{\prime\prime\dag}\right)  ^{1/2}\right\Vert _{\infty}=O_{L^{2}}\left(

1193: \dfrac{1}{\alpha\sqrt{n}}\right)

1194: \]

1195: since

1196: \[

1197: \left\Vert \Gamma_{n}^{\prime\prime\dag1/2}\right\Vert _{\infty}=\left\Vert

1198: \Gamma^{\prime\prime\dag1/2}\right\Vert _{\infty}=\alpha^{-1/2}.

1199: \]

1200: Cauchy-Schwartz inequality yields for the first :%

1201: \begin{align*}

1202: &  \mathbb{E}\left\Vert \left(  \Gamma^{\prime}-\Gamma_{n}^{\prime}\right)

1203: \Gamma_{n}^{\prime\prime\dag}\left(  \Gamma^{\prime\prime}-\Gamma_{n}%

1204: ^{\prime\prime}\right)  \Gamma^{\prime\prime\dag}\Gamma^{\prime\ast

1205: }\right\Vert _{\infty}^{2}\\

1206: &  \leq M\left(  \mathbb{E}\left\Vert \left(  \Gamma^{\prime}-\Gamma

1207: _{n}^{\prime}\right)  \Gamma_{n}^{\prime\prime\dag}\right\Vert _{\infty}%

1208: ^{4}\mathbb{E}\left\Vert \left(  \Gamma^{\prime\prime}-\Gamma_{n}%

1209: ^{\prime\prime}\right)  \Gamma^{\prime\prime\dag}\right\Vert _{\infty}%

1210: ^{4}\right)  ^{1/2}\\

1211: &  \leq M\dfrac{1}{n^{2}\alpha^{4}},

1212: \end{align*}

1213: hence%

1214: \[

1215: \left\Vert \left(  \Gamma^{\prime}-\Gamma_{n}^{\prime}\right)  \Gamma

1216: _{n}^{\prime\prime\dag}\left(  \Gamma^{\prime\prime}-\Gamma_{n}^{\prime\prime

1217: }\right)  \Gamma^{\prime\prime\dag}\Gamma^{\prime\ast}\right\Vert =O_{L^{2}%

1218: }\left(  \dfrac{1}{\alpha^{2}n}\right)  .

1219: \]

1220: The proof of Lemma \ref{cran} is finished.

1221: \end{proof}

1222:

1223: \begin{lemma}

1224: \label{u}We have :%

1225: \[

1226: u_{n}-u=O_{L^{2}}\left(  \dfrac{1}{\alpha\sqrt{n}}\right)  .

1227: \]

1228:

1229: \end{lemma}

1230:

1231: \begin{proof}

1232: We start with :%

1233: \[

1234: u_{n}-u=\delta_{n}-\delta+\Gamma^{\prime}\Gamma^{\prime\prime\dag}%

1235: \delta^{\prime}-\Gamma_{n}^{\prime}\Gamma_{n}^{\prime\prime\dag}\delta

1236: _{n}^{\prime}.

1237: \]

1238: Clearly $\delta_{n}-\delta=O_{L^{2}}\left(  \dfrac{1}{\sqrt{n}}\right)  $ and

1239: we study the second term%

1240: \begin{align*}

1241: \Gamma_{n}^{\prime}\Gamma_{n}^{\prime\prime\dag}\delta_{n}^{\prime}%

1242: -\Gamma^{\prime}\Gamma^{\prime\prime\dag}\delta^{\prime}  &  =\left(

1243: \Gamma_{n}^{\prime}-\Gamma^{\prime}\right)  \Gamma_{n}^{\prime\prime\dag

1244: }\delta_{n}^{\prime}+\Gamma^{\prime}\left(  \Gamma_{n}^{\prime\prime\dag

1245: }-\Gamma^{\prime\prime\dag}\right)  \delta_{n}^{\prime}\\

1246: &  +\Gamma^{\prime}\Gamma^{\prime\prime\dag}\left(  \delta_{n}^{\prime}%

1247: -\delta^{\prime}\right)  .

1248: \end{align*}

1249: Since $\delta_{n}^{\prime}$ is almost surely bounded (see (\ref{asd})),

1250: $\Gamma_{n}^{\prime}-\Gamma=O_{L^{2}}\left(  \dfrac{1}{\sqrt{n}}\right)  ,$

1251: $\delta_{n}^{\prime}-\delta^{\prime}=O_{L^{2}}\left(  \dfrac{1}{\sqrt{n}%

1252: }\right)  $ and $\left\Vert \Gamma^{\prime\prime\dag}\right\Vert _{\infty

1253: }=\left\Vert \Gamma_{n}^{\prime\prime\dag}\right\Vert _{\infty}=\alpha^{-1}$

1254: we get :%

1255: \begin{align*}

1256: \left\Vert \left(  \Gamma_{n}^{\prime}-\Gamma^{\prime}\right)  \Gamma

1257: _{n}^{\prime\prime\dag}\delta_{n}^{\prime}\right\Vert _{W}  &  =O_{L^{2}%

1258: }\left(  \dfrac{1}{\alpha\sqrt{n}}\right)  ,\\

1259: \left\Vert \Gamma^{\prime}\Gamma^{\prime\prime\dag}\left(  \delta_{n}^{\prime

1260: }-\delta^{\prime}\right)  \right\Vert _{W}  &  =O_{L^{2}}\left(  \dfrac

1261: {1}{\alpha\sqrt{n}}\right)  .

1262: \end{align*}

1263: The remaining term is%

1264: \begin{align*}

1265: \Gamma^{\prime}\left(  \Gamma_{n}^{\prime\prime\dag}-\Gamma^{\prime\prime\dag

1266: }\right)  \delta_{n}^{\prime}  &  =\Gamma^{\prime}\Gamma^{\prime\prime\dag

1267: }\left(  \Gamma^{\prime\prime}-\Gamma_{n}^{\prime\prime}\right)  \Gamma

1268: _{n}^{\prime\prime\dag}\delta_{n}^{\prime}\\

1269: &  =\Gamma^{\prime}\Gamma^{\prime\prime\dag}\left(  \Gamma^{\prime\prime

1270: }-\Gamma_{n}^{\prime\prime}\right)  \Gamma_{n}^{\prime\prime\dag}\left(

1271: \Gamma_{n}^{\prime\ast}\phi+\Gamma_{n}^{\prime\prime}\psi+u_{n}^{\prime

1272: }\right)  ,\\

1273: &  =\Gamma^{\prime}\left(  \Gamma^{\prime\prime\dag}\right)  ^{1/2}\left(

1274: m_{1}+m_{2}+m_{3}\right)

1275: \end{align*}

1276: where

1277: \begin{align*}

1278: m_{1}  &  =\left(  \Gamma^{\prime\prime\dag}\right)  ^{1/2}\left(

1279: \Gamma^{\prime\prime}-\Gamma_{n}^{\prime\prime}\right)  \left(  \Gamma

1280: _{n}^{\prime\prime\dag}\right)  ^{1/2}\left(  \Gamma_{n}^{\prime\prime\dag

1281: }\right)  ^{1/2}\Gamma_{n}^{\prime\ast}\phi,\\

1282: m_{2}  &  =\left(  \Gamma^{\prime\prime\dag}\right)  ^{1/2}\left(

1283: \Gamma^{\prime\prime}-\Gamma_{n}^{\prime\prime}\right)  \Gamma_{n}%

1284: ^{\prime\prime\dag}\Gamma_{n}^{\prime\prime}\psi,\\

1285: m_{3}  &  =\left(  \Gamma^{\prime\prime\dag}\right)  ^{1/2}\left(

1286: \Gamma^{\prime\prime}-\Gamma_{n}^{\prime\prime}\right)  \Gamma_{n}%

1287: ^{\prime\prime\dag}e_{n}^{\prime}.

1288: \end{align*}

1289: First we drop $\Gamma^{\prime}\left(  \Gamma^{\prime\prime\dag}\right)

1290: ^{1/2}$ since the norm of this operator may be bounded by a constant

1291: independent from $\alpha$ (see Proposition \ref{lodge}). We turn to :%

1292: \begin{align*}

1293: \left\Vert m_{1}\right\Vert  &  \leq M\left\Vert \left(  \Gamma^{\prime\prime

1294: }-\Gamma_{n}^{\prime\prime}\right)  \right\Vert _{\infty}\left\Vert \left(

1295: \Gamma_{n}^{\prime\prime\dag}\right)  ^{1/2}\right\Vert _{\infty}\left\Vert

1296: \left(  \Gamma^{\prime\prime\dag}\right)  ^{1/2}\right\Vert _{\infty},\\

1297: \left\Vert m_{2}\right\Vert  &  \leq\left\Vert \psi\right\Vert _{L}\left\Vert

1298: \left(  \Gamma^{\prime\prime\dag}\right)  ^{1/2}\right\Vert _{\infty

1299: }\left\Vert \left(  \Gamma^{\prime\prime}-\Gamma_{n}^{\prime\prime}\right)

1300: \right\Vert _{\infty}%

1301: \end{align*}

1302: since $\left\Vert \Gamma_{n}^{\prime\prime\dag}\Gamma_{n}^{\prime\prime

1303: }\right\Vert _{\infty}\leq1$ almost surely. The consequence of the display

1304: above is $\left\Vert m_{1}\right\Vert _{L^{2}}=O\left(  \dfrac{1}{\alpha

1305: \sqrt{n}}\right)  $ and $\left\Vert m_{2}\right\Vert _{L^{2}}=O\left(

1306: \dfrac{1}{\sqrt{\alpha n}}\right)  .$

1307:

1308: We can deal with $m_{3}$ as was done within the proof of the preceding Lemma

1309: \ref{cran}. Clearly we may cope with $m_{3}$ as if the random $\Gamma

1310: _{n}^{\prime\prime\dag}$ was replaced by the non random $\Gamma^{\prime

1311: \prime\dag}$. We should study%

1312: \[

1313: \left[  \left(  \Gamma^{\prime\prime\dag}\right)  ^{1/2}\left(  \Gamma

1314: ^{\prime\prime}-\Gamma_{n}^{\prime\prime}\right)  \left(  \Gamma^{\prime

1315: \prime\dag}\right)  ^{1/2}\right]  \left[  \left(  \Gamma^{\prime\prime\dag

1316: }\right)  ^{1/2}e_{n}^{\prime}\right]  .

1317: \]

1318: It is enough to get a rate of decrease for each of the these terms. Once again

1319: we have :%

1320: \begin{align*}

1321: \left\Vert \left(  \Gamma^{\prime\prime\dag}\right)  ^{1/2}\left(

1322: \Gamma^{\prime\prime}-\Gamma_{n}^{\prime\prime}\right)  \left(  \Gamma

1323: ^{\prime\prime\dag}\right)  ^{1/2}\right\Vert _{L^{2}}  &  =O\left(  \frac

1324: {1}{\alpha\sqrt{n}}\right) \\

1325: \left\Vert \left(  \Gamma^{\prime\prime\dag}\right)  ^{1/2}e_{n}^{\prime

1326: }\right\Vert _{L^{2}}  &  =O\left(  \frac{1}{\sqrt{\alpha n}}\right)

1327: \end{align*}

1328: which completes the proof of Lemma \ref{u}.

1329: \end{proof}

1330:

1331: Now we are ready to go back to (\ref{T1}) and (\ref{T2}) as announced sooner.

1332:

1333: \begin{lemma}

1334: \label{A_n}We have :%

1335: \begin{align*}

1336: A_{n}  &  =O_{L^{2}}\left(  \frac{1}{\alpha\beta\sqrt{n}}\right)  ,\\

1337: B_{n}  &  =O_{L^{2}}\left(  \frac{1}{\alpha\beta\sqrt{n}}\right)  .

1338: \end{align*}

1339:

1340: \end{lemma}

1341:

1342: \begin{proof}

1343: Since

1344: \[

1345: \left\Vert A_{n}\right\Vert \leq\left\Vert \left(  S_{n}+\beta I\right)

1346: ^{-1}\right\Vert _{\infty}\left\Vert u_{n}-u\right\Vert _{W}%

1347: \]

1348: by Lemma \ref{u} and Proposition \ref{KT} we get the first desired result

1349: \end{proof}

1350:

1351: Once again the proof of the second relies on Proposition \ref{KT} and Lemma

1352: \ref{cran}. Indeed%

1353: \begin{align*}

1354: \left\Vert B_{n}\right\Vert _{W}  &  \leq\left\Vert \left(  S_{n}+\beta

1355: I\right)  ^{-1}\right\Vert _{\infty}\left\Vert S-S_{n}\right\Vert _{\infty

1356: }\left\Vert \phi\right\Vert _{W}\\

1357: &  \leq\dfrac{\left\Vert \phi\right\Vert _{W}}{\beta}\left\Vert S-S_{n}%

1358: \right\Vert _{\infty}%

1359: \end{align*}

1360: hence the result.

1361:

1362: We should deal with the last term. In a first step we prove that $S_{n}$ may

1363: be replaced by $S.$

1364:

1365: \begin{lemma}

1366: \label{C_n}When $\alpha\beta\sqrt{n}\rightarrow+\infty$,%

1367: \[

1368: C_{n}=\beta\left(  S+\beta I\right)  ^{-1}\phi\left(  1+o\left(  1\right)

1369: \right)  .

1370: \]

1371:

1372: \end{lemma}

1373:

1374: \begin{remark}

1375: The preceding equality should be understood with respect to the $L^{2}$ norm.

1376: \end{remark}

1377:

1378: \begin{proof}

1379: Successively,%

1380: \begin{align*}

1381: C_{n}  &  =\beta\left(  S_{n}+\beta I\right)  ^{-1}\phi\\

1382: &  =\beta\left(  \left(  S_{n}+\beta I\right)  ^{-1}-\left(  S+\beta I\right)

1383: ^{-1}\right)  \phi+\beta\left(  S+\beta I\right)  ^{-1}\phi\\

1384: &  =\left[  \left(  \left(  S_{n}+\beta I\right)  ^{-1}\left(  S-S_{n}\right)

1385: \right)  +I\right]  \beta\left(  S+\beta I\right)  ^{-1}\phi

1386: \end{align*}

1387: and

1388: \[

1389: \left\Vert C_{n}\right\Vert \leq\left\Vert \beta\left(  S+\beta I\right)

1390: ^{-1}\phi\right\Vert _{W}\left(  1+\left\Vert \left(  S_{n}+\beta I\right)

1391: ^{-1}\left(  S-S_{n}\right)  \right\Vert _{\infty}\right)  .

1392: \]

1393: Now it suffices to apply Lemma \ref{A_n} to get the desired result.

1394: \end{proof}

1395:

1396: The next Lemma may be hard to understand at first glance. Within the

1397: forthcoming proof of Theorem \ref{conv} the bias term $C_{n}$ will slightly

1398: change. We refer to displays (\ref{E1}) and (\ref{E2}) below for a deeper understanding.

1399:

1400: \begin{lemma}

1401: \label{placebo}The following holds :%

1402: \[

1403: \left\Vert \Gamma^{1/2}\left(  S+\beta I\right)  ^{-1}\Gamma^{1/2}\right\Vert

1404: _{\infty}=O\left(  \frac{1}{\alpha}\right)  .

1405: \]

1406:

1407: \end{lemma}

1408:

1409: \begin{proof}

1410: Once again it takes two steps to get the result. First note that $\Gamma

1411: ^{1/2}\left(  S\right)  ^{-1}\Gamma^{1/2}$ is a bounded linear operator.

1412: Indeed%

1413: \begin{equation}

1414: S=\Gamma-\Gamma^{\prime}\Gamma^{\prime\prime\dag}\Gamma^{\prime\ast}%

1415: =\Gamma^{1/2}\Lambda_{\alpha}\Gamma^{1/2} \label{queer}%

1416: \end{equation}

1417: where $R=D\Gamma^{1/2},$%

1418: \[

1419: \Lambda_{\alpha}=I-R^{\ast}\left(  RR^{\ast}+\alpha I\right)  ^{-1}R.

1420: \]

1421: The Schmidt decomposition of $R$ is (see (\ref{mano}) above for the empirical

1422: version) :%

1423: \[

1424: R=\sum_{k\in\mathbb{N}}\mu_{k}\left(  u_{k}\otimes v_{k}\right)  .

1425: \]

1426: where $\left(  u_{k}\right)  _{k\in\mathbb{N}}$ (resp. $\left(  v_{k}\right)

1427: _{k\in\mathbb{N}}$) is a complete orthonormal system in $W$ (resp. $L$). Hence

1428: :%

1429: \begin{align*}

1430: \Lambda_{\alpha}  &  =\sum_{k\in\mathbb{N}}\left(  1-\dfrac{\mu_{k}^{2}}%

1431: {\mu_{k}^{2}+\alpha}\right)  \left(  u_{k}\otimes u_{k}\right) \\

1432: &  =\sum_{k\in\mathbb{N}}\dfrac{\alpha}{\mu_{k}^{2}+\alpha}\left(

1433: u_{k}\otimes u_{k}\right)  .

1434: \end{align*}

1435: The operator $\Lambda_{\alpha}$ has a bounded inverse%

1436: \[

1437: \Lambda_{\alpha}^{-1}=\dfrac{1}{\alpha}\sum_{k\in\mathbb{N}}\left(  \mu

1438: _{k}^{2}+\alpha\right)  \left(  u_{k}\otimes u_{k}\right)

1439: \]

1440: and $\left\Vert \Lambda_{\alpha}^{-1}\right\Vert _{\infty}=1+\left(  \sup

1441: \mu_{k}^{2}\right)  /\alpha\leq M/\alpha$ for $M$ large enough (or $\alpha$

1442: small enough).\newline Hence%

1443: \begin{equation}

1444: \Gamma^{1/2}\left(  S\right)  ^{-1}\Gamma^{1/2}=\Gamma^{1/2}\Gamma

1445: ^{-1/2}\Lambda_{\alpha}^{-1}\Gamma^{-1/2}\Gamma^{1/2}=\Lambda_{\alpha}^{-1}.

1446: \label{BH}%

1447: \end{equation}

1448: Now (second step) we prove that :%

1449: \[

1450: \Gamma^{1/2}\left(  S+\beta I\right)  ^{-1}\Gamma^{1/2}\ll\Gamma^{1/2}%

1451: S^{-1}\Gamma^{1/2}.

1452: \]

1453: Let us pick a given $x$ in $W$, then

1454: \[

1455: \left\langle \Gamma^{1/2}\left(  S+\beta I\right)  ^{-1}\Gamma^{1/2}%

1456: x,x\right\rangle _{W}=\left\langle \left(  S+\beta I\right)  ^{-1}\Gamma

1457: ^{1/2}x,\Gamma^{1/2}x\right\rangle _{W}%

1458: \]

1459: It suffices to get for all $y$ in in the domain of operator $\Gamma^{-1/2}$ :%

1460: \begin{equation}

1461: \left\langle \left(  S+\beta I\right)  ^{-1}y,y\right\rangle _{W}%

1462: \leq\left\langle S^{-1}y,y\right\rangle _{W} \label{CT}%

1463: \end{equation}

1464: Standard results on the spectrum of $\left(  S+\beta I\right)  ^{-1}S$ prove

1465: that $\left(  S+\beta I\right)  ^{-1}S\geq0$ and that $\left\Vert \left(

1466: S+\beta I\right)  ^{-1}S\right\Vert \leq1$ which is enough to claim

1467: (\ref{CT}).\newline We are now in position to finixh the proof of the Lemma.

1468: It is plain from (\ref{CT}) that%

1469: \[

1470: \left\Vert \Gamma^{1/2}\left(  S+\beta I\right)  ^{-1}\Gamma^{1/2}\right\Vert

1471: _{\infty}\leq\left\Vert \Gamma^{1/2}\left(  S\right)  ^{-1}\Gamma

1472: ^{1/2}\right\Vert _{\infty}=\left\Vert \Lambda_{\alpha}^{-1}\right\Vert

1473: _{\infty}\leq\frac{C}{\alpha}%

1474: \]

1475: which is the claimed result.\bigskip

1476: \end{proof}

1477:

1478: \textbf{Proof of Theorem }\ref{conv}\textbf{:}

1479:

1480: Now starting from (\ref{decomp}) we get%

1481: \begin{align}

1482: \left\Vert \Gamma^{1/2}\left(  \phi-\widehat{\phi}\right)  \right\Vert

1483: _{W}^{2}  &  \leq M\left\Vert \Gamma^{1/2}\left(  A_{n}+B_{n}+C_{n}\right)

1484: \right\Vert _{W}^{2}\nonumber\\

1485: &  \leq M\left(  \left\Vert A_{n}\right\Vert _{W}^{2}+\left\Vert

1486: B_{n}\right\Vert _{W}^{2}+\left\Vert \Gamma^{1/2}C_{n}\right\Vert _{W}%

1487: ^{2}\right)  . \label{E1}%

1488: \end{align}

1489:

1490:

1491: Lemmas \ref{A_n} gives the rates of convergence for $\left\Vert A_{n}%

1492: \right\Vert _{W}^{2}$ and $\left\Vert B_{n}\right\Vert _{W}^{2}$ respectively.

1493: But Lemma \ref{C_n} is unfortunately not enough to get a rate in the last

1494: term. However this previous Lemma enables to focus on :%

1495: \begin{equation}

1496: \beta\Gamma^{1/2}\left(  S+\beta I\right)  ^{-1}\phi=\beta\Gamma^{1/2}\left(

1497: S+\beta I\right)  ^{-1}\Gamma^{1/2}\Gamma^{-1/2}\phi\label{E2}%

1498: \end{equation}

1499: and

1500: \begin{equation}

1501: \left\Vert \Gamma^{1/2}C_{n}\right\Vert _{W}^{2}\leq M\beta^{2}\left\Vert

1502: \Gamma^{1/2}\left(  S+\beta I\right)  ^{-1}\Gamma^{1/2}\right\Vert _{\infty

1503: }^{2}\left\Vert \Gamma^{-1/2}\phi\right\Vert _{W}. \label{marin}%

1504: \end{equation}

1505: By assumption $\mathbf{A4}$, $\left\Vert \Gamma^{-1/2}\phi\right\Vert _{W}$ is

1506: finite. We deal with the central term, namely :%

1507: \begin{align*}

1508: \Gamma^{1/2}\left(  S+\beta I\right)  ^{-1}\Gamma^{1/2}  &  =\Gamma

1509: ^{1/2}\left(  \Gamma^{1/2}\Lambda_{\alpha}\Gamma^{1/2}+\beta I\right)

1510: ^{-1}\Gamma^{1/2}\\

1511: &  \ll\Gamma^{1/2}\left(  \Gamma^{1/2}\Lambda_{\alpha}\Gamma^{1/2}\right)

1512: ^{-1}\Gamma^{1/2}=\Lambda_{\alpha}^{-1}.

1513: \end{align*}

1514: (see (\ref{queer})) and%

1515: \[

1516: \left\Vert \Lambda_{\alpha}^{-1}\right\Vert _{\infty}^{2}=O\left(  \alpha

1517: ^{-2}\right)  .

1518: \]

1519: Collecting this last display with (\ref{marin}) we get%

1520: \[

1521: \left\Vert \Gamma^{1/2}C_{n}\right\Vert _{W}^{2}=O\left(  \frac{\beta^{2}%

1522: }{\alpha^{2}}\right)  .

1523: \]

1524: This finishes the proof of Theorem \ref{conv}.

1525:

1526: \begin{thebibliography}{99}                                                                                               %

1527:

1528:

1529: \bibitem {AF}Adams R.A. and Fournier J.J.F., 2003. Sobolev spaces, Academic

1530: Press, 2nd ed.

1531:

1532: \bibitem {Amato.et.al.CSDA}Amato U., Antoniadis A., Feiss I., 2006. Dimension

1533: reduction in functional regression with applications, to appear in

1534: \textit{Computational Statistics and Data Analysis.}

1535:

1536: \bibitem {Bos3}Bosq, D., 2000. \textit{Linear processes in function spaces}.

1537: Lectures notes in statistics. Springer Verlag.

1538:

1539: \bibitem {Brown.Fearn.Vanucci}Brown, P.J., Fearn, T. and Vanucci M., 2001.

1540: Bayesian wavelet regression on curves with application to a spectroscopic

1541: calibration problem, \textit{Journal of the American Statistical

1542: Association.}, 96 (454), 398--408.

1543:

1544: \bibitem {CH}Cai T., Hall, P., 2006. Prediction in functional linear

1545: regression, \textit{Annals of Statistics}, \textbf{34}, n$%

1546: %TCIMACRO{\U{b0}}%

1547: %BeginExpansion

1548: {{}^\circ}%

1549: %EndExpansion

1550: 5$.

1551:

1552: \bibitem {Cardot.SEFL}Cardot H., Ferraty F., Sarda P., 2006. Spline estimators

1553: for the functional linear model: Consistency, Application and Splus

1554: implementation, To appear in \textit{Statistica Sinica}.

1555:

1556: \bibitem {CMS2006}Cardot H., Mas A., Sarda P, 2006. Weak convergence in the

1557: functional linear model. To appear in \textit{Probability Theory and Related

1558: Fields}.

1559:

1560: \bibitem {DS}Dunford, N. and Schwartz, J.T., 1988. \textit{Linear Operators,

1561: Vol. I \& II}. Wiley Classics Library.

1562:

1563: \bibitem {FV}Ferraty, F., Vieu P., 2003. The functional nonparametric model

1564: and application to spectrometric data. \textit{Computational Statistics}

1565: \textbf{17} n$%

1566: %TCIMACRO{\U{b0}}%

1567: %BeginExpansion

1568: {{}^\circ}%

1569: %EndExpansion

1570: 4,$ 545-564.

1571:

1572: \bibitem {FV2}Ferraty, F., Vieu P., 2006. Nonparametic Functional Data

1573: Analysis, Springer.

1574:

1575: \bibitem {Frank.Friedman}Frank I.E. and Friedman J.H., 1993. A statistical

1576: view of some chemometrics regression tools, \textit{Technometrics},

1577: \textbf{35}, no 2, 109-135.

1578:

1579: \bibitem {GGK}Gohberg, I., Goldberg, S. and Kaashoek,M.A., 1991.

1580: \textit{Classes of linear operators Vol I \& II. Operator Theory : advances

1581: and applications}, Birkha\"{u}ser Verlag.

1582:

1583: \bibitem {Hastie.Mallows}Hastie T. and Mallows C., 1993. Discussion on the

1584: paper of Frank I.E. and Friedman J.H., 1993, A statistical view of some

1585: chemometrics regression tools, \textit{Technometrics}, \textbf{35}, no 2, 140-143.

1586:

1587: \bibitem {LT}Ledoux, M. and Talagrand, M, 1991 : \textit{Probability in Banach

1588: spaces. Isoperimetry and processes}. Ergebnisse der Mathematik und ihrer

1589: Grenzgebiete, \textbf{23}. Springer-Verlag, Berlin.

1590:

1591: \bibitem {Marx.Eilers}Marx, B.D. and Eilers, P.H.C. (2002) Multivariate

1592: calibration stability: a comparison of methods, \textit{Journal of

1593: Chemometrics}, \textbf{16}, 129-140.

1594:

1595: \bibitem {MP}Mas A, Pumo, B, 2006. The ARHD model. To appear in JSPI.

1596: Technical report available at

1597: \textit{http://fr.arxiv.org/PS\_cache/math/pdf/0502/0502285.pdf}

1598:

1599: \bibitem {marion.pumo}Marion J.M., Pumo B., 2004. Comparaison des mod\`{e}les

1600: ARH(1) et ARHD(1) sur des donn\'{e}es physiologiques, \textit{Annales de

1601: l'ISUP}, \textbf{48}, 3, pp. 29-38.

1602:

1603: \bibitem {Osborne.et.al.}Osborne, B.J., Fearn, T., Miller, A.R. and Douglas,

1604: S. (1984) Application of Near-Infrared Reflectance Spectroscopy to

1605: Compositional Analysis of Biscuits Dougts, \textit{J. of the Sc. of Food and

1606: Agricult.}, \textbf{35}, 99--105

1607:

1608: \bibitem {Ram1}Ramsay J.O., 2000. Differential equation models for statistical

1609: functions, \textit{Canadian Journal of Statistics}, \textbf{28}, n$%

1610: %TCIMACRO{\U{b0}}%

1611: %BeginExpansion

1612: {{}^\circ}%

1613: %EndExpansion

1614: 2,$ 225-240.

1615:

1616: \bibitem {RamDal}Ramsay J.O., Dalzell C.J., 1991, Some tools for functional

1617: data analysis (with dsicussion), \textit{Journal of the Royal Statistical

1618: Society}, B, \textbf{53}, 539-572.

1619:

1620: \bibitem {ramsey}Ramsay J.O., Silverman B.W., 1997. \textit{Functional Data

1621: Analysis}, Springer.

1622:

1623: \bibitem {ramsay2}Ramsay J.O., Silverman B.W., 2002. \textit{Applied

1624: Functional Data Analysis: Methods and Case Studies}, Springer.

1625:

1626: \bibitem {Silv}Silverman B.W., 1996. Smoothed functional principal component

1627: analysis by choice of norm, \textit{Annals of Statistics}, \textbf{24}, No 1, 1-24.

1628:

1629: \bibitem {TA}Tikhonov A.N., Arsenin V.Y., 1977. \textit{Solutions of ill-posed

1630: problems}. V.H. Winstons and sons, Washington..

1631:

1632: \bibitem {Z}Ziemer W.P., 1989. Weakly differentiable functions.

1633: \textit{Sobolev spaces and functions of bounded variations}. Graduate Text in

1634: Mathematics 120. Springer-Verlag, New-York.

1635: \end{thebibliography}

1636:

1637:

1638: \end{document}