0503:quant-ph0503222/QPF.tex

1: %\documentclass[12pt]{iopart}

2: \documentclass{modifiedstl}

3:

4: \usepackage{graphicx}

5: \usepackage{modifiedams}

6: \usepackage{cite}

7:

8: \begin{document}

9:

10: \title[Quantum projection filter]{Quantum projection filter for a highly

11: nonlinear model in cavity QED}

12:

13: \author{Ramon van Handel\footnote[3]{E-mail

14: address: ramon@its.caltech.edu} and Hideo Mabuchi\footnote[2]{E-mail

15: address: hmabuchi@its.caltech.edu}}

16:

17: \address{Physical Measurement and Control 266-33, California Institute of

18: Technology, Pasadena, CA 91125, USA}

19:

20: \begin{abstract}

21: Both in classical and quantum stochastic control theory a major role is

22: played by the filtering equation, which recursively updates the

23: information state of the system under observation.  Unfortunately, the

24: theory is plagued by infinite-dimensionality of the information state

25: which severely limits its practical applicability, except in a few select

26: cases (e.g.\ the linear Gaussian case.)  One solution proposed in

27: classical filtering theory is that of the projection filter. In this

28: scheme, the filter is constrained to evolve in a finite-dimensional family

29: of densities through orthogonal projection on the tangent space with

30: respect to the Fisher metric.  Here we apply this approach to the simple

31: but highly nonlinear quantum model of optical phase bistability of a

32: stongly coupled two-level atom in an optical cavity.  We observe

33: near-optimal performance of the quantum projection filter, demonstrating

34: the utility of such an approach.

35: \end{abstract}

36:

37: %Uncomment for PACS numbers title message

38: \pacs{00.00, 20.00, 42.10}

39:

40: % Uncomment for Submitted to journal title message

41: %\submitto{\JPA}

42:

43: % Comment out if separate title page not required

44: %\maketitle

45:

46: \section{Introduction}

47:

48: Over the past decade it has become increasingly clear that feedback

49: control of quantum systems is essentially a problem of stochastic control

50: theory with partial observations

51: \cite{s:belavkin2,s:doherty2,s:VanHandel2005}. In this context, the system

52: and observations are generally modeled as a pair of It\^o (quantum)

53: stochastic differential equations.  It is then the goal of the control

54: engineer to find a feedback control policy, depending on the system state

55: only through the past history of the observations, that achieves a

56: particular control objective.

57:

58: In the case of linear system dynamics and observations and Gaussian

59: initial conditions, the so-called optimal control problem can be solved

60: exactly both classically \cite{s:bensoussan} and quantum-mechanically

61: \cite{s:belavkin2,s:doherty1} provided that a quadratic performance

62: criterion is chosen.  This means that the control objective is specified

63: as an optimization problem, where a certain cost function (the performance

64: criterion) of the system evolution and the control signal is to be

65: minimized.  The resulting Linear-Quadratic-Gaussian (LQG) control is

66: widely used in many technological applications.  An important feature of

67: LQG theory is its {\it separation structure:} the optimal controller

68: splits up into a part that updates the optimal estimate of the system

69: state given the observations (the Kalman filter), and an optimal feedback

70: law which is only a function of the state estimate.

71:

72: It was originally suggested by Mortensen \cite{s:mortensen} that the

73: separation structure of LQG control carries over even to the nonlinear

74: case.  The problem now separates into the nonlinear filtering problem of

75: finding the optimal estimate of the system statistics given the

76: observations and the optimal control problem of finding a feedback law,

77: based only on the filtered estimate, that minimizes some performance

78: criterion.  The estimate propagated by the filter is often referred to as

79: the {\it information state} \cite{s:james2} as it contains all information

80: of the system possessed by the observer.  Unfortunately, nonlinear

81: stochastic control is plagued by two rather severe problems.  First, the

82: information state is generally infinite-dimensional even for very simple

83: nonlinear systems \cite{s:hazewinkelmarcus}. Second, even in the

84: finite-dimensional case the nonlinear optimal control problem is generally

85: intractable.  The latter can sometimes be alleviated by posing a less

86: stringent control objective \cite{s:VanHandel2005}. Nonetheless nonlinear

87: stochastic control remains an extremely challenging topic, both in the

88: classical and quantum mechanical case.

89:

90: This paper is concerned with the first problem, that of

91: infinite-dimensionality of the nonlinear information state.  There is no

92: universal solution to this problem.  The most common (though rather {\it

93: ad hoc}) approach used by engineers is known as the extended Kalman filter

94: \cite{s:jacobs}.  In this scheme, the system dynamics is linearized around

95: the current expected system state, and a Kalman filter based on the linear

96: approximation is used to propagate the estimate.  However, aside from the

97: fact that the method only performs well for nearly linear systems, it is

98: not clear how it can be applied to quantum models\footnote{

99: 	If the system dynamics can be meaningfully expressed in terms

100: 	of conjugate pairs of observables, one could imagine locally

101: 	linearizing the system Langevin equations to obtain a quantum

102: 	extended Kalman filter.  To our knowledge this has not yet been

103: 	attempted.  However, it is not clear how to do this in e.g.\

104: 	atomic systems, where the internal degrees of freedom do not

105: 	obey CCR.

106: }.

107:

108: A much more flexible approximation for nonlinear filtering equations was

109: proposed by Brigo, Hanzon and LeGland \cite{s:brigo1,s:brigo2,s:brigo3},

110: based on the differential geometric methods of information geometry

111: \cite{s:amari}.  In this scheme we fix a finite-dimensional family of

112: densities that are assumed to be good approximations to the information

113: state.  Using geometric methods the filter is then constrained to evolve

114: in this family.  The finite-dimensional approximate filter obtained in

115: this way is known as a projection filter, and often performs extremely

116: well when the approximating family is chosen wisely.  Moreover, as this

117: approximate filter is based on the optimal nonlinear filter, instead of on

118: the trajectories of the system state in phase space, it is readily

119: extended to the quantum case.  Though by no means a universal solution to

120: the filtering problem, we believe that the flexibility and performance of

121: this method likely make it widely applicable in the realistic (real-time)

122: implementation of quantum filtering theory.

123:

124: In this paper we apply the projection filtering method to a simple, but

125: highly nonlinear quantum system: a stongly driven, strongly coupled

126: two-level atom in a resonant single-mode optical cavity

127: \cite{s:alsing,s:mabuchi}. The output field of such an experiment exhibits

128: a randomly switching phase, caused by the atomic spontaneous emission. The

129: formalism developed by Brigo {\it et al.}\ can be applied directly to this

130: system if the information state (the conditional density of the atom and

131: cavity mode) is represented as a $Q$-function \cite{s:mandel}.

132: Remarkably, our projection filter shows strong connections to the

133: classical problem of filtering a random jump process in additive white

134: noise \cite{s:wonham,s:vellekoop}.

135:

136: Rather than using a quasiprobability representation, a fully quantum

137: theory of projection filtering is expressed in terms of finite-dimensional

138: families of density operators and quantum information geometry

139: \cite{s:amari}.  We will present the general theory in a future

140: publication.  Nonetheless there is no theoretical objection to the

141: approach taken in this paper.  In fact, we observe numerically that the

142: projection filter for our model has near-optimal performance,

143: demonstrating the utility of this approach.

144:

145: This paper is organized as follows.  In section \ref{sec:proj} we

146: introduce the projection filter and the neccessary elements of information

147: geometry.  Next, in section \ref{sec:phys}, we introduce the physical

148: model that we will be using as an example and obtain the associated

149: filtering equation.  In section \ref{sec:proj2} we obtain the projection

150: filter for our model.  Finally, in section \ref{sec:num}, we present and

151: discuss the results of numerical simulations.

152:

153:

154: \section{Information geometry and the projection filter}

155: \label{sec:proj}

156:

157: \subsection{The basic principle of the projection filter}

158:

159: \begin{figure}

160: \includegraphics[width=\textwidth]{PFilter.eps}

161: \caption{

162: \label{f:pfilter} Cartoon drawing of the projection filter.  (a) The

163: infinite-dimensional space of all densities is represented by $M$,

164: while $S$ is a finite-dimensional submanifold.  The filter, defined

165: by a (stochastic) differential equation in $M$, flows along a (random)

166: vector field on $M$.  The flow is such that the density remains close to

167: $S$, but $S$ is not invariant.  (b) To each point $\theta\in S$ the flow

168: associates a (random) tangent vector $X[\theta]$ which has components in

169: both $T_\theta S$ and its complement.  The projection filter is generated

170: by the vector field in $TS$ that, for each $\theta$, is the orthogonal

171: projection of $X[\theta]$ onto $T_\theta S$.}

172: \end{figure}

173:

174: The basic idea behind the projection filter is illustrated in Figure

175: \ref{f:pfilter}.  First, we assume that the information state can be

176: represented as a probability density, i.e.\ an integrable nonnegative

177: function on some underlying phase space.  Though this is not always the

178: case even in classical probability, this is generally a good assumption

179: for any ``reasonable'' model.  In this paper we will use a well-known

180: quantum quasiprobability distribution, the $Q$-function \cite{s:mandel},

181: for this purpose.  The set of all possible densities forms an

182: infinite-dimensional function space which we will denote by $M$.

183:

184: We also suppose that the information state is well approximated by

185: densities in some finite-dimensional subspace $S$ of $M$.  We will assume

186: that $S$ can be given the structure of a differential manifold, but we do

187: not require it to be a linear space.  As such we must be careful in what

188: follows in distinguishing between points in $S$, points in the tangent

189: bundle $TS$, etc., as in any differential geometric situation.

190:

191: In general, $S$ will not be an invariant set of the filter; if we start

192: with a density in $S$, the filter will cause the density to evolve into a

193: neighborhood of $S$ in $M$.  The idea behind the projection filter is

194: simply to constrain the optimal filter to remain in $S$.  As $S$ is

195: finite-dimensional, we can then express the projection filter as a

196: differential equation in a finite set of local coordinates on $S$.

197:

198: The optimal filter that propagates the information state of the system is

199: given by a stochastic differential equation (SDE) in $M$, and we are

200: seeking to express the projection filter as an SDE in $S$.  The precise

201: meaning of this statement is a somewhat important point which we will

202: return to at the end of this section; for now, we can imagine the filter

203: to be an ordinary differential equation that is driven by the

204: observations, as follows:

205: \begin{equation}

206: \label{eq:vectorf}

207: 	\frac{dp_t}{dt}=X[p_t;Y_t]

208: \end{equation}

209: Here $p_t\in M$ is the information state and $Y_t$ is the observation made

210: at time $t$.  $X$, then, is an observation-dependent vector field on $M$.

211:

212: To constrain the filter to evolve in $S$ we must only retain the dynamics

213: of (\ref{eq:vectorf}) that is parallel to $S$; dynamics perpendicular to

214: $S$ will move the density into an undesired region.  Mathematically, this

215: idea is simply implemented if we realize that at each point $\theta\in S$,

216: $X[\theta;Y]$ will have components both in the tangent space $T_\theta S$

217: and in its complement $T_\theta S^\perp$.  We can now constrain the vector

218: field by orthogonally projecting $X[\theta;Y]$ onto $T_\theta S$ for every

219: $\theta\in S$.  The resulting approximate filter, in which only the

220: dynamics that leaves $S$ invariant is retained, is the projection filter

221: \cite{s:brigo1,s:brigo2,s:brigo3}.

222:

223: Before we can flesh out the details of this scheme we must deal with the

224: fact that the filter is not given by a differential equation as in

225: (\ref{eq:vectorf}), but by an SDE of the form

226: \begin{equation}

227: \label{eq:vectors}

228: 	dp_t=A[p_t]\,dt+B[p_t]\,dY_t

229: \end{equation}

230: We would like to think of $A+B\dot Y_t$ as a ``stochastic vector field''

231: so that we can directly apply the scheme discussed above.  The theory of

232: stochastic differential equations on manifolds \cite{s:bismut,s:rogersw}

233: tells us that we can in fact do this, as long as we interpret

234: (\ref{eq:vectors}) as a {\it Stratonovich} SDE

235: \begin{equation}

236: \label{eq:stratflt}

237: 	dp_t=A[p_t]\,dt+B[p_t]\circ dY_t

238: \end{equation}

239: This is not surprising as, for example, It\^o's rule is incompatible with

240: the requirement that the Lie derivative along a vector field is a

241: derivation \cite{s:marsden} (in other words, a differential geometric

242: transformation rule can only contain first derivatives, and the only

243: stochastic integral with this property is the Stratonovich integral.)

244: Note that usually filtering equations are given in the It\^o form; hence

245: we must transform to the Stratonovich form before we can derive the

246: projection filter.

247:

248: \subsection{Information geometry}

249:

250: In order to perform the key step in the above procedure, the orthogonal

251: projection, we need an inner product in the tangent space $T_\theta S$.

252: A differential manifold is not naturally endowed with an inner product

253: structure, however, and hence the projection filter is not yet well

254: defined.  We need to add to the manifold a Riemannian structure

255: \cite{s:lafontaine}.  In statistics there is a natural way to do this, and

256: the resulting theory is known as information geometry \cite{s:amari}.

257:

258: There are different ways of introducing this structure, but perhaps the

259: easiest treatment is obtained by considering instead of the densities $M$

260: the space of {\it square roots} of densities $M^{1/2}$. The fact that any

261: density is integrable guarantees that the square root of any density is

262: square integrable; hence $M^{1/2}$ is a subspace of $L^2$, the space of

263: square integrable functions, and any vector field on $M^{1/2}$ takes

264: values in $L^2$.

265:

266: Similarly, we consider the manifold $S^{1/2}$, which we will explicitly

267: parametrize as

268: \begin{equation}

269:    S^{1/2}=\{\sqrt{p(\cdot,\theta)},~~\theta\in\Theta\subset\mathbb{R}^m\}

270: \end{equation}

271: That is, $S^{1/2}$ is a finite-dimensional manifold of square roots of

272: densities, parametrized by the local coordinates\footnote{

273: 	By writing this, we are assuming that the entire manifold can be

274: 	covered by a single coordinate chart.  Without this assumption

275: 	the description would be more complicated, as then we couldn't

276: 	describe the projection filter using a simple ``extrinsic'' SDE

277: 	in $\mathbb{R}^m$.

278: 	Often we can make our manifold obey this property simply by

279: 	removing a few points; we will see an example of this later.

280: } $\theta\in\Theta$.  As $S^{1/2}\subset M^{1/2}$, for any

281: $\theta\in\Theta$ the tangent space $T_\theta S^{1/2}$ is the linear

282: subspace of $L^2$ given by

283: \begin{equation}

284: \label{eq:tthetas}

285: 	T_\theta S^{1/2}=\mbox{Span}\left[

286: 		\frac{\partial\sqrt{p(\cdot,\theta)}}{\partial\theta^1},

287: 		\cdots,

288: 		\frac{\partial\sqrt{p(\cdot,\theta)}}{\partial\theta^m}

289: 	\right]\subset L^2

290: \end{equation}

291: The reason for working with square roots of densities is that this gives

292: a natural inner product in the tangent space, which is simply the standard

293: $L^2$-inner product.  In particular, we can calculate the associated

294: metric tensor in the basis of (\ref{eq:tthetas}):

295: \begin{equation}

296: \label{eq:fisher}

297: 	\left\langle

298: 	\frac{\partial\sqrt{p(\cdot,\theta)}}{\partial\theta^i},

299: 	\frac{\partial\sqrt{p(\cdot,\theta)}}{\partial\theta^j}

300: 	\right\rangle=

301: 	\int

302: 	\frac{\partial\sqrt{p(x,\theta)}}{\partial\theta^i}

303: 	\frac{\partial\sqrt{p(x,\theta)}}{\partial\theta^j}dx=

304: 	\frac{1}{4}g_{ij}(\theta)

305: \end{equation}

306: Up to a factor of $1/4$, this is the well-known Fisher information matrix

307: $g_{ij}(\theta)$.

308:

309: We are now in the position to define what we mean by orthogonal projection

310: of a vector field on $M$ onto $TS$.  At each $\theta$, the orthogonal

311: projection is

312: \begin{equation}

313: \label{eq:orthog}

314: 	\Pi_\theta X[\theta]=\sum_{i=1}^m\sum_{j=1}^m

315: 		4g^{ij}(\theta)

316: 	\left\langle X[\theta],

317: 	\frac{\partial\sqrt{p(\cdot,\theta)}}{\partial\theta^j}

318: 	\right\rangle

319: 	\frac{\partial\sqrt{p(\cdot,\theta)}}{\partial\theta^i},

320: \end{equation}

321: where we have used the inverse Fisher information matrix $g^{ij}(\theta)$

322: to account for the fact that the basis of (\ref{eq:tthetas}) is not

323: orthogonal.  This is the main result that is needed to obtain projection

324: filters.

325:

326: \subsection{Orthogonal projection of a Stratonovich filter}

327:

328: Let us now discuss how to perform orthogonal projection onto a

329: finite-dimensional manifold $S$ for the very general form

330: (\ref{eq:stratflt}) of a filtering equation.  We begin by converting the

331: equation to the square root form; this gives

332: \begin{equation}

333: 	d\sqrt{p_t}=\frac{1}{2\sqrt{p_t}}A[p_t]\,dt

334: 			+\frac{1}{2\sqrt{p_t}}B[p_t]\circ dY_t

335: \end{equation}

336: We now constrain the filter to evolve on $S^{1/2}$ through orthogonal

337: projection:

338: \begin{equation}

339: \label{eq:pfiltfll}

340: 	d\sqrt{p(\cdot,\theta_t)}=

341: 		\Pi_{\theta_t}

342: 		\frac{

343: 			A[p(\cdot,\theta_t)]

344: 		}{2\sqrt{p(\cdot,\theta_t)}}\,dt

345: 		+\Pi_{\theta_t}

346: 		\frac{

347: 			B[p(\cdot,\theta_t)]

348: 		}{2\sqrt{p(\cdot,\theta_t)}}\circ dY_t

349: \end{equation}

350: This is just a finite-dimensional SDE for the parameters $\theta_t$.  To

351: convert the expression explicitly into this form, note that by the

352: Stratonovich transformation rule

353: \begin{equation}

354: 	d\sqrt{p(\cdot,\theta_t)}=

355: 	\sum_i\frac{\partial\sqrt{p(\cdot,\theta_t)}}{\partial\theta_t^i}

356: 	\circ d\theta_t^i

357: \end{equation}

358: Comparing with (\ref{eq:orthog}) and (\ref{eq:pfiltfll}), we find that

359: \begin{equation}

360: \label{eq:projf}

361: 	d\theta_t^i=

362: 		\left\langle

363: 		\frac{

364: 			A[p(\cdot,\theta_t)]

365: 		}{p(\cdot,\theta_t)},\Lambda_t^i(\cdot,\theta_t)

366: 		\right\rangle dt + \left\langle

367: 		\frac{

368: 			B[p(\cdot,\theta_t)]

369: 		}{p(\cdot,\theta_t)},\Lambda_t^i(\cdot,\theta_t)

370: 		\right\rangle

371: 		\circ dY_t

372: \end{equation}

373: where

374: \begin{equation}

375: \label{eq:projf2}

376: 	\Lambda_t^i(\cdot,\theta_t)=

377: 	\sum_{j=1}^m

378: 	g^{ij}(\theta_t)\frac{\partial p(\cdot,\theta_t)}{\partial\theta^j_t}

379: \end{equation}

380: Equations (\ref{eq:projf}), (\ref{eq:projf2}) and (\ref{eq:fisher}) can be

381: used to directly calculate the projection filter for a wide range of

382: models.

383:

384: \section{The physical model and the quantum filter}

385: \label{sec:phys}

386:

387: \subsection{The Jaynes-Cummings model in the strong driving limit}

388: \label{sec:jc1}

389:

390: \begin{figure}

391: \includegraphics[width=\textwidth]{Jaynes.eps}

392: \caption{

393: \label{f:jaynes} Schematic of the experimental setup that corresponds to

394: our model.  A strongly coupled two-level atom in a resonant, single mode

395: cavity is strongly driven by a resonant driving laser, and spontaneously

396: emits in all directions.  One of the cavity mirrors is leaky, and the

397: atomic dynamics is observed through homodyne detection of the

398: electromagnetic field in this forward mode.}

399: \end{figure}

400:

401: We consider the following physical system, shown in Figure \ref{f:jaynes}.

402: A two-level atom is strongly coupled to the mode of a single-mode cavity.

403: The cavity mode and the atomic frequency are resonant.  The atom is

404: strongly driven on resonance by a laser, and spontaneously emits in all

405: directions.  A forward mode of the electromagnetic field outside the

406: cavity, initially in the vacuum state, scatters off one of the cavity

407: mirrors.  By making this mirror slightly leaky, we extract information

408: from the system into the external field.  Homodyne detection of the

409: forward mode then yields information about the atom and cavity.

410:

411: The goal of this section is to model this physical system as a pair of

412: It\^o quantum stochastic differential equations, one of which describes

413: the atom-cavity evolution and one describing the homodyne observations.

414: To this end, we begin by writing down the full Hamiltonian for the system:

415: \begin{equation}

416: 	H=H_0+H_d+H_{\rm JC}+H_f+H_s

417: \end{equation}

418: Here $H_0$ is the free Hamiltonian

419: \begin{equation}

420: 	H_0=

421: 	\hbar\omega_0a^\dag a+

422: 	\frac{\hbar\omega_0}{2}\sigma_z

423: 	+\int_0^\infty d\omega\,\hbar\omega (b_f^\dag(\omega)b_f(\omega)

424: 		+b_s^\dag(\omega)b_s(\omega)),

425: \end{equation}

426: $H_d$ is the drive Hamiltonian

427: \begin{equation}

428: 	H_d=i\hbar(\mathcal{E}/2)

429: 		(e^{i\omega_0t}\sigma-e^{-i\omega_0t}\sigma^\dag),

430: \end{equation}

431: $H_{\rm JC}$ is the well-known Jaynes-Cummings Hamiltonian

432: \begin{equation}

433: 	H_{\rm JC}=i\hbar g(a^\dag\sigma-a\sigma^\dag),

434: \end{equation}

435: and $H_f$ and $H_s$ are the dipole couplings to the forward and

436: spontaneous emission field modes outside the cavity (see e.g.\

437: \cite{s:gardinercollett})

438: \begin{eqnarray}

439: 	H_f=\hbar\int_0^\infty d\omega\,

440: 	\kappa_f(\omega)[ab_f^\dag(\omega)+a^\dag b_f(\omega)] \\

441: 	H_s=\hbar\int_0^\infty d\omega\,

442: 	\kappa_s(\omega)[\sigma b_s^\dag(\omega)+\sigma^\dag b_s(\omega)]

443: \end{eqnarray}

444: Here $\sigma=|g\rangle\langle e|$ is the atomic lowering

445: operator, $\sigma_z=[\sigma^\dag,\sigma]$, $a$ is the cavity mode lowering

446: operator (we will also use $x=a^\dag+a$ and $y=i(a^\dag-a)$), and

447: $b_f(\omega)$ and $b_s(\omega)$ are the annihilators of the forward and

448: spontaneous emission modes, respectively.  The resonant frequency of the

449: atom, drive and cavity mode is denoted by $\omega_0$, $\mathcal{E}$ is the

450: drive strength, $g$ is the atom-cavity coupling strength, and

451: $\kappa_f(\omega)$ and $\kappa_s(\omega)$ determine the

452: frequency-dependent coupling to the external field modes.

453:

454: We will assume that $\omega_0\gg\mathcal{E}\gg g>\kappa_f,\kappa_s$.

455: Following \cite{s:mabuchi}, let us switch to the interaction picture with

456: respect to $H_0+H_d$.  We obtain the interaction Hamiltonian

457: \begin{eqnarray}

458: 	\frac{H_{\rm I}}{\hbar}=

459: 	ig[a^\dag\sigma(t)-a\sigma^\dag(t)]

460: 	+\int_0^\infty d\omega\,

461: 		\kappa_s(\omega)[\sigma(t)b_s^\dag(\omega)

462: 			e^{i(\omega-\omega_0)t}+\mbox{h.c.}]

463: 	\nonumber\\

464: \label{eq:interaction}

465: 	\phantom{\frac{H_{\rm I}}{\hbar}=ig[a^\dag\sigma(t)-a\sigma^\dag(t)]}

466: 	+\int_0^\infty d\omega\,

467: 		\kappa_f(\omega)[ab_f^\dag(\omega)e^{i(\omega-\omega_0)t}

468: 			+\mbox{h.c.}]

469: \end{eqnarray}

470: where we have defined

471: \begin{eqnarray}

472: 	|\pm\rangle = 2^{-1/2}(|g\rangle\mp i|e\rangle), ~~~~

473: 	\mu=|-\rangle\langle+|, ~~~~ \mu_z=[\mu^\dag,\mu], \\

474: 	\sigma(t)=(-i/2)(\mu e^{-i\mathcal{E}t}+\mu_z-\mu^\dag e^{i\mathcal{E}t})

475: \end{eqnarray}

476: There are two time scales in the Hamiltonian (\ref{eq:interaction}), which

477: we will consider separately.  The first term evolves on the slow time

478: scale of the atomic evolution.  As $\mathcal{E}$ is very large compared to

479: the atomic time scale, we make the rotating wave approximation by dropping

480: the rapidly oscillating terms.

481:

482: The remaining terms in (\ref{eq:interaction}) correspond to the fast time

483: scale of interaction with the external electromagnetic field.  We cannot

484: use the rotating wave approximation for these terms, as the external

485: fields are broadband and thus have modes that respond on the fast time

486: scale.  Instead, we make the weak coupling (Markov) approximation for

487: these terms; this results in the following white noise Hamiltonian:

488: \begin{eqnarray}

489: \label{eq:whiten}

490: 	\tilde H_{\rm I}=

491: 	\hbar(g/2)\mu_zx

492: 	+\hbar\sqrt{2\kappa}\,[a\dot B_f^\dag(t)+\mbox{h.c.}]

493: 	+i\hbar[

494: 		\sqrt{\gamma_{+}/2}\,\mu\dot B_{s,+}^\dag(t)

495: 	\nonumber\\

496: 	\phantom{\tilde H_{\rm I}=\hbar(g/2)\mu_zx+}

497: 		+\sqrt{\gamma_{z}/2}\,\mu_z\dot B_{s,z}^\dag(t)+

498: 		\sqrt{\gamma_{-}/2}\,\mu^\dag\dot B_{s,-}^\dag(t)-\mbox{h.c.}]

499: \end{eqnarray}

500: Here $B_f^\dag$, $B_{s,+}^\dag$, $\dot B_{s,z}^\dag$ and

501: $\dot B_{s,-}^\dag$ are independent quantum white noises corresponding,

502: respectively, to the forward channel and the three spontaneous emission

503: channels at $\omega=\omega_0+\mathcal{E}$, $\omega_0$, and

504: $\omega_0-\mathcal{E}$ (the upper, middle and lower peaks of the Mollow

505: triplet.)

506:

507: We refer to \cite{s:gardinercollett} for a discussion of the white noise

508: approximation.  Care must be taken to assign an independent white noise to

509: each frequency component of $\sigma(t)$ (e.g.\ section III.E of

510: \cite{s:gardinercollett}); each frequency probes a different subset of the

511: modes $b(\omega)$ and hence ``sees'' a different noise.  In the weak

512: coupling limit these noises are in fact white and independent.  For a more

513: rigorous approach to the white noise limit see \cite{s:accardi,s:gough}.

514: Using the latter approach we can explicitly calculate

515: $\kappa=\pi\kappa_f(\omega_0)^2$, $\gamma_z=\pi\kappa_s(\omega_0)^2$, and

516: $\gamma_\pm=\pi\kappa_s(\omega_0\pm\mathcal{E})^2$.  For simplicity, we

517: will assume that approximately $\gamma_{z,+,-}=\gamma$.

518:

519: The white noise Hamiltonian (\ref{eq:whiten}) by itself is not well

520: defined.  However, we can give rigorous meaning to the equation

521: \begin{equation}

522: 	\frac{dU_t}{dt}=-\frac{i}{\hbar}\tilde H_{\rm I}U_t

523: \end{equation}

524: if we interpret it as a {\it Stratonovich} quantum stochastic differential

525: equation \cite{s:gardinercollett,s:gough}.  After conversion to the It\^o

526: form, this equation reads

527: \begin{eqnarray}

528: 	dU_t=[

529: 		\sqrt{\gamma/2}\,(\mu\,dB_{s,+}^\dag(t)

530: 			+\mu_z\,dB_{s,z}^\dag(t)

531: 			+\mu^\dag\,dB_{s,-}^\dag(t)-\mbox{h.c.})

532: 	\nonumber\\

533: 	\phantom{dU_t=[\sqrt{\gamma/2}}

534: 		-i\sqrt{2\kappa}\,(a\,dB_f^\dag(t)+\mbox{h.c.})

535: 	\nonumber\\

536: \label{eq:sys}

537: 	\phantom{dU_t=[\sqrt{\gamma/2}}

538: 		-\kappa a^\dag a\,dt

539: 		-(\gamma/2)\,dt

540: 		-i(g/2)\mu_z x\,dt

541: 	]\,U_t

542: \end{eqnarray}

543:

544: Let us now turn to the homodyne observation of the field.  The homodyne

545: detector measures a quadrature of the forward channel after it has

546: scattered off the cavity.  We will choose the quadrature

547: $B_f(t)+{B_f}^\dag(t)$; the observation process is then

548: $Y(t)=U_t^\dag(B_f(t)+{B_f}^\dag(t))U_t$ (i.e., the photocurrent is

549: $I(t)=dY(t)/dt$.)  Using the quantum It\^o rules

550: \cite{s:gardinercollett,s:hudpar}, we easily find the differential form of

551: this expression:

552: \begin{equation}

553: \label{eq:rawobs}

554: 	dY(t) = \sqrt{2\kappa}\,U_t^\dag yU_t\,dt+

555: 			dB_f(t)+dB_f^\dag(t)

556: \end{equation}

557: We can slightly extend our observation model to account for technical

558: noise, detector inefficiency, etc.  To model such effects, we add to

559: (\ref{eq:rawobs}) an independent corrupting noise $\propto dC(t)+dC^\dag(t)=

560: dV(t)$.  It is customary in the quantum optics literature to normalize

561: $Y(t)$ so that $dY(t)^2=dt$.  In terms of the detection efficiency

562: $\eta\in (0,1]$

563: \begin{equation}

564: \label{eq:obs}

565: 	dY(t) = \sqrt{2\kappa\eta}\,U_t^\dag yU_t\,dt

566: 		+\sqrt{\eta}\,[dB_f(t)+dB_f^\dag(t)]

567: 		+\sqrt{1-\eta}\,dV(t)

568: \end{equation}

569: We will take the It\^o equations (\ref{eq:sys}), (\ref{eq:obs}) as

570: our model for the system-observation pair.

571:

572: \subsection{The quantum filter}

573: \label{sec:jc2}

574:

575: Now that we have a model for the system and the observation process, we

576: can calculate the optimal filter.  The derivation of the filtering

577: equation is beyond the scope of this paper; for various approaches, see

578: \cite{s:VanHandel2005,s:belavkin,s:belavkz1,s:belavkz2,s:bouten}.  We will

579: attempt, however, through a simple finite-dimensional analogy, to explain

580: our interpretation of the filtering equation, as it is not entirely the

581: same as the interpretation that is often found in the physics literature

582: (e.g.\ \cite{s:wisemanf}).

583:

584: The optimal filter propagates the information state, which determines our

585: best estimate of every system observable given the observations we have

586: made.  In our model, every system observable can be represented as a

587: self-adjoint operator $X$ that lives on the atom-cavity Hilbert space; as

588: we are working in the Heisenberg picture, this observable at time $t$ is

589: given by $j_t(X)=U_t^\dag XU_t$.  We must now define what we mean by an

590: estimate of an observable.

591:

592: The idea behind the concept of estimation is that we have made some

593: observation, and given the outcome of this observation we wish to make a

594: guess as to the outcome of a different observable that we haven't

595: measured. That is, the estimate of an observable $X$ given an observation

596: of $Y$ is some function $f(Y)$ whose outcome represents our best guess of

597: $X$.  To find the {\it best} estimate we must specify some cost function

598: $\mathcal{C}[f]$ to optimize; the function that minimizes $\mathcal{C}$ is

599: then by definition the optimal estimate.

600:

601: The most commonly used estimator is one that minimizes the mean-square

602: error

603: \begin{equation}

604: \label{eq:meansq}

605: 	\mathcal{C}[f]=\langle(X-f(Y))^2\rangle

606: \end{equation}

607: The observable $f(Y)$ that minimizes this cost is called the {\it

608: conditional expectation} $\mathcal{E}(X|Y)$ of $X$ given $Y$.

609: We will use the conditional expectation as our information state

610: throughout this paper.  However, note that if we had chosen a different

611: cost we would obtain a different information state and filter.  There is

612: nothing inherently superior about the choice (\ref{eq:meansq}); in

613: fact, it is sometimes advantageous to choose a different estimator with

614: e.g.\ improved robustness properties \cite{s:james2,s:james3}.

615:

616: To understand how the conditional expectation relates to familiar notions

617: from quantum theory, we will demonstrate the procedure using a pair of

618: finite-dimensional observables \cite{s:maassen}.  Let $X$ and $Y$ be two

619: $n$-dimensional observables, $n<\infty$, and let $Y$ have $m$ distinct

620: eigenvalues $y_i$.  Then $Y$ can be decomposed as

621: \begin{equation}

622: 	Y=\sum_{i=1}^m y_iP_i

623: \end{equation}

624: where $P_i$ is the projection operator onto the eigenspace corresponding

625: to $y_i$.  Clearly any function of $Y$ is a linear combination of $P_i$,

626: and vice versa.  Hence we identify the set of all observables that are

627: functions of $Y$ with the span of $\{P_i\}$.  The conditional expectation

628: $\mathcal{E}(X|Y)$ is then the element of this set that minimizes the cost

629: (\ref{eq:meansq}).

630:

631: To find this element, we use the following trick.  The expression $\langle

632: X^\dag Y\rangle$ defines an inner product on the set of $n\times n$

633: complex matrices\footnote{

634: 	We assume for simplicity that the expectation map

635: 	$\langle\cdot\rangle$ is faithful \cite{s:maassen}.  If this is

636: 	not the case, then the conditional expectation is not unique.

637: 	However, all versions of $\mathcal{E}(\cdot|\cdot)$ are

638: 	equivalent in the sense that the difference between two versions

639: 	takes nonzero values with zero probability.

640: }.  Using this inner product, we orthogonally project $X$ onto the linear

641: space spanned by $\{P_i\}$.  This gives

642: \begin{equation}

643: \label{eq:qcondex}

644: 	P_YX=\sum_{i=1}^m\frac{\langle P_iX\rangle}{\langle P_i\rangle}P_i

645: \end{equation}

646: It is a well known fact that the orthogonal projection of some vector

647: $v$ onto a linear subspace $W$ with respect to any inner product $(a,b)$

648: gives the element $w\in W$ that minimizes the quantity $((v-w),(v-w))$

649: \cite{s:naylor}.  In our case, this means that $P_YX$ minimizes $\langle

650: (X-f^*(Y))(X-f(Y))\rangle$.  $f(Y)$ is only an observable, however, if $f$

651: is real, in which case we see that $P_YX$ is precisely the conditional

652: expectation $\mathcal{E}(X|Y)$.  Note that the orthogonal projection

653: $P_YX$ will always be self-adjoint if $X$ and $Y$ commute.

654:

655: Remarkably, when $X$ and $Y$ commute, the expression (\ref{eq:qcondex}) is

656: equivalent to the traditional projection postulate.  To see this, note

657: that if we observe $Y=y_i$ then $P_YX$ takes the value

658: $\langle P_iX\rangle/\langle P_i\rangle=

659: \langle P_iXP_i\rangle/\langle P_i\rangle$, which is exactly the

660: expectation of $X$ with respect to the initial state projected onto the

661: eigenspace of $y_i$.  The situation is somewhat ambiguous for noncommuting

662: $X$ and $Y$, and we will simply refrain from defining the conditional

663: expectation $\mathcal{E}(X|Y)$ when $[X,Y]\ne 0$.

664:

665: The quantum filter determines the best estimate of every system observable

666: given the observations; i.e., it propagates

667: $\pi_t(X)=\mathcal{E}(j_t(X)|Y(s\le t))$, where here

668: $\mathcal{E}(\cdot|\cdot)$ is a proper infinite-dimensional generalization

669: of (\ref{eq:qcondex}).  A crucial point is that $j_t(X)$ and $Y(s)$ can in

670: fact be shown to commute for all $s\le t$; this is called the {\it

671: nondemolition property} by Belavkin \cite{s:belavkin}.  Thus we see that,

672: even though we can evidently interpret the quantum filter in terms of the

673: projection postulate, we do not need to postulate anything beyond the

674: standard formalism of observables and expectations in quantum mechanics.

675: Instead, we see that the filter follows naturally from a statistical

676: inference procedure wherein we find the least-squares estimate for every

677: system observable given the observations.  This point of view is

678: very natural in a control-theoretic context.

679:

680: We now give the quantum filter for our model (\ref{eq:sys}), (\ref{eq:obs});

681: we refer to \cite{s:VanHandel2005,s:belavkin,s:belavkz1,s:belavkz2,s:bouten}

682: for various approaches to deriving this equation.  The result is

683: \begin{eqnarray}

684: 	d\pi_t(X)=

685: 	\pi_t(

686: 		(\gamma/2)\{

687: 			\overline{\mathcal D}[\mu]+

688: 			\overline{\mathcal D}[\mu_z]+

689: 			\overline{\mathcal D}[\mu^\dag]

690: 		\}X

691: 	)\,dt

692: \nonumber\\

693: \phantom{ d\pi_t(X)=\pi}

694: 	+\pi_t(

695: 		2\kappa\,\overline{\mathcal D}[a]X

696: 	)\,dt

697: 	+\pi_t(

698: 		i(g/2)[\mu_zx,X]

699: 	)\,dt

700: \nonumber\\

701: \phantom{ d\pi_t(X)=\pi}

702: 	+ \sqrt{2\kappa\eta}\,[

703: 		i\pi_t(a^\dag X-Xa)

704: 	-\pi_t(y)\pi_t(X)]\times

705: \nonumber\\

706: \label{eq:qfilt}

707: \phantom{ d\pi_t(X)=\pi + \sqrt{2\kappa\eta}\,[i\pi_t(a^\dag X-X}

708: 	(dY(t)-\sqrt{2\kappa\eta}\,\pi_t(y)\,dt)

709: \end{eqnarray}

710: where $\overline{\mathcal D}[c]X=c^\dag Xc-(c^\dag cX+Xc^\dag c)/2$.

711: The process $dW(t)=dY(t)-\sqrt{2\kappa\eta}\,\pi_t(y)\,dt$ is known as the

712: {\it innovations process}; it describes how ``surprised'' we are by the

713: measurement, as it is the difference between the observation $dY(t)$ and

714: our best estimate of what we should observe.  It can be shown that, as

715: long as the observation process $Y(t)$ has the statistics determined by

716: (\ref{eq:sys}) and (\ref{eq:obs}), the innovations process $dW_t$ is a

717: Wiener process.  In some sense this reflects the optimality of the filter,

718: as it means that the innovation is unbiased.

719:

720: Usually the quantum filter (\ref{eq:qfilt}) is written in its density

721: form.  To do this, we define a random density operator $\rho_t$ such

722: that\footnote{

723: 	We mean this in the sense of random variables; that is,

724: 	${\rm Tr}[X\rho_t]$ is a classical random variable with the same

725: 	statistics as the observable $\pi_t(X)$.  We have already implied

726: 	such a correspondence by interpreting $Y(t)$ as a classical

727: 	stochastic process.  In general, we can always express a set of

728: 	observables as classical random variables as long as they commute

729: 	\cite{s:maassen}.

730: } $\pi_t(X)={\rm Tr}[X\rho_t]$.  We then find

731: \begin{eqnarray}

732: 	d\rho_t=-i(g/2)[\mu_zx,\rho_t]\,dt

733: 		+2\kappa\mathcal{D}[a]\rho_t\,dt

734: \nonumber\\

735: \phantom{d\rho_t=-i}

736: 		+(\gamma/2)\{\mathcal{D}[\mu]

737: 			+\mathcal{D}[\mu_z]+\mathcal{D}[\mu^\dag]\}\rho_t\,dt

738: \label{eq:dfilt}\\

739: \nonumber

740: \phantom{d\rho_t=-i}

741: 	+\sqrt{2\kappa\eta}\,[

742: 		i\rho_t a^\dag-ia\rho_t-{\rm Tr}[\rho_t y]\rho_t

743: 	](dY(t)-\sqrt{2\kappa\eta}\,{\rm Tr}[\rho_t y]\,dt)

744: \end{eqnarray}

745: where $\mathcal{D}[c]\rho=c\rho c^\dag-(c^\dag c\rho+\rho c^\dag c)/2$.

746: This description is more economical than the raw filter (\ref{eq:qfilt}),

747: and appears frequently in the physics literature.  We have to be careful,

748: however, to interpret $\rho_t$ as the information state of an observer

749: with access to $Y(t)$, and {\it not} as the physical state of the system.

750: This point will be important for the interpretation of our results.

751:

752: We conclude this section with one more filter, the so-called unnormalized

753: filter, which is given by the expression

754: \begin{eqnarray}

755: 	d\tilde\rho_t=-i(g/2)[\mu_zx,\tilde\rho_t]\,dt

756: 		+2\kappa\mathcal{D}[a]\tilde\rho_t\,dt

757: \nonumber\\

758: \phantom{d\rho_t=-i}

759: 		+(\gamma/2)\{\mathcal{D}[\mu]

760: 			+\mathcal{D}[\mu_z]+\mathcal{D}[\mu^\dag]\}

761: 		\tilde\rho_t\,dt

762: \label{eq:ufilt}\\

763: \nonumber

764: \phantom{d\rho_t=-i}

765: 	+i\sqrt{2\kappa\eta}\,[\tilde\rho_t a^\dag-a\tilde\rho_t]\,dY(t)

766: \end{eqnarray}

767: The information state $\tilde\rho_t$ propagated by this filter is not

768: normalized, ${\rm Tr}[\tilde\rho_t]\ne 1$.  However, it is simply related

769: to the normalized information state by $\rho_t=\tilde\rho_t/{\rm

770: Tr}[\tilde\rho_t]$.  The chief advantage of (\ref{eq:ufilt}) is that it is

771: a linear equation, whereas (\ref{eq:dfilt}) is nonlinear in $\rho_t$.

772: This makes (\ref{eq:ufilt}) somewhat easier to manipulate.

773:

774: \subsection{The $Q$-filter}

775:

776: In \cite{s:mabuchi}, it was noticed that density operators of the form

777: \begin{equation}

778: \label{eq:qansatz}

779: 	\rho=\sum_{a=\pm}|a\rangle\langle a|\otimes

780: 		\int dy\, P^a(y)|iy/2\rangle\langle iy/2|

781: \end{equation}

782: [$|iy/2\rangle$ are coherent states of the cavity mode] form an invariant

783: set of the filtering equation (\ref{eq:dfilt}).  Thus, as long as the

784: initial density is within this set, we can represent the filtering

785: equations in terms of the pair of real (Glauber-Sudarshan) functions

786: $P^\pm(y)$ on a line.  Substituting (\ref{eq:qansatz}) into

787: (\ref{eq:ufilt}) yields the unnormalized $P$-filter

788: \begin{eqnarray}

789: 	dP^\pm_t(y)=

790: 	\frac{\partial}{\partial y}[(\pm g+\kappa y)P^\pm_t(y)]\,dt

791: \nonumber\\

792: \label{eq:pfiltt}

793: \phantom{dP^\pm_t(y)=(\pm g+}

794: 	+\frac{\gamma}{2}[P^\mp_t(y)-P^\pm_t(y)]\,dt

795: 	+\sqrt{2\kappa\eta}\,yP^\pm_t(y)\,dY(t)

796: \end{eqnarray}

797: For our purposes, it is more convenient to work with unnormalized

798: $Q$-functions

799: \begin{equation}

800: \label{eq:qfunction}

801: 	Q^\pm(y)=\langle\pm,iy/2|\rho|\pm,iy/2\rangle=

802: 	\int dy'\, P^\pm(y')\, e^{-(y-y')^2/4}

803: \end{equation}

804: as these are always guaranteed to be well-behaved densities \cite{s:mandel}.

805: We obtain

806: \begin{eqnarray}

807: 	dQ^\pm_t(y)=

808: 	\frac{\gamma}{2}[Q^\mp_t(y)-Q^\pm_t(y)]\,dt

809: 	+\frac{\partial}{\partial y}[(\pm g+\kappa y)Q^\pm_t(y)]\,dt

810: \nonumber\\

811: \label{eq:qfiltt}

812: \phantom{dQ^\pm_t(y)=Q}

813: 	+2\kappa\frac{\partial^2}{\partial y^2}Q^\pm_t(y)\,dt

814: 	+\sqrt{2\kappa\eta}\,\left[

815: 		y+2\frac{\partial}{\partial y}

816: 	\right]Q^\pm_t(y)\,dY(t)

817: \end{eqnarray}

818: The simplicity of this expression motivates our choice of this system for

819: demonstrating the quantum projection filter.

820:

821: Rather than using the $Q$-function for the projection filter, we could

822: work directly with the filter (\ref{eq:dfilt}) in density form and apply

823: methods of quantum information geometry \cite{s:amari}.  However, note

824: that any metric on a manifold of densities induces a metric on the

825: corresponding manifold of density operators (e.g.\ \cite{s:topology}).

826: Thus even the $Q$-function projection filter is a true quantum projection

827: filter, as long as we project onto a family of $Q$-functions that

828: correspond to valid quantum states.

829:

830: \subsection{Observing the spontaneous emission}

831:

832: Until now we have only observed the forward channel; however, at least in

833: principle, we could also observe independently the three spontaneous

834: emission channels $B_{s,z}$, $B_{s,\pm}$.  We would like to identify a

835: spontaneous emission event with the detection of a photon in one of these

836: side channels.  As such, in this section we discuss the situation wherein

837: direct photodetection is performed in each of the spontaneous emission

838: channels, in addition to the homodyne detection of the forward channel.

839:

840: The analysis in this case is very similar to the one performed in sections

841: \ref{sec:jc1}--\ref{sec:jc2}.  The system model is still given by

842: (\ref{eq:sys}).  Now, in addition to (\ref{eq:obs}), we need to introduce

843: three observation processes $N_{z,+,-}$ corresponding to photodetection

844: (with perfect efficiency) in the three spontaneous emission channels.

845: The details of this setup and the associated filtering equations are well

846: known and we will not repeat them here (see e.g.\ \cite{s:bouten}).  The

847: full (normalized) filtering equation is given by

848: \begin{eqnarray}

849: 	d\rho_t=-i(g/2)[\mu_zx,\rho_t]\,dt

850: 		+2\kappa\mathcal{D}[a]\rho_t\,dt

851: \nonumber\\

852: \phantom{d\rho_t=-i}

853: 		+(\gamma/2)\{\mathcal{D}[\mu]

854: 			+\mathcal{D}[\mu_z]+\mathcal{D}[\mu^\dag]\}\rho_t\,dt

855: \nonumber\\

856: \phantom{d\rho_t=-i}

857: 	+\sqrt{2\kappa\eta}\,[

858: 		i\rho_t a^\dag-ia\rho_t-{\rm Tr}[\rho_t y]\rho_t

859: 	](dY(t)-\sqrt{2\kappa\eta}\,{\rm Tr}[\rho_t y]\,dt)

860: \nonumber\\

861: \phantom{d\rho_t=-i}

862: 	+\mathcal{G}[\mu]\rho_t\,(dN_+(t)-(\gamma/2)\,{\rm Tr}[\mu^\dag\mu\rho_t]\,dt)

863: \nonumber\\

864: \phantom{d\rho_t=-i}

865: 	+\mathcal{G}[\mu_z]\rho_t\,(dN_z(t)-(\gamma/2)\,dt)

866: \nonumber\\

867: \label{eq:fullfilt}

868: \phantom{d\rho_t=-i}

869: 	+\mathcal{G}[\mu^\dag]\rho_t\,(dN_-(t)-(\gamma/2)\,{\rm Tr}[\mu\mu^\dag\rho_t]\,dt)

870: \end{eqnarray}

871: where $\mathcal{G}[c]\rho=c\rho c^\dag/{\rm Tr}[c\rho c^\dag]-\rho$.  It

872: can be shown that the statistics of the processes $N_{+,z,-}$(t) is such

873: that they are counting processes with independent jumps and rates

874: $(\gamma/2)\,{\rm Tr}[\mu^\dag\mu\rho_t]$, $(\gamma/2)$ and

875: $(\gamma/2)\,{\rm Tr}[\mu\mu^\dag\rho_t]$, respectively.

876:

877: We now have two different filters, equations (\ref{eq:dfilt}) and

878: (\ref{eq:fullfilt}), for the same physical system (\ref{eq:sys}).  To see

879: how they relate, recall that all the filter is propagating is an

880: information state.  The information state in (\ref{eq:dfilt}) represents

881: the best estimate of an observer who only has access to the homodyne

882: measurement in the forward channel.  The information state in

883: (\ref{eq:fullfilt}), however, represents the best estimate of a different

884: observer who has access to both the homodyne observation and to direct

885: photodetection of the spontaneous emission channels.  Neither information

886: state represents the physical state of the system; the latter is given by

887: (\ref{eq:sys}).

888:

889: In practice, the frequency-resolved monitoring of spontaneously emitted

890: photons is not (yet) experimentally feasible.  Hence we would never use

891: the filter (\ref{eq:fullfilt}) in an actual experimental situation.  On

892: the other hand, we are able to generate photocurrents $Y(t)$,

893: $N_{+,z,-}(t)$ with the correct statistics in a computer simulation.  It

894: is then interesting to compare the estimate of an observer who has access

895: to all photocurrents to the estimate of a realistic observer who only has

896: access to the forward channel.  In particular, this gives insight into the

897: question asked in \cite{s:mabuchi}, `in what sense should we be able to

898: associate observed phase-switching events [in the forward channel] with

899: ``actual'' atomic decays?'

900:

901: The main reason for introducing (\ref{eq:fullfilt}) is that it gives us a

902: convenient way to perform computer simulations of the photocurrent $Y(t)$.

903: We wish to generate sample paths of $Y(t)$, with the correct statistics,

904: in order to compare the performance of the optimal filter (\ref{eq:dfilt})

905: with the projection filter that we will derive shortly.  Ideally we would

906: directly simulate the system evolution (\ref{eq:sys}); this problem is

907: essentially intractable, however.  Fortunately, we have already expressed

908: the statistics of the photocurrents $Y(t)$, $N_{+,z,-}(t)$ completely in

909: terms of the information state.  Hence we can equivalently simulate the

910: photocurrents by simulating (\ref{eq:fullfilt}) according to these rules

911: ($dW(t)$ is a Wiener process, $N_+(t)$ has rate $(\gamma/2)\,{\rm

912: Tr}[\mu^\dag\mu\rho_t]$, etc.)

913:

914: Of course, we can also perform such simulations with (\ref{eq:dfilt}).

915: However, the advantage of (\ref{eq:fullfilt}) is that, if we choose

916: $\eta=1$, the pure states are an invariant set of this filter.  We can

917: thus rewrite the equation as a stochastic Schr{\"o}dinger equation, in

918: which we only have to propagate a vector instead of an operator.  This is

919: a much more efficient numerical procedure, and is frequently used in

920: quantum optics \cite{s:gardinerparkins}.  For our system, the stochastic

921: Schr{\"o}dinger equation corresponding to (\ref{eq:fullfilt}) is given by

922: \begin{eqnarray}

923: 	d|\psi_t\rangle=[

924: 		(-i(g/2)\mu_z x

925: 		 -i\kappa\langle y\rangle_t a

926: 		 -\kappa a^\dag a

927: 		 -(\kappa/4)\langle y\rangle_t^2)\,dt

928: \nonumber\\

929: \phantom{d|\psi_t\rangle=[(-i}

930: 		-\sqrt{2\kappa}\,(ia+\langle y\rangle_t/2)\,dW(t)

931: 		+(\mu/\langle\mu^\dag\mu\rangle_t^{1/2}-1)\,dN_+(t)

932: \nonumber\\

933: \phantom{d|\psi_t\rangle=[(-i}

934: \label{eq:sse}

935: 		+(\mu_z-1)\,dN_z(t)

936: 		+(\mu^\dag/\langle\mu\mu^\dag\rangle_t^{1/2}-1)\,dN_-(t)

937: 	]\,|\psi_t\rangle

938: \end{eqnarray}

939: where $\langle c\rangle_t=\langle\psi_t|c|\psi_t\rangle$, and

940: $\rho_t=|\psi_t\rangle\langle\psi_t|$.  We numerically solve this equation

941: in a truncated Fock basis for the cavity mode.  The homodyne photocurrent

942: (\ref{eq:obs}) is calculated from the innovation using

943: $dY(t)=\sqrt{2\kappa\eta}\,\langle y\rangle_t\,dt+\sqrt{\eta}\,dW_t+\sqrt{1-\eta}\,dV_t$.

944:

945:

946: \section{The quantum projection filter}

947: \label{sec:proj2}

948:

949: \subsection{The finite-dimensional family}

950:

951: Before we can obtain a projection filter for (\ref{eq:qfiltt}), we must

952: fix the finite-dimensional family of densities to project onto.  Note that

953: each density is actually the pair of $Q$-functions $Q^\pm(y)$, unlike

954: in section \ref{sec:proj} where each density was a single function.

955: However, we can easily put the problem into this form by making $\pm$

956: an argument of the function, i.e.\ $Q^\pm(y)=Q(y,\pm)$.  The square roots

957: of $Q$-functions form a perfectly reasonable $L^2$ space

958: ($L^2=L^2(\mathbb{R})\oplus L^2(\mathbb{R})$) with the inner product

959: \begin{equation}

960: 	\langle Q_1^{1/2},Q_2^{1/2}\rangle=

961: 		\sum_{a=\pm}\int_{-\infty}^\infty dy\,

962: 			Q_1^{1/2}(y,a)Q_2^{1/2}(y,a)

963: \end{equation}

964: In the following we will use the notations $Q^\pm(y)$ and $Q(y,\pm)$

965: interchangeably.

966:

967: Numerical simulations of (\ref{eq:qfiltt}) show that at any time, both

968: $Q^+(y)$ and $Q^-(y)$ are unimodal, roughly bell-shaped densities with an

969: approximately constant width.  This suggests that we can attempt to

970: approximate the information state by unnormalized density operators of the

971: form

972: \begin{equation}

973: 	\rho=

974: 		\nu^+|+\rangle\langle+|\otimes

975: 			|i\mu^+/2\rangle\langle i\mu^+/2|+

976: 		\nu^-|-\rangle\langle-|\otimes

977: 			|i\mu^-/2\rangle\langle i\mu^-/2|

978: \end{equation}

979: This corresponds to the bi-Gaussian family of unnormalized $Q$-functions

980: \begin{equation}

981: 	q(y,\pm)=\frac{\nu^\pm}{2\sqrt{\pi}}

982: 		\exp\left[

983: 			-\frac{(y-\mu^\pm)^2}{4}

984: 		\right],~\mu^\pm\in\mathbb{R},~\nu^\pm\ge 0

985: \end{equation}

986: We collect the parameters into a vector

987: $\theta=(\mu^+,\nu^+,\mu^-,\nu^-)$, where

988: $\theta\in\Theta=\{\mu^\pm\in\mathbb{R},~\nu^\pm\ge 0\}$.  Then the family

989: of square roots of densities

990: \begin{equation}

991: 	S^{1/2}=\{\sqrt{q(y,\pm;\theta)},~\theta\in\Theta\}

992: \end{equation}

993: is a finite-dimensional manifold in $L^2$ with the tangent space

994: \begin{equation}

995: 	T_\theta S^{1/2}=\mbox{Span}\left\{

996: 	\frac{\partial\sqrt{q(y,\pm;\theta)}}{\partial\theta^i}:

997: 	i=1\ldots 4\right\}

998: \end{equation}

999: and Fisher metric

1000: \begin{equation}

1001: 	g_{ij}(\theta)=

1002: 	4\left\langle

1003: 	\frac{\partial\sqrt{q(y,\pm;\theta)}}{\partial\theta^i},

1004: 	\frac{\partial\sqrt{q(y,\pm;\theta)}}{\partial\theta^j}

1005: 	\right\rangle

1006: \end{equation}

1007: Calculating the latter explicitly, we obtain the diagonal matrix

1008: \begin{equation}

1009: 	g(\theta)=\mbox{diag}\left\{

1010: 		\frac{\nu^+}{2},~\frac{1}{\nu^+},~

1011: 		\frac{\nu^-}{2},~\frac{1}{\nu^-}

1012: 	\right\}

1013: \end{equation}

1014:

1015: \subsection{The projection filter}

1016:

1017: We will perform projection of the unnormalized filtering equation

1018: (\ref{eq:qfiltt}), as in \cite{s:vellekoop}.  We begin by converting the

1019: equation into the Stratonovich form:

1020: \begin{eqnarray}

1021: 	dQ^\pm_t(y)=

1022: 	\frac{\gamma}{2}[Q^\mp_t(y)-Q^\pm_t(y)]\,dt

1023: 	+\frac{\partial}{\partial y}[(\pm g+\kappa(1-4\eta)y)Q^\pm_t(y)]\,dt

1024: \nonumber\\

1025: \phantom{dQ^\pm_t(y)=Q}

1026: 	+2\kappa(1-2\eta)\frac{\partial^2}{\partial y^2}Q^\pm_t(y)\,dt

1027: 	+\kappa\eta(2-y^2)Q^\pm_t(y)\,dt

1028: \nonumber\\

1029: \label{eq:qfilttst}

1030: \phantom{dQ^\pm_t(y)=Q+2\kappa(1-2\eta)\frac{\partial}{\partial y}}

1031: 	+\sqrt{2\kappa\eta}\,\left[

1032: 		y+2\frac{\partial}{\partial y}

1033: 	\right]Q^\pm_t(y)\circ dY(t)

1034: \end{eqnarray}

1035: We can now use (\ref{eq:projf}) and (\ref{eq:projf2}) to find dynamical

1036: equations for the projection filter.  After tedious but straightforward

1037: calculations, we obtain

1038: \begin{eqnarray}

1039: \label{eq:projfs1}

1040:         d\nu^+_t = \left[\frac{\gamma}{2}(\nu^-_t - \nu^+_t)

1041:                 -\kappa\eta\,(\mu^+_t)^2\nu^+_t

1042:         \right]dt

1043:         +\sqrt{2\kappa\eta}\,\mu^+_t\nu^+_t\circ dY(t) \\

1044: \label{eq:projfs2}

1045:         d\nu^-_t = \left[\frac{\gamma}{2}(\nu^+_t - \nu^-_t)

1046:                 -\kappa\eta\,(\mu^-_t)^2\nu^-_t

1047:         \right]dt

1048:         +\sqrt{2\kappa\eta}\,\mu^-_t\nu^-_t\circ dY(t) \\

1049:         \frac{d\mu^+_t}{dt} = -g-\kappa\mu^+_t+

1050:                 \frac{\gamma}{2}\frac{\nu^-_t}{\nu^+_t}

1051:                 (\mu^-_t-\mu^+_t) \\

1052:         \frac{d\mu^-_t}{dt} = +g-\kappa\mu^-_t+

1053:                 \frac{\gamma}{2}\frac{\nu^+_t}{\nu^-_t}

1054:                 (\mu^+_t-\mu^-_t)

1055: \end{eqnarray}

1056: Conversion to the It\^o form changes (\ref{eq:projfs1}) and

1057: (\ref{eq:projfs2}) to

1058: \begin{eqnarray}

1059:         d\nu^+_t = \frac{\gamma}{2}(\nu^-_t - \nu^+_t)\,dt

1060:         +\sqrt{2\kappa\eta}\,\mu^+_t\nu^+_t\, dY(t) \\

1061:         d\nu^-_t = \frac{\gamma}{2}(\nu^+_t - \nu^-_t)\,dt

1062:         +\sqrt{2\kappa\eta}\,\mu^-_t\nu^-_t\, dY(t)

1063: \end{eqnarray}

1064: Finally, we rewrite the equations in terms of the {\it normalized}

1065: parameters $\mu^\pm$ and $\tilde\nu_t^+=\nu_t^+/(\nu_t^++\nu_t^-)$.  This

1066: gives

1067: \begin{eqnarray}

1068:         d\tilde\nu^+_t = -\gamma(\tilde\nu^+_t-1/2)\,dt+

1069:         \sqrt{2\kappa\eta}\,\tilde\nu^+_t(1-\tilde\nu^+_t)

1070:                 (\mu^+_t-\mu^-_t)\times

1071: \nonumber\\

1072: \label{eq:pf1}

1073: \phantom{d\tilde\nu^+_t = -\gamma(\tilde\nu^+_t-1/2)}

1074:         \{dY(t)-\sqrt{2\kappa\eta}\,[\mu^+_t\tilde\nu^+_t

1075:                 +\mu^-_t(1-\tilde\nu^+_t)]\,dt\} \\

1076: \label{eq:pf2}

1077:         \frac{d\mu^+_t}{dt} = -g-\kappa\mu^+_t+

1078:                 \frac{\gamma}{2}\frac{1-\tilde\nu^+_t}{\tilde\nu^+_t}

1079:                 (\mu^-_t-\mu^+_t) \\

1080: \label{eq:pf3}

1081:         \frac{d\mu^-_t}{dt} = +g-\kappa\mu^-_t+

1082:                 \frac{\gamma}{2}\frac{\tilde\nu^+_t}{1-\tilde\nu^+_t}

1083:                 (\mu^+_t-\mu^-_t)

1084: \end{eqnarray}

1085: Equations (\ref{eq:pf1})--(\ref{eq:pf3}) form the projection filter

1086: for our model on the family $S^{1/2}$.

1087:

1088: Note that equations (\ref{eq:pf2}) and (\ref{eq:pf3}) are singular at

1089: $\tilde\nu^+_t=0$ or $1$.  We can trace this back to the fact that we have

1090: cheated a little in the definition of our family of densities.

1091: When $\nu^+=0$ (or $\nu^-=0$), the map $\theta\mapsto q(y,\pm;\theta)$ is

1092: not invertible, as in this case any choice of $\mu^+$ (or $\mu^-$) leads

1093: to the same density.  As we have essentially inverted this map to obtain

1094: the equations (\ref{eq:pf1})--(\ref{eq:pf3}) for the parameters, we can

1095: hardly expect these to be well-defined when this map is not invertible.

1096:

1097: Fortunately the points $\tilde\nu^+_t=0$ and $1$ are never reached if

1098: we start the filter with $0<\tilde\nu^+<1$.  Hence we can make the filter

1099: well-defined everywhere simply by removing the offending points

1100: $\nu^+_t=0$ and $\nu^-=0$ from $S^{1/2}$.  The map $\theta\mapsto

1101: q(y,\pm;\theta)$ is then invertible everywhere (in other words, then the

1102: manifold is covered by a single chart.)  Even if we want to consider

1103: starting the filter on $\tilde\nu^+_t=0$ or $1$ at $t=0$ this is not a

1104: problem; the filter dynamics will cause $\tilde\nu^+$ to evolve off the

1105: singular point, so that the filter is well defined after an arbitrarily

1106: small time step \cite{s:vellekoop}.

1107:

1108: \subsection{Connection with the Wonham filter}

1109:

1110: There is a remarkable connection between the projection filter obtained in

1111: the previous section and the theory of jump process filtering.  This

1112: theory goes back to the beautiful classic paper by Wonham \cite{s:wonham},

1113: in which the following problem is solved.

1114:

1115: Denote by $x(t)$ a stationary Markovian jump process which switches

1116: between two states $a_-$ and $a_+$ with a rate $\gamma/2$; i.e., $x(t)$ is

1117: a random telegraph signal \cite{s:gardinerhb}.  Now suppose we do not have

1118: access to a complete obeservation of $x(t)$, but only to the corrupted

1119: observation $y(t)$ defined by

1120: \begin{equation}

1121: \label{eq:wonhamobs}

1122: 	dy(t)=\sqrt{2\kappa\eta}\,x(t)\,dt+dw(t)

1123: \end{equation}

1124: where $dw(t)$ is a Wiener process.  We can now ask, what is our best guess

1125: of the probability $p_+(t)$ that $x(t)=a_+$, given the observations

1126: $y(s\le t)$?  The answer is given in closed form by (a special case of)

1127: the Wonham filter:

1128: \begin{eqnarray}

1129: 	dp_+(t)=-\gamma[p_+(t)-1/2]\,dt+

1130: 		\sqrt{2\kappa\eta}\,p_+(t)[1-p_+(t)](a_+-a_-)\times

1131: \nonumber\\

1132: \phantom{dp_+(t)=-\gamma[p_+(t}

1133: 	\{dy(t)-\sqrt{2\kappa\eta}\,[a_+p_+(t)+a_-(1-p_+(t))]\,dt\}

1134: \end{eqnarray}

1135: But this is exactly (\ref{eq:pf1}) with $\mu^+_t$ and $\mu^-_t$ replaced

1136: by the constants $a_+$ and $a_-$.

1137:

1138: Though intuitively appealing, this is in many ways a remarkable result.

1139: There appear to be no inherent jumps in either the optimal filter

1140: (\ref{eq:dfilt}), or the system-observation pair (\ref{eq:sys}),

1141: (\ref{eq:obs}) from which it was obtained.  It is true that we can choose

1142: to observe a jump process in the spontaneous emission channels, as in

1143: (\ref{eq:fullfilt}), but we could have equally chosen to perform homodyne

1144: or heterodyne detection which do not lead to jump process observations.

1145: Nonetheless (\ref{eq:pf1}) emerges naturally from our model, and an

1146: expression of the same form can even be obtained directly from

1147: (\ref{eq:dfilt}) \cite{s:mabuchi}.  Evidently there is a deep connection

1148: between our system and the theory of jump processes.

1149:

1150: As a classical filter, we can interpret the projection filter

1151: (\ref{eq:pf1})--(\ref{eq:pf3}) as an {\it adaptive} Wonham filter, where

1152: the equations (\ref{eq:pf2})--(\ref{eq:pf3}) continually adapt the

1153: parameters $a_+$ and $a_-$ in the Wonham filter (\ref{eq:pf1}).  A similar

1154: structure was observed in \cite{s:vellekoop}, where the classical problem

1155: of changepoint detection (the detection of a single random jump in white

1156: noise) was treated using the projection filtering approach.

1157:

1158:

1159: \section{Numerical results}

1160: \label{sec:num}

1161:

1162: \begin{figure}

1163: \includegraphics[width=\textwidth]{Compare.eps}

1164: \caption{

1165: \label{f:compare} Comparison between the optimal and projection filters.

1166: A typical observation is shown with $\eta=1$, $g=120$, $\kappa=40$,

1167: $\gamma=20$, and the integration was performed over $25\,000$ time steps.

1168: The top row, figures (a) and (c), were calculated using the optimal filter

1169: (\ref{eq:qfiltt}); the bottom row, (b) and (d), were calculated for the

1170: same observation process using the projection filter

1171: (\ref{eq:pf1})--(\ref{eq:pf3}).}

1172: \end{figure}

1173:

1174: In this section we present the results of numerical simulations of the

1175: various filters.  Sample paths of the observation process were generated

1176: by numerically solving (\ref{eq:sse}) with a truncated cavity basis of

1177: $25$ Fock states and the (appropriately truncated) initial state

1178: $|\psi_0\rangle=|-\rangle\otimes|0\rangle$. The thus generated

1179: observations were then filtered using the optimal filter in $Q$-function

1180: form (\ref{eq:qfiltt}), and using the projection filter

1181: (\ref{eq:pf1})--(\ref{eq:pf3}).

1182:

1183: The optimal filter was implemented using a simple finite-difference scheme

1184: \cite{s:nr} on a grid of $128$ equidistant points in the interval

1185: $y\in[-18,18]$, with the appropriately truncated initial condition

1186: corresponding to $|\psi_0\rangle$.  Finally, the projection filter was

1187: started with the initial condition $\tilde\nu^+=\mu^+=\mu^-=0$, where care

1188: was taken not to propagate $\mu^+$ until after the first time step.  In

1189: all simulations, stochastic integration was performed using the stochastic

1190: Euler method \cite{s:kloeden}.

1191:

1192: In Figure \ref{f:compare} a typical filtered sample path is shown.  The

1193: top row was obtained from the optimal filter, while the bottom row was

1194: obtained using the projection filter.  The value inferred for both the

1195: conditional probability of finding the atom in the $|+\rangle$ state (left

1196: column) and the conditional expectation of the $y$-quadrature (right

1197: column) are nearly identical for the two filters.  Evidently the

1198: projection filter is an extremely good approximation to the optimal

1199: filtering equation.

1200:

1201: \begin{figure}

1202: \includegraphics[width=\textwidth]{Typical.eps}

1203: \caption{

1204: \label{f:typical}  A single run of an experiment is simulated.  The dashed

1205: line (red) corresponds to the optimal estimate of an observer who has

1206: access to direct photodetection of the spontaneously emitted photons, as

1207: well as homodyne detection of the forward channel.  The solid line (green)

1208: is the optimal estimate of a different observer, who only has access to

1209: the homodyne photocurrent, for the same run of the experiment.  The dotted

1210: line (blue) is the projection filter estimate based only on the homodyne

1211: photocurrent.  All parameters are the same as in Figure \ref{f:compare}.}

1212: \end{figure}

1213:

1214: \begin{figure}

1215: \includegraphics[width=\textwidth]{Atypical.eps}

1216: \caption{

1217: \label{f:atypical}  Two more sample paths of the filtered estimate

1218: of atomic state, demonstrating missed jumps (top) and false jumps

1219: (bottom).  All parameters are the same as in Figure \ref{f:compare}.}

1220: \end{figure}

1221:

1222: Next, in Figure \ref{f:typical}, we compare the information state of the

1223: optimal and projection filters to the information state that is based on

1224: the additional observation of spontaneously emitted photons.  The latter

1225: filter demonstrates the behavior reported in \cite{s:alsing}.  Whenever a

1226: photon is observed in one of the side peaks of the Mollow triplet, the

1227: observer infers that the atom has made a jump.  The estimated phase of the

1228: cavity field then exponentially decays to a steady-state level of $\langle

1229: y\rangle=\pm g/\kappa$.

1230:

1231: If we do not measure the spontaneous emission, our best guess of the

1232: atomic state still behaves in a jump-like way.  However, we see that there

1233: is a little delay between the time that the observer of spontaneous

1234: emission thinks the atom has jumped, and the time that the homodyne

1235: observer comes to the same conclusion.  If we identify atomic decay with

1236: the spontaneous emission of a photon, we can now give a fairly

1237: satisfactory answer to the question posed in \cite{s:mabuchi}: `In what

1238: sense should we be able to associate observed phase switching events with

1239: ``actual'' atomic decays'?  It appears that an observed phase switch

1240: signals that, had we been making such an observation, we would likely have

1241: seen a spontaneously emitted photon a little while earlier.

1242:

1243: The detection delay is a rather generic property of the type of filtering

1244: problems we are considering \cite{s:vellekoop,s:shiryaev}.  Any time we

1245: see a large fluctuation in the observed process, the filter has to decide

1246: whether this is a large fluctuation of the noise, or a large fluctuation

1247: of the observed system.  As is pointed out by Shiryaev \cite{s:shiryaev}

1248: in the context of changepoint detection, the filter rides a delicate

1249: balance between minimizing the delay time and minimizing the probability

1250: of ``false alarms''.  Decreasing the number of false alarms (by

1251: choosing a different filtering cost) would unavoidably increase the delay

1252: time, and vice versa.  In our system, false alarms are missed jumps

1253: and false jumps; these do occur, as can be seen in Figure

1254: \ref{f:atypical}.

1255:

1256: \begin{figure}

1257: \includegraphics[width=\textwidth]{Better.eps}

1258: \caption{

1259: A typical filtered sample path with $\eta=1$, $g=600$, $\kappa=200$,

1260: $\gamma=20$. The integration was performed over $100\,000$ time steps.

1261: The various line types are as in the previous figures.

1262: \label{f:better}

1263: }

1264: \end{figure}

1265:

1266: If we wish to generally improve the quality of detection we have no

1267: alternative than to increase the signal-to-noise level of the observation.

1268: In the case of our system, we can do this if we increase $g$ and $\kappa$

1269: while keeping their ratio fixed (the analogy can be justified from

1270: (\ref{eq:wonhamobs}); the signal $x$ has fixed magnitude $x=\pm g/\kappa$,

1271: while the signal-to-noise ratio $\sim g/\sqrt{\kappa}$.)  A simulation

1272: with greatly increased signal-to-noise is shown in Figure \ref{f:better}.

1273: In this very strong coupling and damping regime, it appears that not much

1274: more information can be extracted from observation of the spontaneous

1275: emission than we could have already inferred from the homodyne

1276: photocurrent.

1277:

1278: \section{Conclusion}

1279:

1280: In this paper, we have suggested that the method of projection filtering

1281: can be very fruitful when applied to quantum filtering theory.  Using a

1282: simple model of a strongly coupled two-level atom in a cavity we

1283: numerically demonstrated near-optimal performance of the projection

1284: filter, as is evident from Figures \ref{f:compare}--\ref{f:better}.  We

1285: have also shown a connection between this model from cavity QED and the

1286: classical Wonham filter; the projection filter can be interpreted as an

1287: adaptive Wonham filter, applied to a quantum model.  In future work we

1288: will develop a ``true'' quantum formalism for projection filtering,

1289: using methods from quantum information geometry.

1290:

1291: The reduction of infinite or high-dimensional filters to a tractable set

1292: of equations is essential if we wish to perform estimation in real time,

1293: for example in a feedback control loop.  In a control-theoretic context,

1294: converting a large, complex system into a set of simple equations is known

1295: as model reduction.  Ideally, however, such a procedure should yield some

1296: bounds on the error of approximation; in our case we have observed

1297: numerically that the approximation error is very small, but we have no

1298: rigorous bounds to back up this statement.  In classical control theory of

1299: linear systems, the method of balanced truncation \cite{s:dullerud} gives

1300: a very general method for model reduction with guaranteed error bounds.

1301: How to do this effectively for nonlinear systems is still an open problem,

1302: however, both in classical and in quantum theory.

1303:

1304:

1305: \section*{Acknowledgment}

1306:

1307: The authors thank Mike Armen for useful discussions.

1308: This work was supported by the ARO (DAAD19-03-1-0073).

1309:

1310: \section*{References}

1311:

1312: \bibliographystyle{unsrt}

1313: \bibliography{QPF}

1314:

1315: \end{document}

1316:

1317: