quant-ph0006009/GM.tex
1: \documentstyle[psfig,aps,prl,amssymb]{revtex}
2: 
3: \begin{document}
4: \title{Increased Efficiency of Quantum State Estimation Using 
5:  {\it Non-Separable} Measurements}
6: \author{Paul B. Slater}
7: \address{ISBER, University of California, Santa Barbara, CA 93106-2150\\
8: e-mail: slater@itp.ucsb.edu, FAX: (805) 893-7995}
9: 
10: \date{\today}
11: 
12: \draft
13: 
14: \maketitle
15: 
16: \vskip -0.1cm
17: 
18: \begin{abstract}
19: We address the ``major open problem'' of evaluating how much increased
20: efficiency in estimation is possible using {\it non-separable} --- as 
21: opposed to separable --- measurements of $N$ copies 
22: of $m$-level quantum systems. First, we study the six cases 
23: $m=2$, $N=2,\ldots,7$ by 
24: computing  the $3 \times 3$ Fisher information
25: matrices for the corresponding {\it optimal} measurements 
26: recently devised  by Vidal {\it et al}
27: (Phys. Rev. A 60, 126 [1999]). We obtain simple polynomial expressions for 
28: the (``Gill-Massar'')  traces of the 
29: products of the inverse of the 
30: quantum Helstrom information matrix and these
31: Fisher information matrices.
32: The six traces {\it all}
33: have {\it minima} of $2 N -1$
34: in the {\it pure state} limit --- while for {\it separable} measurements 
35: (Phys. Rev. A 61, 042312 [2000]), 
36: the traces can equal $N$, but {\it not} exceed it. 
37: Then, the result of an analysis for $m=3$, $N=2$ 
38: leads us to {\it conjecture} that for 
39: optimal measurements for {\it all}
40: $m$ and $N$, the Gill-Massar trace achieves a 
41: {\it minimum} of  
42: $(2 N -1) (m-1)$ in the {\it pure state} limit.
43: \end{abstract}
44: 
45: 
46: \pacs{PACS Numbers {03.67.-a, 89.70.+c, 02.50.-r}}
47: 
48: \vspace{.1cm}
49: 
50: \tableofcontents
51: 
52: \section{Introduction}
53: We investigate information-theoretic properties of the optimal
54: measurement schemes recently devised by Vidal {\it et al} 
55: \cite{vidal}, helping thereby to address the ``major open problem'' 
56: \cite{gill} of evaluating how much 
57: increased efficiency in estimation 
58: is possible using {\it non-separable} measurements (cf. \cite{fischer}). 
59: In their 
60: extensive study, ``State estimation for large ensembles,'' 
61: which we seek to extend here, Gill and
62: Massar stated that ``we cannot compare our results with the recent analysis
63: of covariant [optimal] measurements on mixed states \cite{vidal} 
64: because we suppose separability
65: of the measurement, whereas \cite{vidal} does not'' \cite{gill}.
66: A ``separable measurement is one that can be carried out sequentially
67: on separate particles, where the measurement on one particle at any stage
68: (and indeed which particle to measure: one is allowed to measure particles
69: several times) can depend arbitrarily on the outcomes so far'' \cite{gill}.
70: 
71: The analyses  here are conducted in terms of the (classical) {\it Fisher 
72: information} (of the probability distributions associated with
73: the non-separable measurements), making use of 
74: the quantum (Helstrom) Cram\'er-Rao bound 
75: \cite{helstrom} on the Fisher information matrix 
76: for any {\it oprom} (operator-valued probability measure)
77: \cite{gill2,busch}.
78: Contrastingly, the studies of Vidal and his several Barcelona colleagues 
79: \cite{vidal,tarvid,vidal2,acin}
80:  have been formulated primarily in terms of
81: {\it fidelity}, $F(\rho,\rho')$
82: ($\rho$ and $\rho'$  being density matrices) \cite{uhlmann,jozsa},  
83: and secondarily, {\it information gain} \cite{tarvid}.
84: Now, there surely exists
85: an intimate connection between these approaches, since 
86: $2(1-F(\rho,\rho'))$ functions as the {\it Bures} distance between
87: $\rho$ and $\rho'$. The Bures metric is a distinguished member (the {\it 
88: minimal}
89: one) of a continuum of possible quantum extensions --- each 
90: associated with a distinct {\it operator monotone} function --- of the 
91: (classical) Fisher information
92: metric \cite{petzsudar,bc,paulpla}. The Helstrom-Cram\'er-Rao bound
93: corresponds to the particular use of the Bures metric {\it via} the concept of 
94: the {\it symmetric logarithmic derivative} \cite{helstrom}.
95: An interesting hypothesis is that asymptotically the Fisher information
96: matrix for optimal measurements is simply proportional to the metric
97: tensor associated with some specific operator monotone function.
98: (Our results below indicate that such a role is 
99: definitely {\it not} played by the
100: Bures metric.)
101: 
102: We shall be concerned 
103: here primarily (cf. secs.~\ref{tl} and \ref{fl}) with the
104: two-level quantum systems, representable by the $2 \times 2$ density matrices,
105: \begin{equation} \label{bloch}
106: \rho  = {1 \over 2} \pmatrix{1 + z & x + \mbox{i} y \cr
107: x - \mbox{i} y & 1- z \cr},
108: \end{equation}
109: where $r^2 = 
110: x^2 +y^2 +z^2 \leq 1$. The particular $(x,y,z)$ parameterization employed
111: in (\ref{bloch})
112:  corresponds to  the use of Cartesian coordinates for the  ``Bloch 
113: (or Poincar\'e) sphere'' (unit ball in 
114: three-space) representation of the two-level systems \cite{bm} 
115: \cite[sec. 4.2]{belt}, while the alternative (spherical coordinate)
116: parameter
117: $r$ is  the radial distance from the origin. Pure states, for which 
118: $|\rho|=0$, correspond to
119: $r=1$ and the fully mixed state, for which $|\rho| = {1 \over 4}$, to $r=0$.
120: 
121: For the cases of $N$  copies ($N=2,\ldots,7$) of a two-level quantum system
122: (\ref{bloch}) we obtain below in sec.~\ref{relations} 
123: a quite interesting pattern of results of increased 
124: efficiency using non-separable measurements,
125: which strongly suggests generalizability to arbitrary $N$.
126: To explicitly examine the cases $N>7$
127: would entail either considerable additional computations
128: for each specific $N$ or substantial analytical advances 
129: (cf. sec.~\ref{fishmono}) allowing one
130: to formally establish the measure of increased efficiency 
131: for {\it arbitrary} $N$. (We note that Latorre {\it et al} \cite{vidal2}
132: had to proceed {\it case-by-case}, 
133: that is, each $N$ individually, since they ``did not know how to build the
134: POVM algorithmically''.) In sec.~\ref{fishmono} we explore one possible
135: approach in this regard, attempting to explain the Fisher information 
136: matrices we compute in sec.~\ref{nce} in terms of monotone metrics. 
137: In sec.~\ref{nce}, we also formulate a conjecture as to the increase in
138: efficiency achievable using non-separable optimal 
139: measurements for $N$ copies of $m$-level
140: quantum systems in general.
141: 
142: To begin our study, immediately below in sec.~\ref{goforit}, 
143: we  expand upon an observation \cite[p. 2684]{slatjmp} regarding 
144:  an information-theoretic relationship 
145: between certain classical
146: and quantum entities --- that is, the Fisher information matrix for 
147: a certain (quadrinomial) multinomial probability distribution and 
148: the quantum Helstrom information matrix (proportional to the 
149: Bures metric tensor), and its implications for 
150: optimal measurements.
151: 
152:  In sec.~\ref{uc} we examine further 
153: ramifications on issues 
154: of state estimation \cite{gill,helstrom} and 
155:  universal coding (data compression) \cite{cb1,kratt,kratt2,jozsa2}.
156: There appears to be an interesting relation between the devising of
157: optimal measurements as in \cite{vidal}, 
158: and universal quantum coding, as both processes involve
159: averaging with respect to isotropic prior probability distributions 
160: by ``projecting onto total spin eigenspaces, and within each such subspace,
161: onto total spin eigenstates with maximal total spin component in some 
162: direction'' \cite{vidal} --- cf. \cite[eqs. (5.33) and (5.34)]{vidal} 
163: and \cite[eq. (2.48)]{kratt}. The particular prior distribution which 
164: yields both the minimax and maximin for the universal quantum coding of
165: the two-level systems is based on the {\it quasi-Bures} metric, a particular
166: example of a monotone metric. We attempt in sec.~\ref{fishmono} 
167: to relate the Fisher information 
168: matrices we compute in sec.~\ref{nce} to the monotone metrics.
169: \section{Proportionality between Helstrom and Fisher Information 
170: Matrices} \label{goforit}
171: The density matrices (\ref{bloch})
172: turn out to have an intimate relationship
173: with a particular form of multinomial (that is, 
174: quadrinomial) probability distributions --- the 
175: {\it four} distinct possible outcomes
176: being  assigned probabilities
177: \begin{equation} \label{qpd}
178:  x^2,\quad y^2,\quad z^2, \quad 1-x^2-y^2-z^2 .
179: \end{equation}
180: One can attach to the three-dimensional convex set of two-level 
181: quantum systems (\ref{bloch}), 
182: adapting  one (the simplest) of the ``explicit'' 
183: formulas of Dittmann \cite[eq. (3.7)]{ditt1} \cite{ditt2},
184: \begin{equation}
185: d_{Bures}(\rho,\rho + \mbox{d} \rho)^2 =
186: {1 \over 4} \mbox{Tr} \{ \mbox{d} \rho \mbox{d} \rho +{1 \over |\rho|}
187: (\mbox{d} \rho  - \rho \mbox{d} \rho ) (\mbox{d} \rho -\rho \mbox{d} \rho) \},
188: \end{equation}
189: the $3 \times 3$ quantum (Helstrom) information
190: matrix \cite{helstrom,gill,barn}
191:  (that is, {\it four} times 
192: the Bures metric tensor \cite{ditt2,hub1,hub2,bc}),
193: \begin{equation} \label{niu}
194: H_{q}(x,y,z) = {1 \over  (1-x^2-y^2-z^2)} \pmatrix{1-y^2-z^2 & x y & x z \cr
195: x y & 1- x^2 -z^2 & y z \cr
196: x z & y z & 1-x^2 -y^2 \cr}.
197: \end{equation}
198: We  use the subscripts $q$ and $c$ --- in a suggestive, perhaps not
199: fully rigorous manner --- to denote results stemming from quantum or
200: classical considerations. Also, note that (\ref{niu}) ``blows up'' at the
201: pure states themselves --- so it will be problematical, at best, to 
202: directly compare
203: results pertaining to (\ref{niu}) with ones based on {\it pure state}
204: models \cite{gill,fuji}.
205: 
206: In spherical coordinates $(r,\theta,\phi$), $x = r \cos{\theta},
207: y =r \sin{\theta} \cos{\phi}, z = r \sin{\theta} \sin{\phi}$, the
208: matrix (\ref{niu})  takes a
209:  {\it diagonal} form,
210: \begin{equation} \label{sPh}
211: H_{q}(r,\theta,\phi)  =  \pmatrix{ {1 \over 1 - r^2} & 0 & 0 \cr
212: 0 & r^2 & 0 \cr
213: 0 & 0 & r^2 \sin^2{\theta} \cr},
214: \end{equation}
215: for this {\it orthogonal} system of coordinates (cf. \cite{tod}).
216: (Below, in the interest of succinctness, 
217: we will replace the frequently-occurring 
218: expression $x^2+y^2+z^2$ by its equivalent, $r^2$.)
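
As a brief consistency check on (\ref{sPh}), and assuming the shorthand
${\bf v} = (x,y,z)^{T}$ for the Bloch vector and ${\bf 1}_{3}$ for the
$3 \times 3$ identity (notation introduced only for this remark), one can
write (\ref{niu}) as the identity plus a rank-one term,
\begin{displaymath}
H_{q}(x,y,z) = {\bf 1}_{3} + {{\bf v} {\bf v}^{T} \over 1-r^2}, \qquad
\mbox{d} {\bf v}^{T} H_{q} \, \mbox{d} {\bf v} =
|\mbox{d} {\bf v}|^2 + {({\bf v} \cdot \mbox{d} {\bf v})^2 \over 1-r^2}.
\end{displaymath}
Since $|\mbox{d} {\bf v}|^2 = \mbox{d} r^2 + r^2 \mbox{d} \theta^2 +
r^2 \sin^2{\theta} \, \mbox{d} \phi^2$ and
${\bf v} \cdot \mbox{d} {\bf v} = r \, \mbox{d} r$, this gives
\begin{displaymath}
\mbox{d} {\bf v}^{T} H_{q} \, \mbox{d} {\bf v} =
{\mbox{d} r^2 \over 1-r^2} + r^2 \mbox{d} \theta^2 +
r^2 \sin^2{\theta} \, \mbox{d} \phi^2 ,
\end{displaymath}
whose coefficients are the diagonal entries of (\ref{sPh}).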
219: 
220: Now,  the quantum information matrices (\ref{niu}) and (\ref{sPh})
221: are   simply proportional to the (classical) Fisher information 
222: \cite{frieden} matrices $I_{c}(x,y,z)$ and $I_{c}(r,\theta,\phi)$
223: for the quadrinomial probability distribution (\ref{qpd}).
224: (By way of algorithmic example, the $xy$-entry  of the $3 \times 3$
225: Fisher information matrix --- in its Cartesian coordinate form, 
226: $I_{c}(x,y,z)$ --- is
227:  computable as the expected value
228: of the 
229: [two-fold] product of the logarithmic derivatives of (\ref{qpd})
230:  with respect to
231: $x$ and with respect to $y$.)
232: More precisely, the nine entries of  $I_{c}(x,y,z)$  are
233: all {\it four} times the 
234: corresponding entries of (\ref{niu}), that is
235: \begin{equation}
236: I_{c}(x,y,z) = 4 H_{q}(x,y,z).
237: \end{equation}
238: A natural explanation for this  phenomenon is that the 
239: {\it information geometry} \cite{murray}
240:  of both models is that of the standard metric on
241: the surface of a three-sphere in four-dimensional Euclidean space 
242: \cite{bc,kass}.
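
The factor of four can be checked entry by entry. As a sketch, label the
four probabilities in (\ref{qpd}) as $p_{1} = x^2$, $p_{2} = y^2$,
$p_{3} = z^2$ and $p_{4} = 1-r^2$ (the labels $p_{i}$ are introduced here
only for illustration). The expected product of logarithmic derivatives is
$\sum_{i} (\partial_{x} p_{i}) (\partial_{y} p_{i}) / p_{i}$, so that
\begin{displaymath}
[I_{c}]_{xx} = {(2 x)^2 \over x^2} + {(-2 x)^2 \over 1-r^2}
= {4 (1 -y^2 -z^2) \over 1-r^2}, \qquad
[I_{c}]_{xy} = {(-2 x) (-2 y) \over 1-r^2} = {4 x y \over 1-r^2}.
\end{displaymath}
These are four times the $(1,1)$ and $(1,2)$ entries of (\ref{niu}); the
remaining entries follow by permuting $x$, $y$ and $z$.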
243: 
244: Both quantum (Helstrom) information and Fisher information 
245: possess the property of {\it additivity}, that is, for $N$ independent 
246: identical density matrices
247: or probability distributions, the information matrices 
248: (possibly scalars) are $N$ times those
249: for a single one  \cite[exer. 1.10]{gill2}
250: \cite[sec. VI.4]{helstrom}
251:  \cite{kagan,chentsov,kagan2,rao}.
252: 
253: By the quantum version of the Cram\'er-Rao theorem \cite{helstrom},
254: the inverse matrix 
255: $H_{q}(x,y,z)^{-1}$  serves as a lower
256: bound on the variance-covariance matrix $V(x,y,z)$
257: for any {\it unbiased}
258: estimator of the parameters ($x,y,z$) of $\rho$.
259: (This means that the matrix 
260: difference, $V(x,y,z) -H_{q}(x,y,z)^{-1}$, must be  nonnegative definite,
261: that is, have all its
262: eigenvalues nonnegative.)
263:  In this regard,
264: \begin{equation} \label{inv}
265: H_{q}(x,y,z)^{-1} =  \pmatrix{1 -x^2 & -x y & - x z \cr
266: - x y & 1-y^2 & - y z \cr
267: -x z & - y z & 1-z^2 \cr}.
268: \end{equation} 
269: (Of course,  $H_{q}(r,\theta,\phi)^{-1}$ is diagonal.)
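
The inverse (\ref{inv}) can be verified directly. Assuming the shorthand
${\bf v} = (x,y,z)^{T}$ and ${\bf 1}_{3}$ for the $3 \times 3$ identity
(not notation used elsewhere in the text), (\ref{niu}) reads
$H_{q} = {\bf 1}_{3} + {\bf v} {\bf v}^{T}/(1-r^2)$ and (\ref{inv}) reads
$H_{q}^{-1} = {\bf 1}_{3} - {\bf v} {\bf v}^{T}$. Using
${\bf v}^{T} {\bf v} = r^2$,
\begin{displaymath}
\lgroup {\bf 1}_{3} + {{\bf v} {\bf v}^{T} \over 1-r^2} \rgroup
\lgroup {\bf 1}_{3} - {\bf v} {\bf v}^{T} \rgroup
= {\bf 1}_{3} + {\bf v} {\bf v}^{T}
\lgroup -1 + {1 \over 1-r^2} - {r^2 \over 1-r^2} \rgroup = {\bf 1}_{3} .
\end{displaymath}
In particular, $H_{q}^{-1}$ has eigenvalue $1-r^2$ along ${\bf v}$ and
eigenvalue 1 (twice) transverse to it.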
270: 
271: By dint of the  additivity of information, in conjunction with the
272:  Cram\'er-Rao theorem (cf. \cite[eq. (26)]{gill}), one can 
273: conclude that it is {\it not}
274:  possible to devise
275: for  $N < 4$ independent identical two-level systems, an {\it oprom}
276: \cite{gill2,busch}, which has
277: for its outcomes the quadrinomial distribution (\ref{qpd}) 
278: (cf. \cite{vidal,bennett}).
279: (When we attempted to construct such an oprom for the 
280: case $N=2$, we found that the four
281:  operators could {\it not} all be nonnegative 
282: definite if they were to yield (\ref{qpd}).) However, for
283: $N \geq 4$, the question 
284: of whether such an oprom exists would appear
285:  to be a completely open one --- since now
286: the Cram\'er-Rao theorem does {\it not} rule out its possibility.
287: (The results of Vidal {\it et al} \cite{vidal}
288: show that an optimal {\it minimal} number of measurements for $N>3 $ is
289: at least {\it fifteen},
290:  exceeding  the number {\it four} for  an oprom that would give 
291: as its outcomes, the quadrinomial probability 
292: distribution (\ref{qpd}).) If such an oprom could be found for $N=4$ itself,
293: then the Cram\'er-Rao inequality would be {\it fully} saturated.
294: \section{Analyses of {\it Optimal} Measurements of Vidal {\it et al} for
295: $N$ Copies of Two-Level Quantum Systems} 
296: \label{nce}
297: \subsection{Computation of the Fisher Information Matrices} \label{omer}
298: \subsubsection{$N=2$}
299: Let us now consider the probability distribution
300:  in \cite{vidal} obtained from the optimal minimal number (five) of
301: measurements for the case of $N=2$ identical independent copies of
302: the two-level systems (\ref{bloch}). 
303: The  five probabilities --- as we have explicitly found --- 
304: can be written as (the three)
305: \begin{equation} \label{ped}
306: {1 \over 4} (1-r^2),\quad  {3 \over 16} (1+z)^2, \quad {1 \over 48}
307: (8 x^2 -4 \sqrt{2} x (z-3) +(z-3)^2) ,
308: \end{equation}
309: together with the pair
310: \begin{displaymath}
311: {1 \over 48} (9 + 2 x^2 \pm 4 \sqrt{3} x y + 6 y^2 + 2 \sqrt{2}
312: (x \pm \sqrt{3} y) (z-3) -6 z + z^2).
313: \end{displaymath}
314: 
315: Quite remarkably, the associated Fisher information matrix 
316: ($\tilde{I}_{c}$) turns out
317: to  precisely equal the quantum (Helstrom) information matrix,
318: $H_{q}(x,y,z)$  --- and not $2 H_{q}(x,y,z)$, which is the
319:  upper bound furnished by the
320: quantum Cram\'er-Rao theorem. So, the bound could be said to be
321: ``half-saturated''.
322: (In regard to this specific result, R. Gill has observed that there may
323: exist other measurement schemes which are {\it sub-optimal} according to the 
324: {\it fidelity} criterion of \cite{vidal}, but superior
325:  in terms of Fisher information (cf. \cite{tarvid}).)
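
The normalization of these five probabilities can be checked by hand.
As a sketch relying only on the expressions displayed above, the last
three probabilities are perfect squares,
\begin{displaymath}
{1 \over 48} (2 \sqrt{2} x - (z-3))^2, \qquad
{1 \over 48} (\sqrt{2} (x \pm \sqrt{3} y) + (z-3))^2,
\end{displaymath}
and their sum is ${1 \over 4} (x^2 + y^2) + {1 \over 16} (z-3)^2$, the
terms linear in $z-3$ cancelling. Adding the second probability gives
\begin{displaymath}
{1 \over 4} (x^2 + y^2) + {1 \over 16} \lgroup (z-3)^2 + 3 (1+z)^2 \rgroup
= {1 \over 4} r^2 + {3 \over 4},
\end{displaymath}
which together with the first probability, ${1 \over 4} (1-r^2)$, sums to 1.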
326: \subsubsection{$N=3$}
327: For an optimal minimal set of measurements for $N=3$, we can take the eight
328: probabilities, consisting of the four pairs,
329: \begin{equation}
330:  {(1 \pm x)^3 \over 12},\quad
331: {(1 \pm y)^3 \over 12}, \quad {(1 \pm z)^3 \over 12}, \quad {1 \over 4} 
332: (1 \pm {x+y+z \over \sqrt{3}})  (1 -  r^2) .
333: \end{equation}
334: The associated Fisher information matrix is expressible as
335: \begin{equation} \label{pkd}
336: 2 H_{q}(x,y,z) + {1 \over 2( (x+y+z)^2 -3)}  \pmatrix{a & b & b \cr
337: b & a & b \cr
338: b & b & a \cr},
339: \end{equation}
340: where $a = 2 (1-x y - x z - y z)$ and $b=-1+r^2$.
341: The second summand in (\ref{pkd}) is {\it negative} definite (having two 
342: of its three negative
343: eigenvalues equal to $-{1 \over 2}$), while $3 H_{q}(x,y,z)$ is the upper bound
344: on the Fisher information matrix 
345: provided by the Cram\'er-Rao theorem.
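
The eigenvalues of the second summand in (\ref{pkd}) can be obtained in
closed form. As a sketch, write $s = x+y+z$ (a shorthand assumed here only)
and note that a matrix with $a$ on the diagonal and $b$ elsewhere has
eigenvalue $a+2 b$ on the vector $(1,1,1)$ and eigenvalue $a-b$ on the
plane orthogonal to it. With $a$ and $b$ as given,
\begin{displaymath}
a - b = 3 - s^2, \qquad a + 2 b = 3 r^2 - s^2 .
\end{displaymath}
Multiplying by the prefactor ${1 \over 2 (s^2 -3)}$ yields the eigenvalue
$-{1 \over 2}$ (twice) and
\begin{displaymath}
- {3 r^2 - s^2 \over 2 (3 - s^2)} \leq 0,
\end{displaymath}
where $s^2 \leq 3 r^2 \leq 3$ by the Cauchy-Schwarz inequality. This third
eigenvalue vanishes when $x=y=z$, so at such points the summand is
negative semidefinite rather than strictly negative definite.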
346: \subsubsection{$N=4$}
347: An optimal minimal set of measurements for $N=4$ yields a 
348: fifteen-vector of probabilities. The Fisher information matrix for this
349: probability distribution is
350: \begin{equation} \label{cue}
351: 3 H_{q}(x,y,z) + {1 \over 12} \pmatrix{-7-5 y^2 - 5 z^2 & 5 x y & 5 x z \cr
352: 5 x y & -7 - 5 x^2 - 5 z^2 & 5 y z \cr
353: 5 x z & 5 y z & -7 -5 x^2 - 5 y^2 \cr}.
354: \end{equation}
355: The second term is {\it negative} definite with one eigenvalue
356: equal to $-{7 \over 12}$ and the other two, $ -{1  \over 12} 
357: (7 + 5 r^2)$. If we subtract (\ref{cue}) from 
358: the Cram\'er-Rao upper bound $4 H_{q}(x,y,z)$, we obtain (as we must) 
359: a nonnegative definite
360: matrix, having  two eigenvalues
361:  ${1 \over 12} (19 + 5 r^2)$ and one,
362: ${7 \over 12} + {1 \over 1 - r^2}$.
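
These eigenvalues follow from a short calculation. As a sketch, with the
assumed shorthand ${\bf v} = (x,y,z)^{T}$ and ${\bf 1}_{3}$ for the
$3 \times 3$ identity, the second term of (\ref{cue}) is
\begin{displaymath}
-{1 \over 12} \lgroup (7 + 5 r^2) {\bf 1}_{3} - 5 {\bf v} {\bf v}^{T} \rgroup ,
\end{displaymath}
with eigenvalue $-{1 \over 12} (7 + 5 r^2 - 5 r^2) = -{7 \over 12}$ along
${\bf v}$ and $-{1 \over 12} (7 + 5 r^2)$ on the two transverse directions.
Since $H_{q} = {\bf 1}_{3} + {\bf v} {\bf v}^{T}/(1-r^2)$ has eigenvalues
${1 \over 1-r^2}$ (along ${\bf v}$) and 1 (transverse), the difference
between $4 H_{q}$ and (\ref{cue}) has eigenvalues
\begin{displaymath}
{1 \over 1-r^2} + {7 \over 12}, \qquad
1 + {7 + 5 r^2 \over 12} = {19 + 5 r^2 \over 12} \quad \mbox{(twice)}.
\end{displaymath}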
363: \subsubsection{$N=5$}
364: For $N=5$, a twenty-vector of probabilities was obtained for the optimal 
365: minimal number of measurements. The Fisher information matrix can be 
366: expressed as the sum of $4 H_{q}(x,y,z)$ (which dominates it, while
367: $3 H_{q}(x,y,z)$ does not) 
368: and a {\it negative}
369: definite matrix, having one of its three negative eigenvalues equal to
370: $-{3 \over 16} (5+3 r^2)$. This negative definite matrix
371: can be written as the product of ${1 \over 16 (-3 +(x+y+z)^2)}$ 
372: and a $3 \times 3$  matrix,
373: the $(1,1)$  cell of which is
374: \begin{equation} \label{fds}
375: -2 (-20 + 7 y^4 +9 y^3 z - 11 z^2 + 7 z^4 - 5 x^3 (y+z) + 3 y z (5 + 3 z^2) +
376: \end{equation}
377: \begin{displaymath}
378: 3 x (y + z) ( 5 + 3 y^2 + 3 z^2) + x^2 (10 + 7 y^2 - 5 y z + 7 z^2)
379:  + y^2 (-11 +14 z^2))
380: \end{displaymath}
381: and the  $(1,2)$ off-diagonal entry is
382: \begin{equation}
383: -5 x^4 + 14 x^3 y + 2 x^2 (5 + 9 y^2 + 14 y z- 5 z^2) -
384: 5 (-1+y^2+z^2)^2 + 14 x y (-3 + (y +z)^2).
385: \end{equation}
386: The remaining cells are obtainable by simple symmetry arguments (for example, 
387: the (2,2) cell can be gotten by interchanging $x$ and $y$ in (\ref{fds})).
388: \subsubsection{$N=6$}
389: For $N=6$, we used an optimal (but not minimal) set of thirty-three
390: measurements. We found --- using a large number of randomly generated 
391: points $(x,y,z)$ --- that the associated Fisher information matrix
392: was strictly dominated by $5 H_{q}(x,y,z)$, but not by $4.99  H_{q}(x,y,z)$.
393: The Fisher information matrix takes the form (cf. (\ref{cue})) 
394: \begin{equation} \label{owd}
395: 5 H_{q} (x,y,z) + {1 \over 120} \pmatrix{a & A  x y  &  A x z  \cr
396: A x y  & b&   A y z  \cr  A x z  &  A y z  & c \cr},
397: \end{equation}
398: where 
399: \begin{equation}
400: A=193 - 31 r^2, \quad a = - 125 -146 y^2 - 146 z^2 + 31 (y^2 +z^2)^2 
401: +x^2 (47 +31 y^2 +31 z^2),
402: \end{equation}
403: and the diagonal entry 
404: $b$ can be obtained from $a$ by interchanging $x$ and $y$,  
405: and $c$ from $a$ by interchanging $x$ and $z$. 
406: 
407: One of the three negative
408: eigenvalues of the second (``residual'') matrix in (\ref{owd}) is
409: $(125 -172 r^2 + 47 r^4)/(120 (-1+r^2))$. Now, if we were to rewrite 
410: (\ref{owd}) in the form of 
411: $4.99  H_{q}(x,y,z)$ plus a {\it slightly} revised residual matrix,
412: the eigenvalue in question would be altered only in the respect that 
413: the constant 125 would change to 123.8. This would render
414: it {\it positive} for
415: $r >.992348$, leading to a loss of strict dominance for $r \in 
416: [.992348,1]$.
417:  In this specific sense, the upper bound of $5 H_{q}(x,y,z)$
418: on the Fisher information matrix is {\it tight}.
419: The residual matrix for $N=4$ strictly dominates that for $N=6$. This 
420: indicates that the ``fit'' of $(N-1) H_{q}(x,y,z)$ to the Fisher information
421: matrix for optimal measurements of $N$ copies {\it improves} as $N$
422: increases.
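
The threshold quoted above can be reproduced as follows. This is a sketch
that assumes only the radial eigenvalue of the residual matrix in
(\ref{owd}), which simplifies to ${1 \over 120} (-125 + 47 r^2)$, and the
radial eigenvalue ${1 \over 1-r^2}$ of $H_{q}$. Moving $0.01 H_{q}$ into
the residual changes the radial eigenvalue to
\begin{displaymath}
{0.01 \over 1-r^2} + {-125 + 47 r^2 \over 120}
= {123.8 - 172 r^2 + 47 r^4 \over 120 (-1 + r^2)} .
\end{displaymath}
The numerator vanishes at
\begin{displaymath}
r^2 = {172 - \sqrt{172^2 - 4 \cdot 47 \cdot 123.8} \over 94} \approx .98476,
\end{displaymath}
that is, $r \approx .99235$, beyond which the numerator is negative and
the eigenvalue positive.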
423: \subsubsection{$N=7$} \label{ssecn7}
424: For $N=7$, employing a 42-vector of probabilities, we found the Fisher
425: information matrix to be strictly dominated by $6 H_{q}(x,y,z)$, but {\it not}
426:  by
427: $5.99 H_{q}(x,y,z)$.
428: Reviewing our previous analyses, we then found that the
429: analogous situation held also  for $N=3,\ldots,6$, 
430: that is, the Fisher information
431: matrix was dominated by $(N-1) H_{q}(x,y,z)$, but not by $(N-1.01)
432: H_{q}(x,y,z)$. The violations of these 
433: {\it diminished} bounds occur for nearly pure states, that is 
434: $r \approx 1$.
435: 
436:  Pursuing this line of thought, 
437: if we restrict consideration to the more mixed states for which
438:  $r < {1 \over 2}$, then for $N=7$ we have found that $3.9 H_{q}(x,y,z)$,
439: but not $3.85 H_{q}(x,y,z)$
440: bounds the Fisher information matrix for the optimal set of measurements.
441: Calculations suggest the hypothesis that in the neighborhood of 
442: the fully mixed
443: state $r=0$, the bound 
444: on the Fisher information matrices approaches from above
445:  $N  H_{q}(0,0,0)/2$, that is 
446: ${N \over 2}$ times the $3 \times 3$ identity matrix. 
447: Now, the fully mixed state is classical (binomial) in character, while the
448: pure states are quantum in nature. (It is interesting to note that Frieden
449: finds that in classical scenarios, only {\it one-half} of the bound or 
450: phenomenological information $J$ is utilized in the intrinsic 
451: quantum information $I$ \cite[eqs. (5.39), (6.55)]{frieden}.
452: ``In all covariant quantum theories (e. g., quantum mechanics, quantum 
453: gravity) $I$ and $J$ are exactly equal. In deterministic classical theories
454: such as classical electromagnetics and general relativity $I=J/2$. 
455: But in statistical classical theories $I=J$ again'' [e-mail message
456: from Frieden].)
457: \subsubsection{$N>7$}
458: We are not able to
459: proceed any further, that is, for $N>7$, as corresponding
460: sets of optimal measurements do not 
461: appear to be presently available. As a {\it caveat} to the reader, 
462: let us point out that to recreate the optimal measurements for the
463: cases $N=6$ and 7 (which unlike the instances $N<6$, were not
464: formally demonstrated to be minimal in character), 
465: it is necessary to rely upon the quant-ph preprint version 
466: (9803066) of \cite{vidal2}, since there are certain errors (as confirmed
467: in an e-mail from R. Tarrach, though no formal {\it erratum} has 
468: appeared) in the final, published paper.
469: \subsection{Properties of the Computed Fisher Information Matrices}
470: \subsubsection{{\it Diagonal} nature for {\it even} $N$
471: in spherical
472: coordinates} \label{dnfe}
473: We have found that the Fisher information matrices given above 
474: for the optimal measurements of Vidal {\it 
475: et al} \cite{vidal} for both $N=4$ and 6
476: are {\it diagonal} in spherical coordinates ($r,\theta,\phi$).
477: For $N=4$, this is
478: \begin{equation} \label{diagn=4}
479: {1 \over 12} \pmatrix{ {29 + 7 r^2  \over 1 - r^2} & 0 & 0 \cr
480: 0 &  r^2 (29 -  5 r^2) & 0 \cr
481: 0 & 0 & r^2 (29  - 5  r^2) \sin^{2}{\theta} \cr},
482: \end{equation}
483: and for $N=6$,
484: \begin{equation} \label{diagn=6}
485: {1 \over 120} \pmatrix{ {475 + 172 r^2 - 47 r^4 \over 1 - r^2} & 0 & 0 \cr
486: 0 & r^2 (475 - 146 r^2 + 31 r^4) & 0 \cr
487: 0 & 0 & r^2 (475 -146 r^2 + 31 r^4) \sin^{2}{\theta} \cr}.
488: \end{equation}
489: For $N=2$, we also have a corresponding diagonal matrix, that is,
490: (\ref{sPh}). 
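
The diagonal entries can be recovered from the Cartesian forms. As a
sketch (the symbols $\lambda_{r}$ and $\lambda_{t}$ are introduced here
for illustration only), a Cartesian matrix that is a combination of the
identity and the outer product of $(x,y,z)$ with itself has one eigenvalue
$\lambda_{r}$ along the radial direction and a doubly degenerate eigenvalue
$\lambda_{t}$ transverse to it. In spherical coordinates, with scale
factors $1$, $r$ and $r \sin{\theta}$, it becomes diagonal with entries
$\lambda_{r}$, $r^2 \lambda_{t}$ and $r^2 \sin^{2}{\theta} \lambda_{t}$.
For $N=4$, (\ref{cue}) gives
\begin{displaymath}
\lambda_{r} = {3 \over 1-r^2} - {7 \over 12}
= {29 + 7 r^2 \over 12 (1-r^2)}, \qquad
\lambda_{t} = 3 - {7 + 5 r^2 \over 12} = {29 - 5 r^2 \over 12},
\end{displaymath}
in agreement with (\ref{diagn=4}).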
491: 
492: Cox and Reid \cite[p. 2]{cox} have listed three ``consequences of
493: orthogonality'' of the parameterization of a Fisher information matrix, such
494: as we have just observed.
495: These are that: 
496: (i) the maximum likelihood estimates of the means of the parameters
497: are asymptotically independent; (ii) the asymptotic standard error for
498: estimating one parameter is the same whether the other parameters are treated
499: as known or unknown; and (iii) there may be simplifications in the numerical
500: determination of the means of the parameters.
501: ``While orthogonality can always be achieved locally, global orthogonality
502: is possible only in special cases'' \cite[p. 2]{cox}.
503: In accompanying discussions to \cite{cox}, Sweeting identifies four
504: advantages to orthogonalization --- computation, approximation, interpretation,
505: and elimination of nuisance parameters --- while Barndorff-Nielsen, as well as
506: Moolgavkar and Prentice, 
507: explain parameter orthogonality in terms of Frobenius' Theorem. The latter
508: authors also 
509: indicate that the theorem of de Rham \cite[p. 187]{kobayashi}  gives 
510: necessary and sufficient conditions for each 
511: orthogonal parameter to be independent of
512: the others (as they are {\it not} in our three even-$N$ examples just 
513: given).
514: 
515: \subsubsection{Pure- and fully mixed state limits}
516: Again using spherical coordinates, it is interesting to note
517: that for the {\it odd} cases of $N=3,5,7$, in the pure state limit
518: ($r \rightarrow 1$), the off-diagonal elements of the corresponding
519: $3 \times 3$ Fisher information matrix converge to zero. 
520: In all six (both odd and even) cases, in this same limit, the (1,1)-entries
521: are  indeterminate, the (2,2)-entries are  ${N \over 2}$ and the 
522: (3,3)-entries are ${N \sin^{2}{\theta} \over 2}$.
523: 
524: For the fully mixed state, $r=0$
525: (allowing the angular variables $\theta$ and $\phi$ to remain free), 
526: the only non-zero entry is the (1,1)-cell.
527: For $N=2$ it is 1, for $N=3$ it is
528: \begin{equation}
529: {1 \over 6} \lgroup 10 + \sin{2 \theta} (\cos{\phi} +\sin{\phi}) +\sin^{2}{\theta}
530: \sin{2 \phi} \rgroup,
531: \end{equation}
532: for $N=4$ it is ${29 \over 12}$, for $N=5$, it is ${(103 + 5 \cos{2 \phi}) 
533: \over 32}$, for $N=6$ it is ${95 \over 24}$, and for $N=7$,
534: \begin{equation}
535: {1 \over 96} \lgroup 456 \cos^{2}{\theta} +7 \sin{2 \theta}
536: (\cos{\phi} +\sin{\phi}) + \sin^{2}{\theta} (456 + 7 \sin{2 \phi}) \rgroup.
537: \end{equation}
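
For the even cases, both limits can be read off directly from the diagonal
forms (\ref{sPh}), (\ref{diagn=4}) and (\ref{diagn=6}). As a check, the
(2,2)-entries at $r=1$ are
\begin{displaymath}
1, \qquad {29 - 5 \over 12} = 2, \qquad {475 - 146 + 31 \over 120} = 3,
\end{displaymath}
for $N=2,4,6$ respectively, that is, ${N \over 2}$ in each case, while the
(1,1)-entries at $r=0$ are
\begin{displaymath}
1, \qquad {29 \over 12}, \qquad {475 \over 120} = {95 \over 24} .
\end{displaymath}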
538: \subsubsection{Integrals over Bloch sphere of volume elements} \label{volel}
539: For $N=2$, the integral of the volume element of the 
540: Fisher information matrix (that is, the square root of the determinant) 
541: over the (Bloch sphere of)
542: two-level quantum systems is $\pi^2 \approx 9.8696$, for $N=3$ it is
543: 21.0235,
544: for $N=4$, it is 
545: \begin{equation}
546: {1 \over 441} \sqrt{{29 \over 3}} \pi \lgroup 4705 E(-{7 \over 29}) 
547: -4194 K(-{7 \over 29}) \rgroup \approx 35.0281
548: \end{equation}
549: (where $E$ and  $K$ denote the corresponding elliptic integrals),
550: for $N=5$, it is 51.0763,
551:  for $N=6$, it is 69.1253, and for $N=7$, 88.8621.
552: These 
553: particular results would be needed for the application to the 
554: optimal measurements of Vidal {\it et al} \cite{vidal} 
555: of the universal coding
556: theorem of Clarke and Barron \cite{cb1}, discussed 
557: below in sec.~\ref{cuc}.
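
The $N=2$ value can be confirmed analytically. As a sketch, the square
root of the determinant of (\ref{sPh}) is
$r^2 \sin{\theta} / \sqrt{1-r^2}$, so that
\begin{displaymath}
\int_{0}^{2 \pi} \! \int_{0}^{\pi} \! \int_{0}^{1}
{r^2 \sin{\theta} \over \sqrt{1-r^2}} \, \mbox{d} r \, \mbox{d} \theta \,
\mbox{d} \phi = 4 \pi \int_{0}^{\pi / 2} \sin^{2}{t} \, \mbox{d} t
= 4 \pi \cdot {\pi \over 4} = \pi^2 ,
\end{displaymath}
using the substitution $r = \sin{t}$ (the variable $t$ is introduced here
only for this integral). For $N=4$, the same substitution applied to
(\ref{diagn=4}) reduces the volume to
\begin{displaymath}
{4 \pi \over 12^{3/2}} \int_{0}^{\pi / 2} \sin^{2}{t} \,
(29 - 5 \sin^{2}{t}) \sqrt{29 + 7 \sin^{2}{t}} \, \mbox{d} t ,
\end{displaymath}
an integral of elliptic type, consistent with the form of the result
quoted above.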
558: 
559: \subsection{Gill-Massar Traces} \label{relations}
560: Let us first observe that Gill and Massar \cite[eq.(26)]{gill} 
561: asserted that the upper (quantum [Helstrom] Cram\'er-Rao) 
562: bound $N H_{q}$ was {\it not}, in general, 
563: achievable in a multiparameter setting. This does appear to be 
564: strictly the case.
565:  However, our results for $N=2,\ldots,7$ for the
566: three-parameter $2 \times 2$ density matrices, indicate that --- using the
567: optimal measurements of Vidal {\it et al} \cite{vidal} --- one can,
568: by choosing $N$ large enough, come indefinitely close for 
569: the nearly pure states to this
570: bound.
571: 
572: To further relate to these analyses of Gill and Massar, we have computed for
573: $N=2,\ldots,7$, the traces of the product of $H_{q}(x,y,z)^{-1}$, given
574: in (\ref{inv}), and the Fisher information matrices 
575: we have obtained using the optimal
576: measurements of Vidal {\it et al}. (The traces of Fisher information matrices
577: play a central role in the work of Frieden on the fundamental 
578: equations of physics \cite[sec. 2.3.2]{frieden}.) 
579: For the estimation 
580: of  pure states, Theorem I in \cite{gill}
581: asserts that this trace quantity 
582: is bounded above by $N$, while Theorem II there says
583: that the same bound applies to  mixed states, with the restriction
584: to {\it separable} measurements. It is also
585: demonstrated there that these bounds are attainable --- and for large $N$ 
586: {\it simultaneously} for {\it all} states.
587: 
588: For $N=2$, it is easy to see, in the context of the 
589: results above, that this 
590: (``Gill-Massar'') trace result is simply 3. For $N=3$, we get another 
591: constant, 5, for the trace.
592: For $N=4$, we obtain
593: \begin{equation} 
594: GM_{4} = {29 - r^2 \over 4},
595: \end{equation}
596:  which is 7 for pure states
597: and 7.25 for the fully mixed state.
598: For $N=5$, the Gill-Massar trace is
599: \begin{equation}
600: GM_{5} = {19 - r^2 \over 2}, 
601: \end{equation}
602: which is 9 for pure states and 9.5 for the fully 
603: mixed state. For $N=6$, it is
604: \begin{equation}
605: GM_{6} = {95 - 8 r^2 + r^4 \over 8}.
606: \end{equation}
607: This last 
608: expression is monotonically decreasing from ${95 \over 8} =11.875$ at $r=0$ to
609: 11, that is, $2 N -1$ at $r=1$. For $N=7$, the Gill-Massar trace is
610: \begin{equation}
611:  GM_{7} = {57 - 6 r^2 + r^4 \over 4}, 
612: \end{equation} 
613: which 
614: equals ${57 \over 4} = 14.25$ at $r=0$ and 13 at $r=1$, being again
615: $2 N - 1$.
616: (In an earlier version of this paper, quant-ph/0002063, the results 
617: given --- including Fig. 1, plotting the Gill-Massar trace --- for
618: $N=7$ were ``anomalous'', in this regard. We subsequently ascertained
619:  that they were erroneous
620: in nature, due to a programming error.) In Fig.~\ref{gmt}, we plot 
621: ${GM_{N} \over (2 N-1)}$ for $N=4,5,6$ and 7.
622: \begin{figure}
623: \centerline{\psfig{figure=GMtraces.eps}}
624: \caption{Gill-Massar traces for $N=4,5,6$ and 7 
625: scaled by their values at the pure states, $r=1$, that is, 
626: $2 N -1 $. The $y$-intercepts for $r=0$, corresponding to the 
627: fully mixed state, increase with $N$.}
628: \label{gmt}
629: \end{figure}
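
For even $N$, the polynomial forms of the traces can be checked against
the diagonal matrices (\ref{diagn=4}) and (\ref{diagn=6}). As a sketch,
the trace is invariant under the change to spherical coordinates, in which
the inverse of (\ref{sPh}) is diagonal with entries $1-r^2$, $1/r^2$ and
$1/(r^2 \sin^{2}{\theta})$. Hence
\begin{displaymath}
GM_{4} = {(29 + 7 r^2) + 2 (29 - 5 r^2) \over 12} = {29 - r^2 \over 4},
\end{displaymath}
\begin{displaymath}
GM_{6} = {(475 + 172 r^2 - 47 r^4) + 2 (475 - 146 r^2 + 31 r^4) \over 120}
= {95 - 8 r^2 + r^4 \over 8} .
\end{displaymath}
At $r=1$ these give 7 and 11, that is, $2 N -1$ in both cases.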
630: 
631: It is easy to see, then, that
632: in these six cases the Gill-Massar bound \cite[eq. (27)]{gill}
633: of $N$ is violated --- as Theorem III of their paper recognizes will occur
634: for {\it non-separable} measurements.
635: So, we obtain a simple pattern of $2 N -1$ for the 
636: minimum of the trace quantity
637: in question. 
638: In regard to these results, R. Gill remarked in an e-mail message 
639: of Feb. 18, 2000 that
640: ``this is all very interesting. It means that there is a big discontinuity
641: at the surface of the Bloch sphere (where none of these $3 \times 3$ Fisher
642: information matrices is well-defined), and it means that the gain in using
643: joint measurements over separate measurements for mixed states is substantial
644: throughout the Bloch sphere''.
645: \subsection{Analyses for $m$-Level {\it Pure} States} 
646: \subsubsection{$m=2$}
647: In a further effort to relate to the analyses of Gill and Massar
648: \cite{gill}, let us consider for the moment simply the two-level pure states,
649: so we set $r=1$. In terms of the polar coordinates $(\theta,\phi)$,
650: the Helstrom information matrix takes the  form (cf. (\ref{sPh}), 
651: \cite[p. 4238]{FUJI})
652: \begin{equation} \label{fgn}
653: \pmatrix{1 & 0 \cr
654: 0 & \sin^{2}{\theta} \cr}.
655: \end{equation}
656: Then, the Fisher information matrix for the optimal measurements of 
657: $N$ copies \cite{vidal2} is simply ${N \over 2}$ times (\ref{fgn}), as we 
658: have confirmed through computations for $N=2,\ldots,7$ (cf. \cite{gill}).
659: (So, in the pure state case, unlike the mixed state one, 
660: the quantum Cram\'er-Rao bound of $N$ times 
661: (\ref{fgn}) is not asymptotically 
662: approached --- though the Gill-Massar trace bound of $N$ is achievable.)
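
To spell out the last remark (a short check, assuming only that the Fisher information matrix equals ${N \over 2}$ times (\ref{fgn}), as stated above; the symbol $H$ is introduced here for the matrix (\ref{fgn})), the Gill-Massar trace is
\begin{displaymath}
\mbox{tr} \lgroup H^{-1} \cdot {N \over 2} H \rgroup = {N \over 2} \mbox{tr} \pmatrix{1 & 0 \cr 0 & 1 \cr} = N,
\end{displaymath}
independently of $\theta$ (for $\sin{\theta} \neq 0$, where $H$ is invertible), whereas the Fisher information matrix itself is only half of the Cram\'er-Rao value $N H$.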
663: \subsubsection{$m=3$} \label{tl}
We have also verified that the same basic additive
665: relation holds in the case of the {\it three}-level pure states for $N=2$,
666: using the formulas in \cite{acin}. Let us use the parameterization of
667: these states
668:  in terms
669: of {\it four} angular variables ($\theta,\phi,\chi_{1},\chi_{2})$ 
670: employed in \cite[eq. (2.1)]{cavesv},
671: \begin{equation} \label{spin1param}
672: | \psi \rangle = \mbox{e}^{\mbox{i} \chi_{1}} \sin{\theta} \cos{\phi}
673: |1 \rangle + \mbox{e}^{\mbox{i} \chi_{2}} \sin{\theta} \sin{\phi}
674: |2 \rangle + \cos{\theta} |3 \rangle.
675: \end{equation} 
676:  Then, the Helstrom information matrix 
677: is
678: \begin{equation} \label{kvi}
679: \pmatrix{4 & 0 & 0 & 0 \cr
680: 0 & 4 \sin^{2}{\theta} & 0 & 0 \cr
681: 0 & 0 & a & -\sin^{4}{\theta} \sin^{2}{2 \phi} \cr
682: 0 & 0 & -\sin^{4}{\theta} \sin^{2}{2 \phi} & b \cr},
683: \end{equation}
684: where (cf. \cite{slatprep})
685: \begin{equation}
686: a =  {1 \over 2} \lgroup 6 + 2 \cos{2 \theta} +
687: \cos{2 (\theta - \phi)} -2 \cos{2 \phi} +
688:  \cos{2(\theta+\phi)} \rgroup  \sin^{2}{\theta}  \cos^{2}{\phi},
689: \end{equation}
690: \begin{displaymath}
b=-{1 \over 2} \lgroup -6 - 2 \cos{2 \theta} + \cos{2 (\theta -  \phi)}
692: -2 \cos{2 \phi} +\cos{2(\theta +\phi)} \rgroup
693:  \sin^{2}{\theta} \sin^{2}{\phi}.
694: \end{displaymath}
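A compact way to check the lower-right block of (\ref{kvi}) is sketched here. It assumes the usual identification of the pure-state Helstrom matrix with four times the Fubini-Study form, which is consistent with the entries $4$ and $4 \sin^{2}{\theta}$ above; the shorthand $p_{1}$, $p_{2}$ is introduced only for this illustration. Write $p_{1} = \sin^{2}{\theta} \cos^{2}{\phi}$ and $p_{2} = \sin^{2}{\theta} \sin^{2}{\phi}$ for the weights of $|1 \rangle$ and $|2 \rangle$ in (\ref{spin1param}). Since $\chi_{1}$ and $\chi_{2}$ enter only as phases, the $(\chi_{1},\chi_{2})$ block is expected to be
\begin{displaymath}
\pmatrix{4 p_{1} (1 - p_{1}) & -4 p_{1} p_{2} \cr
-4 p_{1} p_{2} & 4 p_{2} (1 - p_{2}) \cr},
\end{displaymath}
with $4 p_{1} p_{2} = \sin^{4}{\theta} \sin^{2}{2 \phi}$. Expanding the first diagonal entry,
\begin{eqnarray*}
4 (1 - p_{1}) & = & 3 + \cos{2 \theta} - \cos{2 \phi} + \cos{2 \theta} \cos{2 \phi} \\
 & = & {1 \over 2} \lgroup 6 + 2 \cos{2 \theta} + \cos{2 (\theta - \phi)} - 2 \cos{2 \phi} + \cos{2 (\theta + \phi)} \rgroup ,
\end{eqnarray*}
which reproduces the prefactor of $\sin^{2}{\theta} \cos^{2}{\phi}$ in $a$.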
695: (Note that (\ref{kvi}) is free of the variables, $\chi_{1}$ and
696:  $\chi_{2}$ --- as (\ref{sPh}) is free of $\phi$.)
697: So, for $N=2$ copies of a spin-1 system, the Fisher information matrix is
698: identically (\ref{kvi}), paralleling the specific results for both the pure
699: and mixed two-level quantum systems for $N=2$. We also intend to analyze 
700: the case $N=3$, using the specific prescription for the corresponding 
701: optimal measurements in \cite[sec. 6]{acin}. 
\subsubsection{Supplementary analysis for 3-level
703: {\it mixed} states} \label{uue}
704: We have attempted --- following the 
705: general methodology laid out by Vidal {\it et al} \cite{vidal} 
706: for the {\it two}-level mixed quantum systems --- to construct
707: an optimal measurement scheme for $N=2$ copies of mixed 
708: {\it three}-level 
709: systems. In doing so, we incorporated 
710: the optimal measurements for $N=2$ copies of
711: {\it pure} three-level quantum systems presented 
712: by Ac\'in, Latorre and Pascual in \cite[sec. 5]{acin}, that
713: were utilized immediately above. (J. Latorre informs me that he and his
714: co-authors  ``did not find any manageable way to make progress'' in such 
715: extended $m=3$ 
716: {\it mixed} cases, although he did point out that Arvind had recast and 
717: further developed many of their results using Penrose rays --- in 
apparently as yet unpublished work.)
719: This led us to an oprom with {\it twelve} distinct outcomes, {\it nine}
720:  corresponding
721: to the vectors explicitly presented in \cite[eqs. (39), (40)]{acin}, 
722: and the additional {\it three}
723: coming from our own orthogonal decomposition of the 
724: associated rank three 
725: ``residual'' projector
726: (cf. \cite[eq. (3.3)]{vidal}). (A weight of ${2 \over 3}$ was applied
727: to the subset of nine outcomes.)
728: 
729: With this twelve-outcome oprom in hand, we found by {\it numerical} means
730: that the Gill-Massar trace
731: equalled a constant, 6 (while for $N=2$ 
732: copies of {\it two}-level systems this trace quantity 
733: was found in sec.~\ref{relations} also to be a constant, 3). 
734: (In \cite{slatprep}, we have 
735: been investigating the possibility of {\it symbolically}
736: inverting the $8 \times 8$ Helstrom information matrix --- making use of 
737: a recently-developed Euler angle parameterization of the $3 \times 3$ 
738: density matrices \cite{byrdslater}. The Gill-Massar trace would, of course, 
739: be the
740: trace of the product of this inverse matrix and the Fisher information 
741: matrix associated with the twelve-outcome oprom.)
742: This result and our earlier
743: ones for $m=2$, $N =2,\ldots,7$, 
744: lead us to conjecture that for non-separable optimal 
745: measurements of $N$ $m$-level 
746: quantum systems, the Gill-Massar trace for all $m$ and $N$ is exactly
747: $(2 N-1) (m-1)$ in the pure state limit, and no less than this for any
748: mixed state.
749: 
Now, for any measurement of a strictly
pure state itself, the Gill-Massar trace cannot exceed $N(m-1)$ by
Theorem I of \cite{gill}.
(This bound is known to be achievable for $m=2$ by Theorem  VII
of \cite{gill}, and for mixed states using separable measurements by
Theorem VI.) So there is a clear discontinuity displayed
by {\it non-separable optimal}
 measurements {\it near} the pure state boundary, as
well as considerably increased efficiency in estimating strictly mixed or
impure states through the use of such measurements.
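
The size of this discontinuity can be read off directly (a sketch, taking the conjectured pure state limit $(2 N-1)(m-1)$ at face value):
\begin{displaymath}
(2 N - 1)(m-1) - N (m-1) = (N-1)(m-1).
\end{displaymath}
This gives a jump of 1 for $m=2$, $N=2$ (limit 3 against bound 2) and of 2 for $m=3$, $N=2$ (limit 6 against bound 4), and it grows linearly in $N$ for fixed $m$.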
760: \subsubsection{$m=4$} \label{fl}
761: We 
762: have ascertained
763: the Helstrom information matrix for pure states of {\it four}-level
764: systems, making use
765: of the  appropriate analogue of the parameterization (\ref{spin1param})
766: presented in \cite[eq. (13)]{venki}. The 
767: six parameters naturally divide into two sets of three, and once again the
768: entries of the Helstrom 
769: information matrix are free of the
770: (three) members of one of the two sets.
771: \section{Universal Coding} \label{uc}
772: We can also apply to the three-dimensional family of 
773: quadrinomial probability distributions
774: (\ref{qpd}) certain important 
775: (classical) asymptotic results of Clarke and Barron \cite{cb1}  
776: pertaining to a number of problems, including those of universal 
777: data compression  and density estimation. Then,
778:  we can compare their
779: formulas with those for the $2 \times 2$ density matrices
780: (\ref{bloch}), based on the extension to the quantum domain 
781: of two-level systems by Krattenthaler
782: and Slater \cite{kratt,kratt2} of this work of Clarke and Barron 
783: (cf. \cite{jozsa}). (In what follows, we will denote probability distributions
784: of a general nature by $w$ and more specific ones by $W$, and subscript
785:  them --- as noted before --- by either $c$
786:  or $q$ to denote a result stemming from an analysis
787:  in the classical or 
788:  quantum domain.)
789: \subsection{Classical results of Clarke and Barron} \label{cuc}
Clarke and Barron examined the asymptotic
($N \rightarrow \infty$) relative entropy
between a true density
793: function and a joint (``Bayesian'') density function 
794: for a sequence of $N$ random variables taken to be the average of the
795: possible densities (comprising a parameterized family) with respect to a
796:  (prior) probability 
797: distribution over this family of density functions. 
798: The result of Clarke and Barron for the asymptotic relative entropy 
799: (Kullback-Leibler index) between the true density and the mixture is
800: \begin{equation} \label{ios}
801: {d \over 2} \log{{N \over 2 \pi \mbox{e}}} +{1 \over 2} \log{|I_{c}(\alpha)|} -
802: \log{w_{c} (\alpha)} +o(1),
803: \end{equation}
804: where $\alpha$ denotes the $d$-vector of variables parameterizing the
805: family of 
806: densities, $w_{c}(\alpha)$ 
807:  a prior probability distribution used
808: to average the $N$-fold products of independent identical density functions, 
809: and  $I_{c}(\alpha)$ the associated $d \times d$ Fisher information matrix.
810: As applied to our particular
811:  three-parameter ($d=3$) family of quadrinomial
812: distributions (\ref{qpd}), with $\alpha  = (r,\theta,\phi)$,
813: we have
814: \begin{equation} \label{lcy}
815: |I_{c}(r,\theta,\phi)| = \lgroup {64 \over 1-r^2}
816:  \rgroup  r^4 \sin^{2} {\theta}.
817: \end{equation}
818: Then, if we choose for the probability distribution, $w_{c}(\alpha)$, 
819: the particular one
820: \begin{equation} \label{bpr}
821: W_{c}(r,\theta,\phi) = \lgroup {1 \over \pi^{2} \sqrt{1-r^2}} \rgroup
822:   r^2 \sin{\theta}
823: \quad 
824: \propto \sqrt{|I_{c}(r,\theta,\phi)|},
825: \end{equation}
826:   the asymptotic relative
827: entropy between the true density and its Bayesian 
828: (mixture) average assumes the form
829: \cite[eq. (1.4)]{cb1}
830: \begin{equation} \label{out1}
831: {3 \over 2} \log{{N \over 2 \pi \mbox{e}}} +\log{8 \pi^{2}} +  o(1).
832: \end{equation}
833: (Let us note that  $r^2 \sin{\theta} \mbox{d} r \mbox{d} \theta 
834: \mbox{d} \phi$ is  the Jacobian 
835: determinant of
836: the transformation from Cartesian to spherical coordinates or, equivalently,
837: the volume element in spherical coordinates.)
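
The constants in (\ref{bpr}) and (\ref{out1}) can be verified in a few lines (a sketch using only (\ref{ios}) and (\ref{lcy})). From (\ref{lcy}), $\sqrt{|I_{c}|} = 8 r^2 \sin{\theta} / \sqrt{1-r^2}$, and its integral over the Bloch sphere is
\begin{displaymath}
8 \int_{0}^{1} {r^2 \over \sqrt{1-r^2}} \mbox{d} r \int_{0}^{\pi} \sin{\theta} \mbox{d} \theta \int_{0}^{2 \pi} \mbox{d} \phi = 8 \cdot {\pi \over 4} \cdot 2 \cdot 2 \pi = 8 \pi^{2},
\end{displaymath}
so that $W_{c} = \sqrt{|I_{c}|} / (8 \pi^{2})$, in agreement with (\ref{bpr}). Substituting into (\ref{ios}) with $d=3$, the two parameter-dependent terms combine to
\begin{displaymath}
{1 \over 2} \log{|I_{c}|} - \log{W_{c}} = \log{{\sqrt{|I_{c}|} \over W_{c}}} = \log{8 \pi^{2}},
\end{displaymath}
which is the constant appearing in (\ref{out1}).
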
Our particular selection of $W_{c}(r,\theta,\phi)$
is ``Jeffreys' prior'' for this case, that
is, the normalized (over the Bloch sphere) form
of the volume element ($\sqrt{|I_{c}(r,\theta,\phi)|}$)
of the Fisher information
metric (cf. sec.~\ref{volel}).
(The normalization factor, $8 \pi^{2}$, is evident
in (\ref{out1}).) Jeffreys' priors, as shown by Clarke and Barron
846: \cite{cb1},  fulfill the desideratum of yielding
847: the common {\it minimax}
848:  and {\it maximin} of the asymptotic relative entropy.
849: In the quantum analogue, though, (\ref{bpr}) does not play this 
850: distinguished role,
851: although a close (``quasi-Bures'') relative of it does \cite{kratt2,slathall}.
852: This probability distribution is
853: \begin{equation} \label{qB}
854: W_{q}(r,\theta,\phi) = 
855: .0832258  {\mbox{e} \over 1 -r^2} \lgroup
856:  {1-r  \over 1 +r} \rgroup^{1 \over 2 r} 
857: r^2 \sin{\theta}.
858: \end{equation}
859: \subsection{Quantum Results of Krattenthaler and Slater for Two-Level Systems} \label{kssec}
860: Krattenthaler and Slater \cite{kratt,kratt2} have sought to extend the 
861: general results of Clarke and Barron to the two-level {\it quantum}
862: systems (\ref{bloch}). They
863: averaged the $N$-fold 
864: {\it tensor} products of identical $2 \times 2$ density matrices 
865: (\ref{bloch}) (rather than averaging the  simple
866: products  of $N$ {\it random variables}) 
867: with respect to (spherically-symmetric/unitarily-invariant) 
probability
distributions of the form $w_{q}(r) r^2 \sin{\theta}$
870: (cf.  \cite[eq. (1.4)]{vidal}).
871:  The analogue (in terms of the {\it quantum} relative 
872: [von Neumann] 
873: entropy) of the Clarke-Barron result (\ref{ios})
874: is then ($d=3$)
875: \begin{equation} \label{pew}
876:  {3 \over 2} \log{ {N \over 2 \pi \mbox{e}}} +
877: {1 \over 2} \log{I_{q}(r)} -\log{w_{q}(r)} +  o(1),
878: \end{equation}
879: where (cf. (\ref{lcy}))
880: \begin{equation}
881: I_{q}(r) = {\mbox{e}^2 \over (1 -r^2)^{2}} \lgroup  {1-r \over 1 +r}
882:  \rgroup^{1 \over r}.
883: \end{equation}
884: So, 
885: \begin{equation}
886:  I_{q}(r) r^4 \sin^{2}{\theta} = 144.372 W_{q}(r,\theta,\phi)^{2} ,
887: \end{equation}
888: which can be compared with its classical counterpart,
889: \begin{equation}
890: |I_{c}(r,\theta,\phi)| = 64 \pi^{4} W_{c}(r,\theta,\phi)^2,
891: \end{equation}
892: where $64 \pi^{4} \approx 6234.18$.
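Both constants are simply the squared reciprocals of the respective normalization factors (a numerical check, using the rounded coefficient quoted in (\ref{qB})):
\begin{displaymath}
64 \pi^{4} = (8 \pi^{2})^{2}, \qquad 144.372 \approx \lgroup {1 \over .0832258} \rgroup^{2} \approx (12.0155)^{2}.
\end{displaymath}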
893: 
As noted in \cite{kratt2}, the quasi-Bures probability distribution, $W_{q}
895: (r,\theta,\phi)$,  given by (\ref{qB}), 
896: fulfills in the quantum domain of two-level systems
897: (\ref{bloch}), the distinguished role --- in yielding the common
898: asymptotic minimax and maximin --- of the Jeffreys' prior (that is, the
899: volume element of the Fisher information metric) in the classical sector.
900: In Fig.~\ref{nwz} we plot the term ${1 \over 2} \log{I_{q}(r)}$, 
901: present in (\ref{pew}), along with the comparable
902: (but always larger for $r<1$)
903: classical term, ${1 \over 2} \log{64 \over 1-r^2}$, in (\ref{lcy}). 
904: The units of the vertical axis are, then,  ``nats'' of information. (A nat
905:  is equal to $1/ \log_{e}{2} \approx$ 1.4427 bits.) 
906: So, in the example above, one achieves a lower relative entropy (redundancy)
907: by proceeding in the quantum domain, as opposed to the classical one.
908: 
909: \begin{figure}
910: \centerline{\psfig{figure=nat.eps}}
911: \caption{Quantum asymptotic relative entropy 
912: term --- ${1 \over 2} \log{I_{q}(r)}$ --- and its 
913: {\it larger} classical 
914: counterpart, ${1 \over 2} \log{{64 \over 1-r^2}}$, plotted against radial 
distance ($r$) in the Bloch sphere of two-level systems.}
916: \label{nwz}
917: \end{figure}
918: In the case $r=0$ (the fully mixed state), the 
919: quantum (Krattenthaler/Slater) asymptotics is given by the expression
920: \begin{equation}
921: {3 \over 2} \log{{N \over 2 \pi \mbox{e}}} -\log{w_{q}(0)} + o(1).
922: \end{equation}
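The absence of a ${1 \over 2} \log{I_{q}}$ term here reflects the limit $I_{q}(r) \rightarrow 1$ as $r \rightarrow 0$, which can be seen as follows (a short check using only the expression for $I_{q}(r)$ given above):
\begin{displaymath}
{1 \over r} \log{{1-r \over 1+r}} = -2 - {2 r^2 \over 3} - \ldots \rightarrow -2, \qquad \mbox{so} \qquad I_{q}(0) = \mbox{e}^{2} \cdot \mbox{e}^{-2} = 1 .
\end{displaymath}
The classical counterpart at $r=0$ is ${1 \over 2} \log{64} = 3 \log{2} \approx 2.079$ nats.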
923: For a pure state ($r=1$), in the case that $w_{q}(r)$
924:  is {\it continuous} and nonzero
925: at $r=1$, the asymptotics is given, in general,  by \cite{kratt2}
926: \begin{equation}
927: 2  \log{N} -3 \log{2} -\log{\pi} -\log{w_{q}(1)} + o(1).
928: \end{equation}
929: However, for the particular case of the 
930: Jeffreys' prior (\ref{bpr}), which is {\it singular} at 
931: $r =1$, we have \cite[eq. (2.53)]{kratt}
932: \begin{equation}
933: {3 \over 2} \log{N} +{1 \over 2} \log{\pi} -2 \log{2}.
934: \end{equation}
935: 
936: It would be of interest to ascertain if one can construct a
937: probability distribution for which the (classical) Fisher information 
938: matrix is equal (in spherical coordinates) to \cite[eq. (3.17)]{petzsudar}
939: \begin{equation}  \label{bwo}
940: I_{quasi-Bures} (r,\theta,\phi) 
941:  = \pmatrix{{1 \over 1 - r^2} & 0 & 0 \cr
942: 0 & {r^2 g(s) \over 1 + r} & 0 \cr
943: 0 & 0 & {r^2 g(s) \sin^{2}{\theta} \over 1 + r}},
944: \end{equation} 
945: where $s= {1 -r \over 1 + r}$ and  $g(s) = \mbox{e} s^{{s \over 1 -s}}$. 
946: (If we employ $g(s) = {2 \over 1 + s}$ in
947: (\ref{bwo}), we obtain the Helstrom 
948: information matrix $H_{q}(r,\theta,\phi)$ \cite{petzsudar}.)
949: This would yield the  {\it quantum} (but non-Helstrom) 
950: information matrix, the square root of the determinant of which is 
951: proportional to  the quasi-Bures probability
952: distribution (\ref{qB}). This probability distribution 
953: (rather than (\ref{bpr}), as originally conjectured \cite{kratt}) 
954: has been shown to yield
955: the common minimax and maximin in the universal coding of the two-level
956: quantum systems \cite{kratt2}.
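
Both statements about (\ref{bwo}) can be checked directly (a sketch; only $s = {1-r \over 1+r}$ and the two choices of $g(s)$ quoted above are used). For $g(s) = {2 \over 1+s} = 1+r$, the matrix (\ref{bwo}) reduces to the diagonal matrix with entries ${1 \over 1-r^2}$, $r^2$ and $r^2 \sin^{2}{\theta}$. For the quasi-Bures choice, ${s \over 1-s} = {1-r \over 2 r}$, so that
\begin{eqnarray*}
g(s) & = & \mbox{e} \lgroup {1-r \over 1+r} \rgroup^{{1 \over 2 r}} \sqrt{{1+r \over 1-r}}, \\
\sqrt{|I_{quasi-Bures}|} & = & {r^2 g(s) \sin{\theta} \over (1+r) \sqrt{1-r^2}}
= {\mbox{e} \over 1-r^2} \lgroup {1-r \over 1+r} \rgroup^{{1 \over 2 r}} r^2 \sin{\theta},
\end{eqnarray*}
which is (\ref{qB}) up to the normalization constant $.0832258$.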
957: 
958: \subsection{Relations between {\it Monotone Metrics} and the  
959: Fisher Information Matrices  Computed in 
960: Sec.~\ref{omer} } \label{fishmono}
961:  It would  be of considerable interest to
determine the precise nature, as $N \rightarrow \infty$,
963:  of the Fisher information matrices 
964:  corresponding to the use of optimal measurements \cite{vidal}. 
965: (``For the case of mixed states of spin 1/2 particles, or for higher spins
966: we do not know what the `outer' boundary of the set of (rescaled) achievable
967: Fisher information matrices based on arbitrary (non separable) measurements
968: of $N$ systems looks like. We have some indications about the shape of this
969: set\ldots and we know that it is convex and compact'' \cite[p. 19]{gill}.) 
970: In particular, 
971: we would like to ascertain whether or not there is convergence in 
972: form (to a diagonal matrix in spherical coordinates) between even and
973: odd values of $N$, as numerical evidence indicates, 
974: and whether or not the Fisher information matrices are asymptotically 
975: simply proportional to some specific 
976: member (\ref{bwo}) of a broad class of natural 
977: metric tensors (which includes the Bures and quasi-Bures metrics 
978: discussed in Sec.~\ref{kssec}) 
979: for the quantum states associated
980: with operator monotone functions $f(s) = {1 \over g(s)}$ \cite{petzsudar}.
981: \subsubsection{The (2,2)- and (3,3)-entries of the diagonal 
982: Fisher information matrices for even $N$}
983: In fact, if we equate the (2,2)-entries of the diagonal Fisher information
984: matrices  given in sec.~\ref{dnfe} 
985: for the optimal measurements for $N=4$ and $N=6$ to the (2,2)-cell
986: of $N$ times the general matrix (\ref{bwo}) and solve for $g(s)$, 
987: recalling that $s = {1-r \over 1 +r}$, we obtain for 
988: $N=4$,
989: \begin{equation} \label{g(s)4}
990: g(s) = {1 \over 6 (1+ s)^3}  (6 + 17 s + 6 s^2)
991: \end{equation}
992: and for $N=6$,
993: \begin{equation} \label{g(s)6}
994: g(s) = {1 \over 45 (1+s)^5} (45 + 222 s + 416 s^2 + 222 s^3 + 45 s^4). 
995: \end{equation} 
996: Both these symmetry-exhibiting 
997: functions, (\ref{g(s)4}) and (\ref{g(s)6}), as well as 
998: the corresponding 
999: (Bures/minimal monotone) result (the equation of a hyperbola) 
1000: for $N=2$, that is,
1001: \begin{equation} \label{g(s)2}
1002: g(s) = {1 \over 1 +s}
1003: \end{equation}
1004:  are monotonically-decreasing on the positive real axis 
(Fig.~\ref{gole}), but we do not
presently know (for the cases $N=4$ and 6, that is)
whether the reciprocals, $f(s) = 1/g(s)$, are {\it operator}
monotone functions, as required for membership in the class of monotone
metrics of Petz and Sud\'ar \cite{petzsudar,les}.
1010: (A function $f(s)$, mapping 
1011: the nonnegative real axis to itself, is called operator monotone if the
1012: relation $0 \leq K \leq H$ implies $0 \leq f(K) \leq f(H)$ for all matrices
1013: $K$ and $H$ of any order. The relation $K \leq H$ implies that all the 
1014: eigenvalues of $H-K$ are nonnegative.)
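That operator monotonicity is a strictly stronger requirement than ordinary monotonicity can be seen from a standard $2 \times 2$ example (added here purely as an illustration; the matrices are not taken from the references cited). With
\begin{displaymath}
K = \pmatrix{1 & 1 \cr 1 & 1 \cr}, \qquad H = \pmatrix{2 & 1 \cr 1 & 1 \cr}, \qquad
H - K = \pmatrix{1 & 0 \cr 0 & 0 \cr} \geq 0,
\end{displaymath}
the increasing function $f(s) = s^2$ gives
\begin{displaymath}
f(H) - f(K) = \pmatrix{5 & 3 \cr 3 & 2 \cr} - \pmatrix{2 & 2 \cr 2 & 2 \cr} = \pmatrix{3 & 1 \cr 1 & 0 \cr},
\end{displaymath}
whose determinant is $-1$, so that $f(K) \leq f(H)$ fails even though $0 \leq K \leq H$.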
1015: \begin{figure}
1016: \centerline{\psfig{figure=PairMonotone.eps}}
1017: \caption{Monotonically-decreasing functions $g(s)$, that is 
1018: (\ref{g(s)4}), 
1019: (\ref{g(s)6}) and (\ref{g(s)2}), obtained by equating
1020: the (2,2)-entries of the computed 
1021: Fisher information matrices (\ref{diagn=4}),  
1022: (\ref{diagn=6}) and (\ref{sPh}) for $N=4,6$ and 2, respectively, with 
1023: $N$ times the 
1024: (2,2)-entry of the general matrix (\ref{bwo}) for a monotone metric.
1025: The curve for $N=6$ dominates that for $N=4$, which in turn 
1026: dominates the hyperbola
1027: for $N=2$.}
1028: \label{gole}
1029: \end{figure}
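
The ordering of the three curves can be confirmed at the endpoints (a check using only (\ref{g(s)4}), (\ref{g(s)6}) and (\ref{g(s)2}); the subscript on $g$, labelling the number of copies, is notation introduced here). All three functions equal 1 at $s=0$ (the pure state limit, $r=1$), while at $s=1$ (the fully mixed state, $r=0$)
\begin{displaymath}
g_{2}(1) = {1 \over 2}, \qquad g_{4}(1) = {29 \over 48} \approx .604, \qquad
g_{6}(1) = {950 \over 1440} = {95 \over 144} \approx .660 .
\end{displaymath}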
1030: 
1031: If we were to include in Fig.~\ref{gole} 
1032: the corresponding function for the {\it quasi-Bures}
1033: monotone metric, that is
1034: \begin{equation}
1035: g(s) = {e s^{s \over 1 -s} \over 2},
1036: \end{equation}
1037: it would be essentially indistinguishable from the hyperbola for $N=2$
1038: (corresponding to the Bures/minimal monotone metric).
1039: \subsubsection{The (1,1)-entries of the diagonal Fisher information matrices 
1040: for even $N$}
1041: If, pursuing these lines of thought, one could develop a formula for arbitrary
1042: (even) $N$ for the (2,2)-entry of the Fisher information matrix for optimal
1043: measurements, and 
1044: obviously easily then for the (3,3)-entry (which would be
1045: the (2,2)-entry multiplied by $\sin^{2}{\theta}$), the remaining question, of 
1046: course, 
1047: would be to obtain a general formula for the (1,1)-entry. In this regard,
1048: the apparent general result 
1049: (established above for $N=2,\ldots,7$) 
1050: that the Gill-Massar trace is $2 N -1$ in the 
1051: pure state limit might prove helpful. But since the (1,1)-entry of the metric
1052: tensor for any monotone metric (\ref{bwo}) 
1053: is always simply ${1 \over 1-r^2}$, it would 
1054: apparently be necessary to have some {\it asymptotic} convergence to this 
1055: expression, being that the results 
1056: in the computed Fisher information matrices 
1057: (\ref{diagn=4}) and (\ref{diagn=6}) 
1058: for $N=4$ and 6 (and presumably for
1059: arbitrary even $N$) contain polynomials in $r$ in their numerators, and not
1060: simply a constant term.
1061: In Fig.~\ref{11entry} we plot the (1,1)-entries divided by
1062: $N$ of the computed Fisher
1063: information matrices, in spherical coordinates, for $N=2,4$ and 6.
1064: \begin{figure}
1065: \centerline{\psfig{figure=11entry.eps}}
1066: \caption{(1,1)-entries divided by $N$ of the 
1067: computed diagonal Fisher information matrices (\ref{sPh}), (\ref{diagn=4}) 
1068: and (\ref{diagn=6}) for $N=2,4$ and 6, respectively. The value at $r=.9$ is 
1069: greatest for $N=6$ and least for $N=2$.}
1070: \label{11entry}
1071: \end{figure}
1072: \subsubsection{{\it Modified} Gill-Massar traces based on the 
1073: Yuen-Lax (maximal monotone) and quasi-Bures  information matrices}
1074: In sec.~\ref{relations}, we defined the Gill-Massar trace as the trace of the
1075: product of the inverse of the quantum {\it Helstrom} information matrix
1076: and the Fisher information matrices we had  computed 
1077: (sec.~\ref{omer}) based on the optimal
1078: (in terms of {\it fidelity}) measurements of Vidal {\it et al} \cite{vidal} 
for $N=2,\ldots,7$. The quantum
Helstrom information matrix corresponds to the use of the {\it minimal}
monotone
(Bures) metric, as well as the {\it symmetric} logarithmic derivative. Here,
we replace this with the {\it maximal} monotone metric, corresponding
1084: to the {\it right} logarithmic derivative \cite[eq. (4.27)]{helstrom}, 
1085: associated with Yuen and Lax \cite{yuen}. 
1086: This can be accomplished by using
1087: $g(s) = {(1+s)/ (2 s)}$ in 
1088: the (diagonal/orthogonal) 
metric tensor (\ref{bwo}) rather than $g(s) = {2 \over 1+s}$ (which gives the
1090: quantum Helstrom information matrix).
1091: Then, we find that in the pure state limit ($r \rightarrow 1$) the values of
1092: the so-modified traces are  exactly $N-1$ --- rather than $2 N - 1$ --- for 
1093: all our six cases
1094: $N=2,\ldots,7$.
1095: For $N=2$, this is
1096: \begin{equation}
1097: \tilde{GM}_{2} = 3 - 2 r^2,
1098: \end{equation}
1099: for $N=4$,
1100: \begin{equation}
1101: \tilde{GM}_{4} = {1 \over 12} (87 - 61 r^2 + 10 r^4),
1102: \end{equation}
1103: and for $N=6$,
1104: \begin{equation}
1105: \tilde{GM}_{6} = {1 \over 120} (1425 -1070 r^2 + 307 r^4 - 62 r^6).
1106: \end{equation}
1107: These three functions, scaled by their value at $r=1$, that is $N-1$, are
1108: plotted in Fig.~\ref{yleps}.
1109: \begin{figure}
1110: \centerline{\psfig{figure=YLtrace.eps}}
1111: \caption{Traces --- scaled by $N - 1$ --- for $N=2,4$ and 6 
1112: based on the Yuen-Lax/maximal monotone metric analysis. 
1113: The $y$-intercepts for $r=0$ 
1114: increase with $N$.}
1115: \label{yleps}
1116: \end{figure}
1117: The traces $\tilde{GM}_{N}$ for $N=3$ and 7  are (three-line) 
1118: functions of not only $r$, as previously,
1119:  but of 
1120: $\theta$ and $\phi$ as well. 
1121: For $N=5$, we have 
1122: \begin{equation}
1123: \tilde{GM}_{5} = 
1124: {1 \over 16} ( 147 - 96 r^2 + 13 r^4 + {10 (r^2-1)^3 \over r^2 + r^2 
1125: \cos{2 \theta} -2}).
1126: \end{equation}
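The stated pure state limits can be confirmed by setting $r=1$ in the displayed expressions (a direct substitution; for $N=5$ the $\theta$-dependent term vanishes through the factor $(r^2-1)^3$, provided $\sin{\theta} \neq 0$):
\begin{eqnarray*}
\tilde{GM}_{2}(1) & = & 3 - 2 = 1, \\
\tilde{GM}_{4}(1) & = & {87 - 61 + 10 \over 12} = 3, \\
\tilde{GM}_{5}(1) & = & {147 - 96 + 13 \over 16} = 4, \\
\tilde{GM}_{6}(1) & = & {1425 - 1070 + 307 - 62 \over 120} = 5,
\end{eqnarray*}
each equal to $N-1$.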
In the fully mixed state limit ($r \rightarrow 0$),
the values of the traces for $N=2,\ldots,7$ are 3, 5, 7.25, 9.5, 11.875 and 14.25, respectively.
1129: 
1130: If we alternatively employ the quasi-Bures metric, using
1131: $g(s) = e s^{{s \over 1-s}}$, then, in the pure state limit for 
1132: $N=2,4$ and 6 we get traces equalling $(4 +e)/e \approx 2.47152$, 
1133: $3 + 8/e \approx 5.94304 $ and $5 + 12 / e \approx 9.41455$,
1134: respectively.
1135: (These results are intermediate, then, between those for the minimal 
1136: and maximal monotone metrics.) For $r=0$, the corresponding outcomes
1137: are the same as in the two situations above. In Fig.~\ref{qbtrace}, we plot
1138: these three traces scaled by the noted values at $r=1$.
1139: \begin{figure}
1140: \centerline{\psfig{figure=qbtrace.eps}}
1141: \caption{Traces --- scaled by their 
1142: values at $r=1$ --- for $N=2,4$ and 6 
1143: based on the quasi-Bures 
1144: monotone metric analysis. The $y$-intercepts for $r=0$ increase with $N$.}
1145: \label{qbtrace}
1146: \end{figure}
1147: The curves for $N=2$ and 4 intersect at $r=.395121$.
1148: 
1149: \section{Concluding Remarks}
1150: 
1151: We have explicitly constructed the 
1152: $3 \times 3$ Fisher information matrices for the optimal
1153: measurements of Vidal {\it et al} \cite{vidal} for 
1154: $N=2,\ldots,7$, 
1155: found  that they are tightly
1156: bounded by $(N-1) H_{q}$ near the pure state boundary, and
1157: conjectured that they converge from above to 
1158: ${N \over 2}$ times the identity matrix at the fully mixed state ($r=0$).
1159: As our main finding, we have uncovered (sec.~\ref{relations}) an 
1160: interesting (less strict)
1161:  analogue for non-separable
1162: measurements of a ``new quantum Cram\'er-Rao inequality'' of 
1163: Gill and Massar \cite[eq. (27)]{gill}. The possibility of extending it to
1164: the cases $N>7$ appears to be a challenging problem.
1165: Also, the development of optimal measurement schemes for multiple copies of
1166: $m$-level systems, $m>2$, and the subsequent evaluation of their Fisher
1167: information characteristics, merits investigation
1168: (cf. \cite{acin}). In this regard, we have presented in sec.~\ref{uue} 
1169: additional evidence --- for an optimal 
1170: measurement we devised for the case $m=3$, $N=2$ --- that 
1171: has led us to the conjecture that for optimal non-separable measurements of 
1172: $N$ copies of $m$-level quantum systems, the ``Gill-Massar trace'' 
1173: equals $(2 N-1) (m-1)$ in the pure state limit for {\it all} $m$ and $N$.
1174: 
1175: Additionally,
1176: it would be of interest to study the Fisher information matrices associated
1177: with 
1178: optimal measurements based on  {\it continuous} 
1179: oproms \cite[p. 386]{peres} \cite{slaterperes}.
1180: The relation between optimal measurements (sec.~\ref{nce}) and 
1181: universal quantum 
coding (sec.~\ref{kssec}) --- both
1183: involving averaging with respect to isotropic prior
1184: probability distributions by projecting onto total spin 
1185: eigenstates --- appears to be worthy of 
1186: further consideration. (Fischer and Freyberger recently compared 
1187: the use of single adaptive measurements --- which possess certain
1188: practical advantages --- with the use of non-separable ones
1189: \cite{fischer}.)
1190: 
1191: 
1192: We have also investigated here several related topics, all pertaining to the
1193: information-theoretic properties of the two-level quantum systems. 
1194: We have posed  the problem of constructing an operator-valued
1195: probability measure (oprom) for 
1196: the smallest number possible of copies $N \geq 4$ 
1197:  which yields the quadrinomial probability
1198: distribution (\ref{qpd}), the Fisher information matrix 
1199:  for which is
simply four times the quantum (Helstrom) information matrix (\ref{sPh}).
1201: Also, we discuss in sec.~\ref{ssecn7}
1202:  what appears to be an intriguing connection between our results
1203: and the work of Frieden \cite{frieden} concerning differences between 
1204: classical and quantum information.
1205: 
1206: 
1207: \acknowledgments
1208: 
1209: I would like to express appreciation to the Institute for Theoretical Physics
1210: for computational support in this research,  as well as 
1211: to M. J. W. Hall, G. Vidal,
1212: R. Tarrach, 
1213: R. Gill and B. R. Frieden for various forms of assistance and advice.
1214: 
1215: \begin{references}
1216: \bibitem{vidal} G. Vidal, J. I. Latorre, P. Pascual, and R. Tarrach,
1217: Phys. Rev. A 60, 126 (1999).
1218: \bibitem{gill} R. D. Gill and S. Massar, 
1219: Phys. Rev. A 61, 042312/1-16 (2000).
1220: \bibitem{fischer} D. G. Fischer and M. Freyberger, {\it Estimating Mixed
1221: Quantum States}, quant-ph/0005090.
1222: \bibitem{helstrom} C. W. Helstrom,
1223:  {\it Quantum Detection and Estimation Theory},
1224: (Academic, New York, 1976).
1225: \bibitem{gill2} R. Gill, {\it Asymptotics in Quantum Statistics},
(Mathematical Institute, University of Utrecht, 1999), available at
http://math.uu.nl/people/gill/Preprints/paper.ps.gz.
1228: \bibitem{busch} P. Busch, G. Cassinelli, and P. J. Lahti,
1229: Revs. Math. Phys. 7, 1105 (1995).
1230: \bibitem{tarvid} R. Tarrach and G. Vidal, Phys. Rev. A 60, R3339 (1999).
1231: \bibitem{vidal2} J. I. Latorre, P. Pascual, and R. Tarrach,
1232: Phys. Rev. Lett. 81, 1351 (1998).
1233: \bibitem{acin} A. Ac\'in, J. I. Latorre, and P. Pascual, Phys. Rev. A
1234: 61, 022113/1-7 (2000).
1235: \bibitem{uhlmann} A. Uhlmann, Rep. Math. Phys. 9, 273 (1976).
1236: \bibitem{jozsa} R. Jozsa, J. Mod. Opt. 41, 2315 (1994).
1237: \bibitem{petzsudar} D. Petz and C. Sud\'ar, J. Math. Phys. 37, 2662 (1996).
1238: \bibitem{bc} S. L. Braunstein and C. M. Caves, Phys. Rev. Lett.
1239: 72, 3439 (1994).
1240: \bibitem{paulpla} P. B. Slater, Phys. Lett. A 247, 1 (1998).
1241: \bibitem{bm} S. L. Braunstein and G. J. Milburn, Phys. Rev. A 51, 
1242: 1820 (1995).
1243: \bibitem{belt} E. G. Beltrametti and G. Cassinelli, {\it The Logic of Quantum
1244: Mechanics}, (Addison-Wesley, Reading, 1981).
1245: \bibitem{slatjmp} P. B. Slater, J. Math. Phys. 37, 2682 (1996).
\bibitem{cb1} B. S. Clarke and A. R. Barron, Trans. IEEE Info. Th. 36, 453 (1990).
1247: \bibitem{kratt} C. Krattenthaler and P. B. Slater, Trans. IEEE Info. Th. 46,
1248: 801 (2000).
1249: \bibitem{kratt2} H. Grosse, C. Krattenthaler, and P. B. Slater,
1250: {\it Asymptotic Redundancies for Universal Quantum Coding. II}
1251: (in preparation).
1252: \bibitem{jozsa2} R. Jozsa, M. Horodecki, P. Horodecki, and R. Horodecki,
1253: Phys. Rev. Lett. 81, 1714 (1998).
1254: \bibitem{ditt1} J. Dittmann, Sem. Sophus Lie, 3, 73 (1993).
1255: \bibitem{ditt2} J. Dittmann, J. Phys. A 32, 2663 (1999).
1256: \bibitem{barn} O. E. Barndorff-Nielsen and R. D. Gill, J. Phys. A 33, 4481
1257: (2000).
1258: \bibitem{hub1} M. H\"ubner, Phys. Lett. A 163, 239 (1992).
1259: \bibitem{hub2} M. H\"ubner, Phys. Lett. A 179, 226  (1993).
1260: \bibitem{fuji} A. Fujiwara and H. Nagaoka, Phys. Lett. A 201, 119 (1995).
1261: \bibitem{tod} K. P. Tod, Class. Quant. Grav. 9, 1693 (1992).
1262: \bibitem{frieden} B. R. Frieden, {\it Physics from Fisher Information: A
1263: Unification}, (Cambridge University Press, Cambridge, 1999).
1264: \bibitem{murray} M. K. Murray and J. W. Rice, {\it Differential Geometry
1265: and Statistics}, (Chapman and  Hall, London, 1993).
1266: \bibitem{kass} R. E. Kass, Statist. Sci. 4, 188 (1989).
1267: \bibitem{kagan} A. M. Kagan, Probl. Pered. Inform. 12(2), 20 (1976).
1268: \bibitem{chentsov} N. N. Chentsov, in {\it Encyclopaedia of Mathematics},
1269: edited by M. Hazewinkel (Kluwer, Dordrecht, 1990), vol. 5, p. 78.
1270: \bibitem{kagan2} A. M. Kagan and Z. Landsman, Stat. Prob. Lett. 32, 175 (1997).
1271: \bibitem{rao} C. R. Rao, {\it Linear Statistical Inference and Its
1272: Applications} (Wiley, New York, 1973).
1273: \bibitem{bennett} C. H. Bennett, D. P. DiVincenzo, C. A. Fuchs, T. Mor, 
1274: E. Rains, P. W. Shor, J. A. Smolin, and W. K. Wootters,
1275: Phys. Rev. A 59, 1070 (1999).
1276: \bibitem{cox} D. R. Cox and N. Reid, J. R. Statist. Soc. B 49, 1 (1987).
1277: \bibitem{kobayashi} S. Kobayashi and K. Nomizu, {\it Foundations of
1278: Differential Geometry. Vol. 1}, (Interscience, New York, 1963).
1279: \bibitem{les} A. Lesniewski and M. B. Ruskai, J. Math. Phys. 40, 5702 (1999).
1280: \bibitem{FUJI} A. Fujiwara and H. Nagaoka, J. Math. Phys. 40, 4227 (1999).
1281: \bibitem{cavesv} C. M. Caves and G. J. Milburn, Opt. Commun. 179, 439 (2000).
1282: \bibitem{slatprep} P. B. Slater, {\it Bures Geometry of the Three-Level
1283: Quantum Systems}, quant-ph/0008069.
1284: \bibitem{byrdslater} M. S. Byrd and P. B. Slater, {\it Bures Measures 
1285: over the Spaces of Two and Three-Dimensional Density Matrices}, 
1286: quant-ph/0004055 (to appear in Phys. Lett. A).
1287: \bibitem{venki} V. E. Mkrtchian and V. O. Chaltykian, Opt. Commun.
1288: 63, 239 (1987).
1289: \bibitem{slathall} P. B. Slater, J. Phys. A 32, 8231 (1999).
1290: \bibitem{yuen} H. P. Yuen and M. Lax, Trans. IEEE Info. Th. 19, 740 (1973).
1291: \bibitem{peres} A. Peres, {\it Quantum Theory: Concepts and Methods},
1292: (Kluwer, Dordrecht, 1995).
1293: \bibitem{slaterperes} P. B. Slater, J. Math. Phys. 38, 2274 (1997).
1294: \end{references}
1295: 
1296: \listoffigures
1297: \end{document}