0109:gr-qc0109053/bip.tex

1: \documentclass[11pt]{article}

2: \usepackage{fullpage,proof,amssymb,amsmath,epsfig,stmaryrd}

3: \include{macros}

4: \bibliographystyle{alpha}

5: \begin{document}

6:

7: \title{Discrete Quantum Causal Dynamics}

8:

9: \author{

10: \begin{tabular}[t]{c}

11:         Richard F. Blute\thanks{

12: Research supported in part

13: by NSERC.

14: }\\Ivan T. Ivanov\\

15:         {\small Department of Mathematics}\\

16:         {\small \rule{0mm}{3mm} and Statistics}\\

17:         {\small University of Ottawa}\\

18:         {\small Ottawa, Ontario, Canada}\\

19:         \texttt{rblute,iti@mathstat.uottawa.ca}\\

20: \end{tabular}

21: \and

22: \begin{tabular}[t]{c}

23:         Prakash Panangaden\footnotemark[1]\\

24:         {\small School of Computer Science}\\

25:         {\small McGill University}\\

26:         {\small Montr\'eal, Qu\'ebec, Canada}\\

27:         \texttt{prakash@cs.mcgill.ca}

28:     \end{tabular}

29: }

30:

31: \maketitle

32:

33: \renewcommand{\girpar}{\bindnasrepma}

34:

35:

36: \begin{abstract}  We give a mathematical framework to

37: describe the evolution of an open quantum systems subjected

38: to finitely many interactions with classical apparatuses.  The

39: systems in question may be composed of distinct, spatially

40: separated subsystems which evolve independently but may also

41: interact. This evolution, driven both by unitary operators

42: and measurements, is coded in a precise mathematical

43: structure in such a way that the crucial properties of

44: causality, covariance and entanglement are faithfully

45: represented.  We show how our framework may be expressed

46: using the language of (poly)categories and functors.

47: Remarkably, important physical consequences - such as

48: covariance - follow directly from the functoriality of our

49: axioms.

50:

51: We establish strong links between the physical

52: picture we propose and linear logic. Specifically we show

53: that the refined logical connectives of linear logic can be

54: used to describe the entanglements of subsystems in a

55: precise way. Furthermore, we show that there is a precise

56: correspondence between the evolution of a given

57: system and deductions in a certain formal logical system

58: based on the rules of linear logic.

59:

60: This framework generalizes and enriches both

61: causal posets and the histories approach to quantum

62: mechanics.

63: \end{abstract}

64:

65:

66:

67: \section{Introduction}

68:

69: We propose a uniform scheme for describing a quantum system,

70: interacting with a network of classical objects.  The

71: system in question may be composed of distinct spatially

72: separated subsystems which evolve independently, but may

73: also interact with each other at various points as well as

74: with the classical objects.  When analyzing physical

75: laboratory experiments on quantum systems, we frequently

76: abstract away from the concrete experimental setup and from

77: the particular details of the machinery involved.  What we

78: usually keep is the description of the quantum system - and

79: its spatially separated subsystems - in terms of wave

80: functions or density matrices and unitary operators as well

81: as the changes of the quantum system induced by the

82: interactions with classical devices. Crucial properties of

83: the evolution such as the causal ordering, covariance of the

84: description for different observers and quantum entanglement

85: between distinct subsystems should be completely reflected

86: in any such description.

87:

88: The basis of our representation is the graph of events and

89: causal links between them.  An event could be one of the

90: following: a unitary evolution of some subsystem, an

91: interaction of a subsystem with a classical device (a

92: measurement) or perhaps just the coming together or

93: splitting apart of several spatially separated  subsystems.

94: Events will be depicted as vertices of a directed graph.  The

95: edges of the graph will represent the causal relations

96: between the different events.  The vertices of the graph are

97: then naturally labelled with operators representing the

98: corresponding processes.

99:

100: Of course, the processes of unitary evolution and

101: measurement take a certain amount of time; but we are only

102: interested in the causal relations between such events and

103: this allows us to consider them as point-like vertices on the

104: graph.  Thus we are thinking of the duration between events

105: as being longer than the duration of an event so that no

106: causal information is lost when we represent interactions as

107: events.

108:

109: The structure described thus far reflects the kinematical

110: properties of the quantum system.  To describe the dynamics

111: we need a composition of the operators assigned to the

112: vertices of the graph.  This composition is most conveniently

113: described in terms of a composition in a specific

114: mathematical structure, namely a polycategory generated by

115: the graph.  The whole description could then be concisely

116: summarized by noticing that we have a functor from this

117: polycategory to the polycategory of Hilbert spaces.   This

118: functor captures the dynamics of the system.

119:

120: Causal relations are made explicit and we prove that no

121: influences breaking causality arise in our scheme.  The

122: possible entanglement between spatially separated subsystems

123: - represented by distinct edges of the graph - is also

124: accounted for.  Thus, our framework allows one to represent

125: locality of interaction - i.e.\ causal influences do not

126: propagate outside the causal ``cone'' - while allowing the

127: expression of nonlocal correlations which occur when one has

128: quantum entanglement.  The tension between causal evolution

129: and quantum entanglement is resolved.

130:

131: The categorical framework that we use is intimately

132: connected with linear logic. Linear logic was

133: originally introduced~\cite{Girard87} as a logic intended

134: for a finer analysis of the way resources are consumed

135: during the course of a proof.  This logic has had a

136: significant impact on the theory of computation as well as

137: such far-flung areas as linguistics and pure mathematics.

138: In the present paper, the connectives of linear logic will

139: be used to express the existence or nonexistence of nonlocal

140: correlations. What we will introduce is a deductive system

141: based on the graph-theoretic structure of the system

142: that precisely picks out the spatial slices of physical

143: interest. Thus evolution of the system corresponds to

144: logical deductions within this deductive system.

145: For an expository introduction

146: to linear logic, see the review by Girard~\cite{Girard95} or

147: the brief exposition in the appendix.

148:

149: \subsection{Relation to other work}

150:

151: Next we outline the relations of our proposal to some

152: recent approaches to quantum mechanics and quantum gravity.

153:

154: \subsubsection{Consistent and decoherent histories}

155:

156: The \emph{consistent histories} approach to quantum

157: mechanics due to Griffiths and Omn\`es

158: \cite{Griffiths96,Omnes94} was formulated with the aim of

159: shedding new light on the conceptual difficulties of the

160: theory.  A closely related proposal with different

161: motivation is the \emph{decoherent histories} approach to

162: quantum cosmology of Gell-Mann and Hartle

163: \cite{Gell-Mann93}.  The basic ingredient in both approaches

164: is the notion of a \emph{history} of the quantum system

165: described by a sequence of projection operators in the

166: Hilbert space of the system, for a succession of times.  The

167: goal of quantum mechanics is to determine the probability of

168: an event or a sequence of events, thus one might hope to

169: assign probabilities to the histories of the quantum

170: system.  In order for the probabilities to be additive in

171: the usual sense, the histories have to be mutually

172: noninterfering.  Sets of histories obeying this condition

173: are selected with the use of a special bilinear form on

174: histories - the decoherence functional.

175:

176: A particular history is mathematically represented as a

177: linearly ordered sequence of projection operators in the

178: Hilbert space of the quantum mechanical system.  But the

179: linear causal ordering of the events in a history is too

180: restrictive in many experimental situations, in particular

181: when analyzing spatially separated entangled quantum

182: systems.  This issue is even more pressing for quantum

183: cosmology considerations.  An application of the histories

184: approach to quantum field theory on a curved space-time

185: \cite{Blencowe91} must assume the existence of a globally

186: hyperbolic manifold, and thus via the associated foliation,

187: a linear ordering of the histories of the quantum field.

188:

189: Our proposal for describing the evolution of an open quantum

190: system can be considered as describing a single history in a

191: set of histories.  The important point is that events are no

192: longer linearly ordered by temporal order but, rather,

193: partially ordered with respect to the causal order.  This

194: allows one to capture the notion of causal evolution in a

195: manifestly covariant fashion.  The consistency/decoherence

196: condition for histories has an immediate generalization for

197: histories described by more general graphs as proposed here.

198:

199: \subsubsection{Causal sets}

200:

201: \emph{Causal sets} form the basis of an approach to quantum

202: gravity mainly advocated by R. Sorkin and collaborators

203: \cite{Bombelli87,Sorkin91}, where the basic idea is to take

204: the notion of causality as the primitive. In classical

205: relativity, the structure of the space-time manifold

206: together with a metric of Lorentzian signature determines

207: the causality relation.  An important observation is that

208: the causal structure is conformally invariant, i.e.\

209: determined by only the conformal equivalence class of the

210: metric and hence more primitive than the metric.  Various

211: proposals for quantum gravity - for example, the twistor

212: program~\cite{Penrose72} - have taken as their point of

213: departure the idea that the causal structure is more

214: fundamental than the metric structure.

215:

216: In the causal sets approach, one takes the point of view

217: that, at the smallest length scales, spacetime is inherently

218: discrete and that the causal structure, the ``light cones'',

219: are fundamental.  This leads naturally to the idea of a

220: partially ordered set (poset for short) where the elements

221: are events and two events are related by causality.  The

222: main interest is in approximating continuous spacetimes with

223: such structures and defining processes that would generate

224: these structures, with a view to an eventual theory of

225: quantum gravity.  Though the aims are rather different the

226: issues connected with causality are closely related.

227:

228: Causal sets are further motivated by the idea that a

229: discrete structure would avoid the singularities that plague

230: physics (both classical and quantum).  The assumption that

231: space-time should be a continuous manifold is one of the

232: ingredients that leads to the problematic singularities of

233: quantum field theory and general relativity.  In the causal

234: sets approach, space-time is a discrete structure, thus

235: possibly avoiding these singularities, the idea being that

236: at the Planck scale, continuous geometry gives way to

237: discrete geometry.

238:

239: One way to think of this is that one approximates a manifold

240: as one ``sprinkles'' more and more points into the causal

241: set in a uniform fashion.  More formally, one would want to

242: obtain a manifold as the categorical limit of a diagram of

243: posets and embeddings \cite{Maclane98}. Applications and

244: extensions of these ideas can be found in papers such as

245: \cite{Markopoulou00,Markopoulou97,Raptis00b}, although this

246: list is by no means exhaustive.  In our approach we are not

247: thinking about generating the spacetime through such

248: limiting processes, but the idea of a causal set is implicit

249: in our work.  For us, a finite causal set is the kinematical

250: framework on which we describe evolution and information

251: flow.

252:

253: \subsubsection{Quantum causal histories}

254:

255: The notion of \emph{quantum causal history} was introduced

256: by Markopoulou in \cite{Markopoulou00}.  One begins with

257: a poset (causal set) and assigns Hilbert spaces to the

258: vertices and evolution operators to sets of edges.  The

259: assignment must satisfy properties analogous to

260: functoriality.  However, within this framework, one is

261: quickly led to violations of causality - as the author

262: herself notes - essentially because the slices used are

263: ``too global.''  She mentions the possibility of working

264: with a dual view.  In fact, in our work, we take such a

265: dualized view as our starting point.  In other words we

266: assign operators representing evolution or measurement to

267: vertices and Hilbert spaces to the edges, in a way

268: satisfying (poly)functoriality.

269:

270: \subsection{The Importance of Categories} A category can be

271: seen as a generalization of a poset in the following sense.

272: A poset merely records that an element $x$ is less than $y$

273: but a category keeps track of the different ways in which

274: $x$ might be less than

275: $y$.  For example, in logic one might consider formulas

276: (denoted by Greek letters like $\phi$, $\psi$ etc.\ ) and

277: the relationship of provability between them.  Thus one

278: would write $\phi\vdash\psi$ to mean that starting from the

279: assumption $\phi$ one can prove $\psi$.  This gives rise to

280: a transitive and reflexive relation; if one considers

281: equivalence classes of formulas (two formulas being

282: equivalent if each can be used to prove the other) we get a

283: poset.  However, if we are interested in distinguishing

284: distinct proofs we need to keep track of the different ways

285: in which $\phi$ can be used to prove $\psi$.  Thus formulas

286: as objects and proofs as morphisms can be organized into a

287: category.

288:

289: In a poset when one writes $x\leq y$ then, depending on the

290: context, one is stating something like the following:

291: \begin{itemize}

292: \item $x$ is less than $y$;

293: \item $x$ precedes $y$;

294: \item $x$ implies $y$.

295: \end{itemize}

296: \noindent or any of several other possibilities.  In a

297: causal set, we have in mind that $x$ causally precedes $y$.

298:

299: In the present work, we are particularly interested in

300: modelling the idea that information can flow from one event

301: to another in a number of different ways, \emph{along

302: different paths or channels}.  We would like to keep track

303: of all these various independent paths.  The structure of a

304: poset is inadequate for achieving this, as we would like to

305: say that $x\leq y$ in several different ways.  This

306: naturally suggests that we pass from posets to more general

307: graphs and eventually to categories.

308:

309: Many recent experiments feature spatially distributed

310: quantum systems. When entangled quantum subsystems come back

311: together in the same spacetime region, the description of the

312: resulting system is causally influenced by all events in the

313: paths of the subsystems.  In particular a past event could

314: influence the future events in several distinct ways through

315: different paths.  Our scheme is well adapted for analyzing

316: experiments featuring spatially separated quantum entangled

317: entities and could be used in the field of quantum

318: information processing to analyze information flow

319: situations.

320:

321: \subsection{Contents of the present paper}

322:

323: Section~\ref{causal} presents the basic ideas of our scheme

324: via an example. Section~\ref{dyn} discusses the basic

325: physical ideas involved.  In the first subsection we review

326: the notions of measurements and interventions. In the next

327: subsection we give the dynamical prescription in a special

328: case and in the final subsections we give the general

329: prescription and prove covariance.  In section~\ref{poly} we

330: review basic facts about polycategories and their

331: construction.  We also describe the polycategory of Hilbert spaces

332: and intervention operators

333: we will be using. In section~\ref{logic} we give a logical

334: presentation of polycategories and establish the connection

335: between our structures and linear logic. A functorial version of

336: our dynamical prescription is then presented.

337: We end with a discussion on further

338: applications of our scheme.

339: It is our hope that this paper will

340: interest members of several different communities within

341: mathematics, logic and physics.

342:

343: \section{Causal information flow via

344: examples}\label{causal}

345:

346: Consider a quantum system evolving in space-time while being

347: subjected to interactions with classical observers at a

348: number of points.  The causal and spatio-temporal relations

349: in the system will be represented by a directed acyclic

350: graph (hereafter called a \emph{dag}).  The vertices of the

351: graph - which will be drawn as boxes - represent the events

352: in the evolution of the system.  An event could be a

353: measurement by a classical observer, a local unitary

354: evolution or just a splitting of a subsystem into several

355: spatially separated subsystems, which however could still

356: share an entangled common state.  The propagation of the

357: different subsystems will be indicated by the edges of the

358: graph.

359:

360: There are a number of causal relations between edges and

361: vertices.  A vertex $v_1$ is said to \emph{immediately

362: precede} $v_2$ if there is a (directed) edge from $v_1$ to

363: $v_2$.  We write $v_1 \leq v_2$ for the reflexive transitive

364: closure of immediate precedence; thus $v \leq v$ always

365: holds and $v_1 \leq v_2$ means that there is a

366: \emph{directed path} from $v_1$ to $v_2$ (possibly of length

367: zero).  When $v_1 \leq v_2$ we sometimes say $v_1$ is ``to

368: the past of'' $v_2$ and dually ``$v_2$ is to the future of

369: $v_1$.''  When we draw a poset we typically leave out the

370: self-loops and only draw the minimal number of edges needed

371: to infer all the others; the so-called ``Hasse diagram'' of

372: the poset. We note that  our graphs will have initial and

373: final  ``half-edges'', i.e. edges with only one endpoint.

374: Physically we have some quantum states incoming (or

375: ``prepared'') followed by some interactions and some

376: outgoing state.

377:

378: The relation between vertices induces a causal relation

379: between edges.  We say that an edge $e_1$ is to the past of

380: another edge $e_2$ if the terminal vertex of $e_1$, say

381: $v_1$ and the initial vertex of $e_2$, say $v_2$, satisfy

382: $v_1 \leq v_2$.  Note that we could have $v_1 = v_2$.  An

383: initial edge is not to the future of any edge, nor is a

384: final edge to the past of any other edge.  If two edges are

385: not causally related, we say that they are ``spacelike

386: separated'' or acausal.  Note that two spacelike separated

387: edges could share a common terminal vertex or a common

388: initial vertex, (but since we have a graph, not both).  A

389: \emph{space-like slice} is defined as a set of pairwise

390: acausal edges.  Henceforth, whenever we say ``slice'' we

391: will always mean ``spacelike slice.''  Note that the initial

392: (or final) edges form a spacelike slice.  We call this the

393: \emph{initial (final) slice}.

394:

395: For example for the graph of Figure~\ref{figureN} the set of

396: edges $\{e_c, e_d, e_e\}$ form a space-like slice.  Another

397: example is the set $\{e_f, e_d, e_e\}$.  The edges $e_a$ and

398: $e_b$ form the initial slice.   The edges $e_a,e_b,e_f$ and

399: $e_g$ are half-edges, with $e_a$ and $e_b$ initial, and

400: $e_f$ and $e_g$ final.

401:

402: \begin{figure}[htb]

403: \begin{center}

404: \input{figure1}

405: \end{center}

406: \caption{}

407: \label{figureN}

408: \end{figure}

409:

410: Associated with any edge $e_i$ is an observer who has access

411: to a subsystem of the complete quantum system. Thus the

412: edges represent local information. Each edge $e_i$ is

413: assigned a density matrix $\rho_i$ in a Hilbert space

414: $\hi_i$\footnote{Throughout the paper, we assume that the

415: graph and the dimensions of all Hilbert spaces are

416: finite.}.  The density matrix $\rho_i$ describes the

417: knowledge about the quantum system available to the local

418: observer at the edge $e_i$.  More generally density matrices

419: will be associated to space-like slices.  For a space-like

420: slice consisting of edges $\{e_{i_1}, \dots e_{i_p}\}$, the

421: assigned density matrix will be denoted $\rho_{i_1, \dots

422: i_p}$.  This density matrix describes the subsystem of the

423: whole quantum system for that space-like slice.  Every

424: space-like slice has also a Hilbert space which is the

425: tensor product of the Hilbert spaces of the edges forming

426: the slice.  However the density matrix associated with the

427: slice is not in general a tensor product of the density

428: matrices on the edges. If it were, we could not capture

429: non-local quantum correlations.

430:

431: The graph of Figure~\ref{figureN}, represents a quantum

432: system $\mathit{Q}$ which starts evolving from a state in

433: which $\mathit{Q}$ consists of two spatially separated

434: subsystems $\mathit{Q}_a$ and $\mathit{Q}_b$ described by

435: density matrices $\rho_a$ and $\rho_b$ respectively, in

436: Hilbert spaces

437: ${\hi}_a$ and ${\hi}_b$.  The initial edges $e_a$ and $e_b$

438: form the initial slice in this simple system.  We will

439: follow the convention that if the initial slice consists of

440: several edges, the initial state of the whole system is a

441: tensor product state, i.e.  the subsystems are not

442: entangled. For the above example, $\psi_{init} = \psi_a

443: \otimes \psi_b$ and $\rho_a =

444: \proj{\psi_a}$ and $\rho_b = \proj{\psi_b}$. Entangled

445: subsystems on distinct edges will always have at least one

446: event in the common past.  Thus we always explicitly

447: represent the interaction which caused the entanglement.

448:

449: Each vertex $v_i$ of the graph is labelled with an operator

450: $T_i$ which describes the process taking place at the

451: corresponding event.  The operator $T_i$ at a given event

452: $v_i$ takes density matrices on the tensor product of

453: Hilbert spaces living on the incoming edges at $v_i$ to

454: density matrices on the tensor product Hilbert space of

455: outgoing edges.  The process at a vertex could be an

456: \emph{intervention}\footnote{Interventions are generalized

457: measurements where a quantum subsystem could be

458: discarded~\cite{Peres00}.  This will be discussed more fully

459: below.} corresponding to a positive operator-valued measure

460: (POVM)~\cite{Nielsen00,Peres95} or a unitary

461: transformation.  Or instead of an external or unitary action

462: there could be several quantum subsystems that come together

463: and then split apart, possibly in a different way.  We will

464: consider this last case as a particular instance of a

465: unitary evolution with identity evolution operator. As a

466: simple example, in the case of an event corresponding to

467: unitary evolution by a unitary operator

468: $U$, we have the usual expression:

469:

470: \begin{equation}\label{inop}

471: \rho^{in} \ \mapsto \ \rho^{out} \ = \

472: U \rho^{in} U^\dagger

473: \end{equation}

474:

475: The general expression for an operator associated to an

476: event will be discussed fully in the next section, see

477: equation (\ref{inap}).

478:

479: Here we will discuss some of the conditions such a dynamical

480: scheme has to satisfy in order to reflect causality and

481: other physical properties of the quantum system.  Causality

482: is the condition that the density matrix on a given edge

483: should not depend on the actions performed at vertices which

484: are acausal to this edge or are in its future.  For example,

485: referring back to Figure~\ref{figureN}, we would like any

486: quantum evolution rule to say that the density matrix at

487: $e_g$ is unaffected by the intervention at $v_3$ or the

488: density matrix at $e_f$ is unaffected by the intervention at

489: $v_2$.  A general unitary evolution between the states of

490: two space-like slices is easily shown to violate this

491: condition.  Therefore we need to incorporate some sort of

492: locality condition into the evolution scheme.

493:

494: It is not hard to formulate such an evolution scheme.  For

495: example, one could work with the dual picture and have

496: evolution occur along edges with density matrices at the

497: vertices.  It is not hard to formulate rules which would

498: enforce causality properly in such a framework.

499: Unfortunately this rules out quantum correlations across

500: spatially separated subsystems. Thus, the evolution scheme

501: cannot be too local because entangled subsystems of the

502: quantum system could fly apart and later come together at a

503: vertex.

504:

505: Consider the system shown in Figure~\ref{figureD}.

506: \begin{figure}[htb]

507: \begin{center}

508: \input{figure2}

509: \end{center}

510: \caption{}

511: \label{figureD}

512: \end{figure} The quantum system represented in this graph is

513: as follows.  The system is prepared in a state $\psi_a$ as

514: indicated by the density matrix $\rho_a =

515: \proj{\psi_a}$ on the incoming edge.  At the vertex $v_1$

516: the system splits into two spatially separated subsystems on

517: the edges $e_b$ and $e_c$ which, in general, are still

518: described by a global entangled state.  The local

519: transformations $T_2$ and $T_3$ will, in general, preserve

520: the entanglement and the global state will be still

521: entangled on the space-like slice

522: $\{e_d, e_e\}$.  The two subsystems come together at the

523: vertex $v_4$.  The two local density matrices $\rho_d$ and

524: $\rho_e$ are not sufficient to reconstruct the entangled

525: state of the system described by $\rho_f$.  The off-diagonal

526: terms of $\rho_f$ are not reflected in the local density

527: matrices, $\rho_d$ and $\rho_e$.  We need to include

528: information about the history of the state on the space-like

529: slice $\{e_d, e_e\}$ in order to reconstruct the global

530: state.  One possibility is to work with global space-like

531: slices, and show that the scheme is generally covariant in

532: the sense of being slice-independent.  In our functorial

533: approach, certain preferred (not necessarily global)

534: spacelike slices account for all entanglement.

535:

536: The rules for constructing and labeling the graphs given so

537: far reflect the kinematics of the quantum system.

538: Specifying the dynamics amounts to a prescription for how to

539: obtain the density matrices on every edge from the density

540: matrix on the initial slice and the operators at the

541: vertices of the graph.  This prescription will be given

542: below in section~\ref{dyn}.

543:

544: \section{Dynamics on Graphs}\label{dyn}

545: \subsection{Measurements and Interventions} We begin with

546: some standard material on density matrices and positive

547: operator-valued measures (POVMs)

548: \cite{Nielsen00,Preskill98}, before introducing Peres'

549: notion of

550: \emph{intervention operator} \cite{Peres00}.

551:

552: Density matrices are used for describing quantum subsystems

553: which are part of larger quantum systems.

554: In particular a local observer who has

555: access only to a subsystem $Q_1$ of a quantum system $Q$

556: will associate a density matrix to his subsystem.  Let

557: $\hi$ be the Hilbert space of state vectors of $Q$.

558:

559: If the overall system $Q$ is in a state described by a wave

560: function

561: $\ket{\psi} \in \hi$, then its density matrix is the

562: operator $\rho =

563: \proj{\psi} \in End(\hi)$.  Since $Q$ can be decomposed into

564: subsystems, its Hilbert space is a tensor product $\hi =

565: \hi_{1} \otimes \hi_{2}$ of the Hilbert space $\hi_{1}$ of

566: the subsystem $Q_1$ and the Hilbert space

567: $\hi_{2}$ describing the remaining degrees of freedom.  The

568: density matrix of the subsystem $Q_1$ is then given by a

569: partial trace with respect the Hilbert space $\hi_{2}$:

570: $\rho_{1} = Tr^{\hi_2} \rho$.  If $\hi$ is any Hilbert

571: space, then the space of all density matrices will be denoted

572: $\mathsf{DM}(\hi)$.

573:

574:

575: The \emph{measurement} of a property of a quantum system

576: involves interaction with a classical apparatus.  When a

577: classical apparatus measures an observable of a quantum

578: subsystem sitting inside a larger system the appropriate

579: mathematical formalism for such generalized measurement is

580: that of \emph{positive operator-valued measure} or POVM. Let

581: the possible outcomes of the measurement be labelled by the

582: letter $\mu

583: \in \{1 \dots N\}$.  The measurement involves interaction

584: between the apparatus and the quantum system, described by a

585: unitary operator.  The classical apparatus has a preferred

586: basis of states indexed by $\mu$.  After the measurement,

587: the apparatus appears in one these preferred states.  Since

588: we are only interested in describing our quantum subsystem

589: $Q_1$, we trace out all the remaining degrees of freedom.

590: Effectively to every outcome

591: $\mu$ is associated an operator $F_{\mu}$.  The density

592: matrix of $Q_1$ after the measurement with outcome $\mu$ is

593: given by

594: \begin{equation}

595: \rho'_{\mu} = \frac{1}{p_{\mu}} F_{\mu} \rho

596: F^{\dagger}_{\mu}

597: \end{equation} where $\rho$ is the density matrix before the

598: measurement and $p_{\mu}$ is a  numerical factor normalizing

599: the resulting density matrix to unit trace.   Consider the

600: family of positive operators $E_{\mu} = F^{\dagger}_{\mu}

601: F_{\mu}

602: $.  For a generalized measurement these have to satisfy the

603: condition

604: $\sum_{\mu} E_{\mu} = I$.  The probability $p_{\mu}$ for

605: obtaining a  measurement result labelled by $\mu$ is then

606: given by: $p_{\mu} =  Tr E_{\mu} \rho$.  This justifies the

607: name POVM.

608:

609: Even more general measurement processes could be considered

610: if the observer  discards part of the quantum system during

611: the process of measurement.   The appropriate mathematical

612: formalism for describing these generalized  measurements is

613: that of {\it intervention operators} \cite{Peres00}.   In

614: the process of measurement, the density matrix changes

615: according to:

616: \begin{equation}\label{inap}

617: \rho'_{\mu} \ = \

618: \frac{1}{p_{\mu}} \sum_m A_{\mu m} \ \rho \ A_{\mu

619: m}^{\dagger}

620: \end{equation} The families of maps $A_{\mu m}$ now act in

621: general from one Hilbert space to another, i.e for fixed

622: $\mu$ and $m$ they correspond to rectangular matrices.

623:

624: The label $\mu$ again distinguishes the set of possible

625: outcomes and the letter $m$ labels the degrees of freedom

626: discarded during this generalized measurement.  Since the maps

627: $A_{\mu m}$ come from measurements realized by unitary

628: operator on some larger Hilbert space they again satisfy a

629: completeness condition: $\sum_{\mu m} A^{\dagger}_{\mu m}

630: A_{\mu m} = I$, where $I$ is the identity operator in the

631: appropriate Hilbert space. Notice that if the labels $\mu$

632: and $m$ are absent in (\ref{inap}) the equation describes

633: unitary evolution.  Since the events we consider are

634: generalized measurements or unitary evolutions, equation

635: (\ref{inap}) is the appropriate mathematical representation

636: of those processes in full generality. Such maps

637: (\ref{inap}) on density matrices will be called {\it

638: intervention operators}.

639:

640: \subsection{The dynamical prescription}\label{dyna}

641:

642: We are now ready to start discussing the dynamics of a

643: quantum system represented by a dag $G$.  Dynamics will be

644: described by supposing that we are given a density matrix on

645: the initial spacelike slice, and then giving a prescription

646: for calculating the density matrices of future spacelike

647: slices. In essence, we are propagating the initial data

648: throughout the system.

649:

650: To each vertex $i\in G$ will be assigned an operator $T_i$,

651: and to each edge $e_j$ will be assigned a Hilbert space

652: $\hi_j$.  We note that all incoming (or outgoing) edges of a

653: given vertex are pairwise acausal and thus form a spacelike

654: slice.  Thus there will be a density matrix

655: $\rho_i^{in}$ associated to the slice of the incoming

656: edges.  Then one obtains the density matrix for the slice of

657: the outgoing edges by:

658: \[ \rho_i^{in}= T_i (\rho_i^{out}).\]

659:

660: Notice that more generally, for two acausal vertices, the sets of

661: incoming or outgoing edges are pairwise acausal.  Thus, the

662: associated intervention operators will act on different

663: Hilbert spaces and hence commute.

664:

665: We begin with an illustrative example.  Consider  the dag of

666: Figure~\ref{fig3}.

667: \begin{figure}

668: \begin{center}

669: \epsfig{file=figure3.eps}

670: \end{center}

671: \caption{}

672: \label{fig3}

673: \end{figure} Given the state on the initial slice, the

674: operators at the events propagate the state to the future.

675: In the example of Figure~\ref{fig3} we have:

676: $\rho_c = T_1 (\rho_a)$,\ $\rho_{fde} = T_2 (\rho_b)$.

677: However the next intervention operator $T_3$ must act on the

678: so far undefined density matrix

679: $\rho_{cd}$.  $T_3$ takes density matrices on

680: $\hi_c\ox\hi_d$ to those on

681: $\hi_g\ox\hi_h$.  By extending

682: $T_3$ with the appropriate identity operators, we

683: can view it as a map from

684: $\mathsf{DM}(\hi_c\ox\hi_d\ox\hi_e\ox\hi_f)$ to

685: $\mathsf{DM}(\hi_e\ox\hi_f\ox\hi_g\ox\hi_h)$.  Then we can

686: define the density matrix on another space-like slice,

687: namely $\rho_{fghe} = T_3 (\rho_c \otimes \rho_{fde})$.

688: Similarly $\rho_{fdi} = T_4 (\rho_{fde})$ and so on.

689: Starting from density matrices on the initial edges and

690: using the intervention operators associated with the

691: vertices - extended with identities as needed - we obtain

692: density matrices on specific space-like slices.

693:

694: The above inductive process for propagating density matrices

695: can be applied to any system described by a dag.  However,

696: the procedure only gives the density matrices for certain

697: spacelike slices within the dag.  For example, this

698: procedure does not yet yield a matrix for the slice $de$.

699: To calculate such density matrices, we will also have to

700: make use of the trace operator.  Before extending the

701: procedure to such slices, we first consider those for which

702: the above process is sufficient.  We call these slices

703: \textit{locative}.

704: \begin{defin}{\rm Let $G$ be a dag, and $L$ a slice of $G$.

705: Consider the set of all vertices $V$ which are to the past

706: of some edge in $L$.  Let $I$ be the set of initial edges in

707: the past of $L$.  Consider all paths of maximal length

708: beginning at an element of $I$ and only going through

709: vertices of $V$.  Then $L$ is \emph{locative} if all such

710: paths end with an edge in $L$.}

711: \end{defin} In our example, the locative slices are the

712: following:

713: $$a, b, ab, c, cb, def, adef, cdef, efgh, adfi, cdfi, fghe,

714: fghi, fgk, hej, hij, jk$$ while, for example, $de$ is not

715: locative.  Note that the fact that maximal slices are always

716: locative follows immediately from the definition of locative.

717:

718: We now describe the general rule for calculating the density

719: matrices on locative slices.  Associated with each locative

720: slice $L$ is the set $I$ of initial edges in the past of

721: $L$.  We choose a family of slices that begins with $I$ and

722: ends with $L$ in the following way.  Consider the set of

723: vertices $V$ between the edges in $I$ and the edges in $L$.

724: Because $L$ is locative we know that propagating slices

725: forwards through the vertices in

726: $V$ will reproduce $L$.  Let $M\subset V$ be such that the

727: vertices in $M$ are minimal in $V$ with respect to causal

728: ordering.  We choose arbitrarily any vertex $u$ in $M$,

729: remove the incoming edges of $u$ and add the outgoing edges

730: of $u$ to the set $I$ obtaining a new set of edges $I_1$. It

731: is clear that $I_1$ is spacelike and locative.  Proceeding

732: inductively in this fashion we obtain a sequence of slices

733: $I=I_0,I_1,I_2,\ldots,I_n = L$, where $n$ is the cardinality

734: of $V$.  Of course, this family of slices is far from unique.

735:

736: The dynamics is obtained as follows.  Recall that the states

737: on initial edges are assumed not to be entangled with each

738: other so that one can obtain the density matrix on any set

739: of initial edges, in particular $I$, by a tensor product.

740: Let $\rho_0$ be the density matrix on $I$.  We look at the

741: vertex $u$ that was used to go from $I$ to $I_1$ and apply

742: the intervention operator $T$ assigned to this vertex -

743: possibly augmented with identity operators as in the example

744: above.  Proceeding inductively along the family of slices,

745: we obtain the density matrix $\rho_n$ on $L$.

746:

747: The important point now is that $\rho_n$ does not depend on

748: the choice of slicing used in going from $I$ to $L$.  This

749: can be argued as follows. Suppose we have a locative slice

750: $S$ and two vertices $u$ and $v$ which are both causally

751: minimal above $S$ and acausal with respect to each other.

752: Then we have four slices to consider, $S$, $S_u$, $S_v$ and

753: $S_{uv}$ where by $S_u$ we mean the slice obtained from $S$

754: by removing the incoming edges of $u$ and adding the

755: outgoing edges of $u$ to $S$ and similarly for the others.

756: It is clear, in this case, that the intervention operators

757: assigned to $u$ and to $v$ commute and the density matrix

758: computed on

759: $S_{uv}$ is independent of whether we evolved along the

760: sequence

761: $S\to S_u\to S_{uv}$ or $S\to S_v\to S_{uv}$.  Now when we

762: constructed our slices at each stage we had the choice

763: between different minimal vertices to add to the slice.  But

764: such vertices are clearly pairwise acausal and hence, by the

765: previous argument applied inductively, the evolution

766: prescription is independent of all possible choices.

767:

768: So far we have defined density matrices on locative slices

769: only.  To define density matrices on general spacelike

770: slices we will need to consider partial tracing operations.

771:

772: \subsection{General Slices}

773:

774: Recall that when one has subsystems $Q_1$ and $Q_2$ of a

775: quantum system

776: $Q$, the Hilbert space for $Q$ may be decomposed as

777: $\hi_1\ox\hi_2$ where

778: $\hi_i$ represents $Q_i$.  The density matrix for $Q_1$ is

779: obtained by tracing over $\hi_2$.  To obtain a candidate for

780: the density matrix of a spacelike slice $L$, we should find

781: a locative slice $M$ that contains $L$ and trace over the

782: Hilbert spaces on edges in $M\setminus L$.  Such a locative

783: slice $M$ always exists because maximal spacelike slices are

784: always locative.  $M$ is not unique however, and thus - as

785: we did for locative slices - we must show that different

786: choices give the same result.  To simplify the notation we

787: will discuss the case of density matrices associated with

788: single edges.  The case of a general space-like slice is

789: similar.

790:

791: Consider an edge $e_i$ in a graph $G$.  Let $V_i =

792: \{v_{i_1}, \dots, v_{i_p}\}$ be the set of vertices in the

793: past of $e_i$.  Let $I_i =

794: \{e_{i_1}, \dots, e_{i_q}\}$ be the set of initial edges in

795: the past of

796: $e_i$.  Constructing a sequence of slices by incrementally

797: incorporating the vertices of $V_i$ in a manner similar to

798: what we did in the previous subsection, we get a locative

799: slice $M_i$ containing $e_i$.  Starting with the density

800: matrices on the edges of $I_i$ and applying the operators

801: associated with the vertices of $V_i$, we obtain the density

802: matrix on the locative slice $M_i$.  It is clear that $M_i$

803: is in an evident sense the minimal locative slice containing $e_i$.

804:

805: \begin{defin}{\rm We shall refer to $M_i$ as the \emph{least

806: locative slice} of the edge $e_i$.}

807: \end{defin}

808:

809: Let the least locative slice $M_i$ of an edge $e_i$ consist

810: of edges $\{e_i, e_{j_1}, \dots, e_{j_r}\}$.  The density

811: matrix $\rho_{i,j_1,

812: \dots, j_r}$ on $M_i$ is an element of the space $\ind(\hi_i

813: \otimes

814: \hi_{j_1} \otimes \dots \otimes \hi_{j _r})$.  Let $Tr^{j_1

815: \dots j_r}$ be the partial trace operation $\ind(\hi_i

816: \otimes \hi_{j_1} \otimes \dots

817: \otimes \hi_{j_r}) \rightarrow \ind(\hi_i)$.

818: \begin{defin}[Density matrix associated with an

819: edge]\label{rho}\emph{The density matrix $\rho_i$ at the

820: edge $e_i$ is defined to be:

821: \begin{equation}

822: \rho_i \ = \ Tr^{j_1 \dots j_r} \ \rho_{i,j_1, \dots, j_r} .

823: \end{equation}}

824: \end{defin} If $M_i$ consists of the single edge $e_i$, then

825: no tracing is  done.

826:

827: \begin{rem}

828: The causality condition for evolving the initial data on $G$

829: requires that the density matrix associated with a given

830: edge $e_i$ depends only on the initial data in the past of

831: $e_i$ and only those interventions to the past of $e_i$.

832: The density matrix $\rho_i$ as defined in~\ref{rho}

833: satisfies this requirement by construction and so our

834: prescription for dynamical evolution is causal.

835: \end{rem}

836:

837: In general, the edge $e_i$ is contained in many locative

838: slices and we could just as well have defined $\rho_i$ by

839: tracing over the complimentary degrees of freedom in any of

840: these locative slices.  Independence of the resulting

841: density matrices  is the discrete analog of Lorenz (or

842: general) covariance in our framework.   To clarify the

843: discussion consider the quantum system represented by the

844: graph on Figure~\ref{figureF}.

845:

846: \begin{figure}[htb]

847: \begin{center}

848: \input{figure4}

849: \end{center}

850: \caption{}

851: \label{figureF}

852: \end{figure}

853:

854: Let the initial $\rho_a$ be the density matrix of a

855: maximally entangled state of two spin $1/2$ subsystems:

856: $\rho_a = |\psi_a\rangle\langle\psi_a|$, where $\psi_a =

857: 1/\sqrt{2} \ (\psau \otimes \psbu + \psad \otimes \psbd)$.

858: At the first vertex the two subsystems separate with no

859: classical intervention.  Therefore $\rho_{bc} = \rho_a$.

860: The slice

861: $\{e_b, e_c\}$ is the least locative slice for the edge $e_b$

862: and we can compute the density matrix associated to this

863: edge: $\rho_b = T r^{c} \rho_{bc} = 1/2 \ (\psauk \psaub +

864: \psadk \psadb)$.  Next, let the intervention at the second

865: vertex be a measurement on the corresponding subsystem with

866: the result that the spin was found to be in the state

867: $\psbu$.  The intervention operator is the projection

868: operator on this state of the second subsystem: $T (\rho) =

869: 2\ P_2^{\uparrow} \rho P_2^{\uparrow}$.  We obtain:

870: $\rho_{bd} = T (\rho_{bc}) = (\psauk \otimes \ \psbuk)

871: (\psaub \ \otimes

872: \psbub)$.  If now we attempt to trace $\rho_{bd}$ over the

873: subsystem associated with the edge $e_d$, we will obtain an

874: incorrect result for $\rho_b$, namely $\psauk \psaub$.   The

875: resolution is well known.  Since a classical observer

876: located on the edge $e_b$ is not aware of the result of the

877: intervention at the second vertex, for him the density

878: matrix $\rho_{bd}$ has evolved from $\rho_{bc}$ by an

879: operator $\tilde T$ which includes all possible outcomes of

880: the measurement:

881: $\tilde\rho_{bd} = {\tilde T} (\rho_{bc}) =

882: \sum_{s=\uparrow,\downarrow} P_2^s \rho_{bc} P_2^s$.

883: Tracing out the

884: $d$-subsystem in the expression for $\tilde\rho_{bd}$, we

885: obtain the correct expression for $\rho_b$, namely

886: $\rho_b = 1/2 \ (\psauk \psaub + \psadk \psadb)$.

887:

888: Now we give the general prescription for computing the

889: density matrix on  an edge $e_i$ from an arbitrary locative

890: slice $L$ containing this edge.   We first compute a density

891: matrix $\tilde\rho_L$ for the slice $L$. But note this is

892: not the density matrix of definition~\ref{rho}.

893:

894: This density matrix is computed from the initial data by

895: applying intervention operators for the events in the past

896: of $L$ as before.  But now, we will consider two types of

897: events in the past of $L$, those that are in the past of

898: $e_i$ and those that are not.  For the events that are in

899: the past of the edge $e_i$, we use our regular intervention

900: operators without a summation over the set of possible

901: outcomes: $\rho \mapsto 1/p_{\mu} \sum_m A_{\mu m} \rho

902: A_{\mu m}^\dagger$.  We do not sum over the outcomes in this

903: case precisely because the outcome is in fact known at

904: $e_i$.  For the events that are in the past of the slice $L$

905: but not in the past of the edge $e_i$, we use operators

906: which sum over all possible outcomes: $\rho \mapsto

907: \sum_{\mu m} A_{\mu m} \rho A_{\mu m}^\dagger$. This time,

908: of course, the summation is there because the outcome cannot

909: be known at $e_i$ since these events are not in the past of

910: $e_i$.

911:

912: After we have obtained $\tilde\rho_L$, we trace out those

913: subsystems associated with edges in $L$ except for $e_i$ to

914: obtain the density matrix

915: $\tilde\rho_i$.  This is the density matrix associated with

916: our preferred edge $e_i$, as computed from the slice $L$.

917: The independence of the result on the choice of $L$ is

918: expressed in the following proposition:

919:

920: \begin{prop}[Covariance] Let $e_i$ be an edge in the dag

921: $G$.  The density matrix

922: $\rho_i$ associated  with the edge $e_i$ does not depend on

923: the choice of locative slice used to compute it.

924: \end{prop}

925:

926: \begin{proof} \\ We have already demonstrated that to any

927: edge $e_i$,  there is a unique least locative slice $M_i$

928: containing $e_i$.  Let

929: $\rho_i$ be the density matrix for the edge $e_i$ as

930: computed from the least locative slice and let

931: $\tilde\rho_i$ be the density matrix for the same edge but

932: computed from an arbitrary locative slice, say $L$,

933: containing $e_i$.  We will demonstrate the lemma by showing

934: that

935: $\rho_i=\tilde\rho_i$.

936:

937: First note that $M_i$ being less than $L$ implies that there

938: is a set $V$ of events between $M_i$ and $L$.  The plan is

939: to remove the effect of these events and show that, at each

940: stage, the density matrix is unaffected.  We begin by

941: picking a maximal event, say $k$, with the intervention

942: operator

943: $T_k$.  Since $k$ is maximal and hence acausal with all

944: other maximal elements of $V$, as well as with all the

945: maximal elements to the past of

946: $e_i$, the intervention operator at $k$ commutes with all

947: the intervention operators at the vertices just mentioned.

948: Thus, we can choose the intervention operator $T_k$ to be

949: the outermost, i.e.\ the density matrix

950: $\rho_L$ obtained by propagating to $L$ can be written as

951: \[ \rho_L = T_k(\rho') \] where $\rho'$ is the density

952: matrix on the (locative) slice obtained by removing the

953: edges to the future of $k$ from $L$ and adding the edges to

954: the past of $k$.  Using the explicit general form for an

955: intervention operator,

956: \[ \rho_L =

957: \sum_{\mu,m}A^{(k)}_{\mu,m}\rho'A^{\dagger(k)}_{\mu,m}.\] In

958: order to obtain the density matrix $\tilde\rho_i$, we trace

959: over all Hilbert spaces associated with edges in $L$ except

960: $e_i$.  In particular, we trace over the outgoing edges

961: associated with $k$.  Now we can use the cyclic property of

962: trace and rewrite the expression for $\tilde\rho_i$ as,

963: \[ \tilde\rho_i =

964: Tr(\sum_{\mu,m}A^{\dagger(k)}_{\mu,m}A^{(k)}_{\mu,m}\rho').

965: \] Now we use the identity

966: \[ \sum_{\mu m} A_{\mu m}^\dagger A_{\mu m}= I \] to get

967: \[ \tilde\rho_i = Tr(\rho').\]

968:

969: We have eliminated the effect of the intervention operator at

970: $k$. Proceeding inductively we can peel off the intervention

971: operators associated with the rest of the vertices in $V$,

972: thus

973: \[ \tilde\rho_i = \rho_i.\]

974: \end{proof}

975:

976: A similar argument for the case of a simple system

977: represented by the dag in Figure~\ref{figureD} is contained

978: in~\cite{Peres00b}.

979:

980: \section{Polycategories}\label{poly}

981:

982: We now wish to give a more axiomatic treatment of the above

983: construction. This will require the use of several concepts from

984: category theory and logic, which we now present.

985:

986: We begin by introducing the algebraic or categorical concepts

987: necessary for our formulation of the dynamics of quantum

988: information flow.  While it might seem that these structures

989: are excessively abstract, this level of abstraction has

990: several advantages.  First, it provides a great deal of

991: generality.  Our definition can be applied in many contexts,

992: in particular it may be applied in situations other than the

993: sorts of information flow considered here.  Second, the two

994: crucial properties of interest, causality and covariance,

995: now become straightforward consequences of the functoriality

996: of our axioms.

997:

998: \subsection{Posets, directed graphs and categories}

999:

1000: For comparison, we recall briefly that a poset is a set $P$

1001: together with a binary relation on $P$ (i.e.  a subset of

1002: $P\times P$) denoted $\leq$ that satisfies the properties of

1003: antisymmetry, transitivity and reflexivity.  It is a natural

1004: generalization of this idea to consider \emph{directed

1005: graphs}.  A directed graph is simply a set $D$, the set of

1006: \emph{vertices} or \emph{nodes}, together with a binary

1007: relation $R$ on $D$.   No properties of $R$ are required in

1008: the definition of directed graph.  In particular there is no

1009: implicit transitivity assumed.  A directed graph has a

1010: natural geometric visualization.  One considers the nodes as

1011: points in the plane, and if $x$ and $y$ are nodes with

1012: $\langle x,y\rangle\in R$,  we draw an arrow from $x$ to $y$.

1013:

1014: As already remarked, the nodes of our directed graph will be

1015: events, and arrows will represent propagation from one event

1016: to another.  To avoid temporal loops, we will add the single

1017: requirement that our directed graphs be \emph{acyclic},

1018: i.e.  there does not exist a sequence of edges

1019: $x_1,x_2,\ldots, x_n$ such that for all

1020: $i\in\{1,2,\ldots,n-1\}$, we have

1021: $\langle x_i,x_{i+1}\rangle\in R$,  and $x_1=x_n$.  This of

1022: course corresponds to there being no directed cycles in the

1023: geometric representation.  Hereafter, a directed acyclic

1024: graph will be called a \emph{dag}.  Note that every poset,

1025: considered as a directed graph, is acyclic.  This is a

1026: consequence of transitivity and antisymmetry.  But dags are

1027: a genuine generalization of posets.

1028:

1029: This difference will become more apparent when we consider

1030: the space of

1031: \emph{paths}.  In a poset all the paths are already included

1032: (even if they are not explicitly drawn in the visualization

1033: of the poset).  When we consider paths through a dag we may

1034: have multiple paths between the same two vertices.  These

1035: multiple paths represent different ways that information

1036: flowed from one point to another, thus, we must regard them

1037: as distinct.  Therefore - unlike the case with posets - we

1038: do not just want to regard the resulting structure as a

1039: binary relation, rather, we want to view it as a category.

1040:

1041: It is natural to associate to any dag $D$, indeed to any

1042: directed graph, a category.  We first briefly remind the

1043: reader of the basic definitions. See \cite{Maclane98} for a

1044: more extensive introduction.

1045:

1046: \begin{defin}

1047: \rm{ A \textbf{category} \textsf{C} consists of two

1048: collections, the collection of {\it objects} and the

1049: collection of {\it morphisms}. Each morphism is assigned a

1050: domain and codomain, both being objects of

1051: \textsf{C}.  Typically we write $f\colon A\rarr B$ to mean

1052: $f$ is a morphism with domain $A$ and codomain $B$.  To

1053: every object $A$, we have a special morphism, the identity

1054: $id\colon A\rarr A$.  There is also a composition law which

1055: takes morphisms $f\colon A\rarr B$ and $g\colon B\rarr C$

1056: and returns a morphism $gf\colon A\rarr C$.  All this data

1057: must satisfy several evident equations, as described for

1058: example in

1059: \cite{Maclane98}. We also remind the reader that a

1060: \textbf{functor} is a morphism of categories, i.e.  a

1061: functor, denoted $F\colon C\rarr D$ consists of a function

1062: taking objects $c\in C$ to objects $F(c)\in D$ and taking

1063: morphisms $f\colon c\rarr d$ in $C$ to morphisms $F(f)\colon

1064: F(c)\rarr F(d)$.  A functor must preserve identities and

1065: composition. } \end{defin}

1066:

1067: To each dag $D$, we associate a category $\mathsf{C}(D)$.

1068: This is the category \emph{freely generated} by the dag.

1069: See for example \cite{Maclane98} Chapter 2, for a detailed

1070: description.  The objects of our free category will be the

1071: vertices of $D$.  If $x$ and $y$ are vertices, a morphism

1072: from $x$ to $y$ is a directed path in our dag.  Identities are

1073: paths of length 0, and composition is given by concatenation

1074: of paths.  The verification of the axioms for a category is

1075: straightforward.

1076:

1077: One of the key points of our work is that we are proposing

1078: passing from posets to categories.  As we have remarked

1079: before, categories are more general than posets, indeed

1080: posets correspond to a degenerate class of categories in

1081: which there is at most one morphism between any two

1082: objects.  The richer structure of categories allows us to

1083: retain more information about the system.  Intuitively, the

1084: use of categories allows us not merely to note that $x$

1085: causally precedes

1086: $y$, but to keep track of the different ways that $x$ may

1087: evolve into $y$.   To make this more precise, we need

1088: a slightly different construction on dags, which will

1089: yield polycategories as opposed to categories.

1090:

1091: \subsection{Polycategories}

1092:

1093: Roughly speaking, the distinction between categories and

1094: polycategories is the following: A category allows one to

1095: have morphisms which go from single objects to single

1096: objects.  A polycategory allows one to have morphisms from

1097: lists of objects to lists of objects.  A typical morphism in

1098: a polycategory (hereafter called a polymorphism) would be

1099: denoted:

1100:

1101: \[ f\colon A_1,A_2,\ldots,A_n\lrarr B_1,B_2,\ldots,B_m\]

1102:

1103: There are a number of contexts in which such a generalization would be

1104: useful.  Before giving the formal definition, we discuss two such contexts.

1105: The first arises in algebra.  Consider Hilbert spaces, vector spaces or any

1106: class of modules in which one can form a tensor product.  Then we can

1107: define a polycategory as follows.  Our objects will be such spaces, and a

1108: morphism of the above form will be a linear function:

1109:

1110: \[ f\colon A_1\ox A_2\ox\ldots\ox A_n\lrarr B_1\ox

1111: B_2\ox\ldots\ox B_m\]

1112:

1113: Thus polycategories have proven to be quite useful in the analysis of

1114: (ordinary) categories in which one can form tensor products of objects.

1115: Indeed this was the original motivation for their definition.  See

1116: \cite{Lambek69,Szabo75}.  Categories in which one has a reasonable notion

1117: of tensor product are called \emph{monoidal}, and have recently figured

1118: prominently in several areas of mathematical physics, most notably

1119: topological quantum field theory \cite{Atiyah90,Baez95}.

1120:

1121: The second well-known application of polycategories is to

1122: logic.  Typically logicians are interested in the analysis of

1123: \emph{sequents}, written:

1124: \[ A_1,A_2,\ldots,A_n\vdash B_1,B_2,\ldots,B_m\]

1125: \noindent Now

1126: $A_1,A_2,\ldots,A_n,B_1,B_2,\ldots,B_m$  represent formulas

1127: in some logical system.  We say that the above sequent holds

1128: if and only if the conjunction of $A_1,A_2,\ldots,A_n$

1129: logically entails the disjunction of $B_1,B_2,\ldots,B_m$.

1130: There is a well-established correspondence between the sort

1131: of logical entailments considered here and categorical

1132: structures.  See for example

1133: \cite{Lambek86}.

1134:

1135: But notice the difference between this and our first

1136: example.  When talking about vector spaces, the ``commas''

1137: on the left and right were both interpreted as the tensor

1138: product.  However in the logic example, we have two

1139: different interpretations.  Commas on the left are treated

1140: as conjunction, while commas on the right are treated as

1141: disjunction.  Thus for a proper categorical interpretation

1142: of polycategories, one needs categories with two monoidal

1143: structures which interact in an appropriate fashion.  Such

1144: categories are called \emph{linearly} or \emph{weakly

1145: distributive}, a notion due to Cockett and Seely

1146: \cite{Cockett97,Blute96}. Linearly distributive categories

1147: are the appropriate framework for considering a specific

1148: logical system known as \emph{linear logic}, introduced by

1149: Girard \cite{Girard87,Girard89}.  For a brief exposition of

1150: linear logic, see the appendix. As we will see, the refined

1151: logical connectives of linear logic will be used to express

1152: the entanglements of our system.

1153:

1154: There is a very geometric or graphical calculus for

1155: representing morphisms in polycategories, which was

1156: introduced by Joyal and Street in

1157: \cite{Joyal91}, and given a logical interpretation in

1158: \cite{Blute96}.  A polymorphism of the form:

1159: \[ f\colon A_1,A_2,\ldots,A_n\lrarr B_1,B_2,\ldots,B_m\]

1160: \noindent is represented as follows:

1161:

1162: \setlength{\unitlength}{.6in}

1163:

1164: \begin{picture}(6,3)(-2,0)

1165: \put(1.2,1){\framebox(3,.6){$f$}}

1166: \put(1.5,1){\vector(0,-1){.6}}

1167: \put(1.9,1){\vector(0,-1){.6}}

1168: \put(2.7,1){\vector(0,-1){.6}}

1169: \put(3,.7){\ldots}

1170: \put(2.2,.7){\ldots}

1171: \put(3.5,1){\vector(0,-1){.6}}

1172: \put(3.9,1){\vector(0,-1){.6}}

1173: \put(1.5,2.2){\vector(0,-1){.6}}

1174: \put(1.9,2.2){\vector(0,-1){.6}}

1175: \put(2.7,2.2){\vector(0,-1){.6}}

1176: \put(3,1.9){\ldots}

1177: \put(2.2,1.9){\ldots}

1178: \put(3.5,2.2){\vector(0,-1){.6}}

1179: \put(3.9,2.2){\vector(0,-1){.6}}

1180: \put(1.15,2.1){$A_1$}

1181: \put(1.55,2.1){$A_2$}

1182: \put(2.9,2.1){$A_{n-1}$}

1183: \put(4.0,2.1){$A_n$}

1184: \put(1.15,.5){$B_1$}

1185: \put(1.55,.5){$B_2$}

1186: \put(2.85,.5){$B_{m-1}$}

1187: \put(4.0,.5){$B_m$}

1188: \end{picture}

1189:

1190: Thus the polymorphism is represented as a box, with the

1191: incoming and outgoing arrows labelled by objects.

1192: Composition in polycategories then can be represented

1193: pictorially in a very natural fashion.  Before giving a

1194: general discussion of composition in a polycategory, we

1195: illustrate this graphical representation.  Suppose we are

1196: given two polymorphisms of the following form:

1197:

1198: \begin{center}

1199: $f\colon A_1,A_2,\ldots,A_n\lrarr B_1,B_2,\ldots,B_m,C$\\

1200: $g\colon C,D_1,D_2,\ldots,D_k\lrarr E_1,E_2,\ldots,E_j$

1201: \end{center}

1202:

1203: Note the single object $C$ common to the codomain of $f$ and

1204: the domain of $g$.  Then under the definition of

1205: polycategory,  we can compose these to get a morphism of

1206: form:

1207:

1208: \[g\circ_Cf\colon A_1,A_2,\ldots,A_n,D_1,D_2,\ldots,D_k\lrarr B_1,B_2,

1209: \ldots,B_m,E_1,E_2,\ldots,E_j\]

1210:

1211: The object $C$ which ``disappears'' after composition is called the \emph{cut

1212: object}, a terminology derived from logic.  Note that we subscript the

1213: composition by the object being cut.  This composition would be represented

1214: by the diagram on Figure~\ref{FigureComp}:

1215:

1216: \begin{figure}[htb]

1217: \begin{picture}(6,4)(-1,-1.5)

1218: \put(1.2,1){\framebox(3,.6){$f$}}

1219: \put(1.5,1){\vector(0,-1){2.12}}

1220: \put(1.9,1){\vector(0,-1){2.12}}

1221: \put(2.7,1){\vector(0,-1){2.12}}

1222: \put(3,.7){\ldots}

1223: \put(2.2,.7){\ldots}

1224: \put(3.5,1){\vector(0,-1){2.12}}

1225: \put(3.9,1){\line(0,-1){.6}}

1226: \put(1.5,2.2){\vector(0,-1){.6}}

1227: \put(1.9,2.2){\vector(0,-1){.6}}

1228: \put(2.7,2.2){\vector(0,-1){.6}}

1229: \put(3,1.9){\ldots}

1230: \put(2.2,1.9){\ldots}

1231: \put(3.5,2.2){\vector(0,-1){.6}}

1232: \put(3.9,2.2){\vector(0,-1){.6}}

1233:

1234: \put(3.6,-0.5){\framebox(3,.6){$g$}}

1235: \put(3.9,-.5){\vector(0,-1){.6}}

1236: \put(4.3,-.5){\vector(0,-1){.6}}

1237: \put(5.1,-.5){\vector(0,-1){.6}}

1238: \put(5.4,-.8){\ldots}

1239: \put(4.6,-.8){\ldots}

1240: \put(5.9,-.5){\vector(0,-1){.6}}

1241: \put(6.3,-.5){\vector(0,-1){.6}}

1242: \put(3.9,.7){\line(0,-1){.6}}

1243: \put(3.65,.5){$C$}

1244: \put(4.3,2.23){\vector(0,-1){2.13}}

1245: \put(5.1,2.23){\vector(0,-1){2.13}}

1246: \put(5.4,.4){\ldots}

1247: \put(4.6,.4){\ldots}

1248: \put(5.9,2.23){\vector(0,-1){2.13}}

1249: \put(6.3,2.23){\vector(0,-1){2.13}}

1250: \end{picture}

1251: \caption{}

1252: \label{FigureComp}

1253: \end{figure}

1254:

1255: We only label the segment corresponding to the cut object, for ease of

1256: reading.  Thus composition in a polycategory is represented by the

1257: concatenation of the graphs of $f$ and $g$, followed by joining the

1258: incoming and outgoing edges corresponding to the cut object.  There are

1259: several other possibilities for applications of the composition rule.  In

1260: some cases, the graphical representation requires our arrows to cross.

1261: This corresponds to having a \emph{symmetric} polycategory.  This is very

1262: much related to having a symmetric tensor or tensors, i.e.  ones with the

1263: property that $A\ox B\cong B\ox A$.  We will always assume our

1264: polycategories are symmetric.

1265:

1266: We now give a more formal definition of polycategory.  We

1267: refer the reader to \cite{Cockett97,Szabo75} for further details.

1268:

1269: \begin{defin}

1270: \rm{

1271: A \textbf{polycategory} $\mathsf{C}$ consists of

1272: the following data:

1273:

1274: \begin{itemize}

1275: \item A set of objects, denoted $|\mathsf{C}|$.

1276: \item If $A_1,A_2,\ldots,A_n$ and  $B_1,B_2,\ldots,B_m$ are

1277: finite  sequences of  objects, then we have a set of

1278: morphisms of the form

1279: $f\colon A_1,A_2,\ldots,A_n\lrarr B_1,B_2,\ldots,B_m$.  We note that

1280: technically one must consider these sequences of objects as being defined

1281: only up to permutation.

1282: \item For every object $A$, we have an identity

1283: morphism $id_A\colon A\rarr A$.

1284: \end{itemize}

1285:

1286: \noindent The composition law was already described pictorially.  The data

1287: of course are subject to a number of axioms, of which most important for us

1288: is the one which requires associativity of composition.  The

1289: notion of \emph{polyfunctor} between polycategories is also straightforward

1290: to formulate.  One first has a function $F$ taking objects to objects, and

1291: then given a morphism $f\colon A_1,A_2,\ldots,A_n \lrarr B_1,B_2, \ldots,

1292: B_m$, one assigns to it a morphism

1293: \begin{equation}

1294: F(f)\colon F(A_1),F(A_2),\ldots, F(A_n) \lrarr F(B_1),F(B_2),\ldots,F(B_m).

1295: \end{equation}

1296: Again, a number of axioms must be satisfied, in particular the polyfunctor

1297: must commute with the composition of polymorphisms.

1298: }

1299: \end{defin}

1300:

1301: As suggested by the above, there is a relationship between polycategories

1302: and monoidal categories.  It is summarized in the following lemma, which

1303: can be found for example in \cite{Cockett97}:

1304:

1305: \begin{prop}\label{thelemma}

1306: Let $\mathsf{C}$ be a monoidal category.  Then one can associate to

1307: $\mathsf{C}$ a polycategory (which will typically be denoted by

1308: $P(\mathsf{C})$ as follows:

1309: \begin{itemize}

1310: \item The objects of $P(\mathsf{C})$ will be the same as those of

1311: $\mathsf{C}$.

1312: \item A polymorphism of the form $f\colon A_1,A_2,\ldots,A_n\lrarr B_1,B_2,

1313: \ldots,B_m$ is a morphism $f\colon A_1\ox A_2\ox\ldots\ox A_n\lrarr B_1\ox

1314: B_2\ox\ldots\ox B_m$.

1315: \item Composition is induced by the composition in $\mathsf{C}$ in the

1316: following way.  Suppose that we have two polymorphisms in $P(\mathsf{C})$

1317: as follows:

1318:

1319: \begin{center}

1320: $f\colon A_1,A_2,\ldots,A_n\lrarr B_1,B_2,\ldots,B_m,C$\\

1321: $g\colon C,D_1,D_2,\ldots,D_k\lrarr E_1,E_2,\ldots,E_j$

1322: \end{center}

1323:

1324: \noindent Then since we are in a monoidal category, we have morphisms

1325: \begin{center}

1326: $f\colon A_1\ox A_2\ox \ldots\ox A_n\lrarr B_1\ox B_2\ox\ldots\ox B_m\ox C$\\

1327: $g\colon C\ox D_1\ox D_2\ox \ldots\ox D_k\lrarr E_1\ox E_2\ox \ldots\ox E_j$

1328: \end{center}

1329:

1330: The composite in  $P(\mathsf{C})$ is then given by:

1331: \begin{equation}\label{identi}

1332: g\circ_C f=(id_{B_1\ox B_2\ox\ldots\ox B_m}\ox g)\circ(f\ox id_{D_1\ox

1333: D_2\ldots\ox D_k})

1334: \end{equation}

1335: \end{itemize}

1336: \end{prop}

1337:

1338: We note that the concepts of polycategory and monoidal category are not

1339: equivalent.  To obtain an equivalence, one needs to replace monoidal

1340: categories with the more general notion of linearly distributive category

1341: mentioned above.

1342:

1343: Now we will demonstrate that a dag generates a polycategory.  In this

1344: construction, the nodes of the dag will be assigned morphisms and the edges

1345: will be assigned objects.

1346:

1347: We consider the dag example of Figure~\ref{figure6}.  We have changed

1348: labels to be more appropriate for the present

1349: discussion.

1350:

1351:

1352: \begin{figure}[htb]

1353: \input{figure6}

1354: \caption{}

1355: \label{figure6}

1356: \end{figure}

1357:

1358: The idea behind the construction is that the nodes of the

1359: dag (the  boxes in our picture)  will correspond to

1360: polymorphisms.  For example, in the above picture,  the box

1361: $f_1$ determines a polymorphism:

1362:

1363: \[f_1\colon A\lrarr C,D\]

1364:

1365: Similarly, $f_4$ determines a polymorphism $f_4\colon

1366: D,E\rarr G$.  Thus we see that one has a polymorphism

1367: corresponding to each node.  The domain of that polymorphism

1368: will be the labels of the incoming arrows, and the codomain

1369: is determined by the labels of the outgoing arrows.  These

1370: are the basic morphisms of the polycategory.  As in the

1371: previous construction, one must adjoin morphisms

1372: corresponding to the allowable compositions.  For example,

1373: in the above case, we can compose the morphisms $f_4$ and

1374: $f_1$ along the cut object $D$ to obtain a new polymorphism

1375: $f_4\circ_D f_1\colon A,E \rarr C,G$.  One must also add

1376: identities and must force these composites to satisfy the

1377: appropriate equations.  This construction yields the {\it

1378: polycategory freely generated by the dag}.  More generally,

1379: we would have the following definition.

1380:

1381: \begin{defin}\label{free}{\rm We suppose that we are given a

1382: finite dag $G$. The {\em free polycategory generated by

1383: $G$}, denoted $P(G)$, is defined as follows.  If a given

1384: vertex $v$ has incoming edges

1385: $A_1,A_2,\ldots, A_n$ and outgoing edges $B_1,B_2,\ldots,B_m$

1386: then the polycategory will have a polymorphism of the form

1387: $f_v\colon A_1,A_2,\ldots,A_n\rarr B_1,B_2,\ldots,B_m$. In

1388: general by induction,  if $P(G)$ has polymorphisms of the

1389: form:

1390:

1391: \begin{center}

1392: $f\colon A_1,A_2,\ldots,A_n\lrarr B_1,B_2,\ldots,B_m,C$\\

1393: $g\colon C,D_1,D_2,\ldots,D_k\lrarr E_1,E_2,\ldots,E_j$

1394: \end{center}

1395:

1396: \noindent then we require the existence of a composite

1397: $g\circ_C f$ as a new polymorphism. We assume the existence

1398: of an identity  morphism for each edge of $G$. Finally we

1399: impose on this data the necessary equations implied by the

1400: definition of polycategory. }

1401: \end{defin}

1402:

1403: \subsection{Categories of interventions} Next we describe an

1404: appropriate for our construction polycategory of intervention

1405: operators; there are several reasonable choices, this being

1406: the most straightforward. We start with the well known fact that  the

1407: category $\mathsf{Hilb}$ of  Hilbert spaces and bounded

1408: linear operators is a monoidal category.   Hence by the

1409: construction of lemma \ref{thelemma}, we obtain a

1410: polycategory.  However this is not the category we will

1411: ultimately use. We will introduce a category

1412: $\mathsf{Conj}$. Intuitively, the objects are Hilbert space

1413: endomorphisms and morphisms are conjugations. A more formal

1414: definition is as  follows. Objects are finite-dimensional

1415: Hilbert spaces. A morphism from

1416: $\hi_1$ to $\hi_2$ is a finite family of maps $\{A_i\}_{i\in

1417: I}$ of linear morphisms $A_i\colon \hi_1\rarr\hi_2$.

1418: Composition is then described as  follows. If we have the

1419: following pair of maps:

1420:

1421: \[ \hi_1\to^{\{A_i\}_{i\in I}}\hi_2

1422: \to^{\{B_j\}_{j\in J}}\hi_3\]

1423:

1424: \noindent then the composite is:

1425:

1426: \[ \hi_1\to^{\{B_j\circ A_i\}_{\langle i,j\rangle\in I\times J}}\hi_3\]

1427:

1428: A morphism in $\mathsf{Conj}$ can be seen as taking

1429: endomorphisms of

1430: $\hi_1$ to endomorphisms of $\hi_2$ by the formula

1431: $\cO\mapsto\sum_m A_m {\cal O} A_m^{\dagger}$. The monoidal

1432: structure on $\mathsf{Hilb}$ lifts to a monoidal structure

1433: on the category $\mathsf{Conj}$.  The tensor product

1434: operator is the usual tensor product of operators on Hilbert

1435: spaces, on maps we take all possible pairings.  We next

1436: restrict the class of morphisms by considering only those

1437: families suxh that the corresponding conjugation is trace

1438: preserving. We call

1439: the resulting category $\mathsf{Dio}$.  This category also

1440: inherits a monoidal structure.  As discussed in Lemma

1441: \ref{thelemma} any monoidal category canonically gives rise

1442: to a polycategory associated to it.  We will denote by

1443: $\cP(\mathsf{Dio})$ the polycategory associated with

1444: $\mathsf{Dio}$.

1445:

1446: \section{The logic of polycategories}\label{logic}

1447:

1448: While definition~\ref{free} gives the free polycategory

1449: generated by  a dag $G$, it will prove to be useful to have

1450: a more constructive description.    Proof-theoretic

1451: techniques have proven to be useful in describing free polycategories.

1452: In our case, the logical structures necessary are quite

1453: simple, and so we  digress briefly to put definition~\ref{free} in

1454: logical terms.  Recall

1455: that one of the common interpretations of a polymorphism is

1456: as a logical sequent\footnote{We note that for

1457: purposes of this paper sequents should always be considered

1458: "up to permutation", i.e. one may rearrange the order of

1459: premises and conclusions as one sees fit.} of the form:

1460:

1461: \[ A_1,A_2,\ldots,A_n\vdash B_1,B_2,\ldots,B_m\]

1462:

1463: Our system will have only one inference rule, called

1464: the \textit{Cut rule}, which states:

1465: \begin{center}

1466: \mbox{

1467: \infer{\Gamma,\Gamma'\vdash\Delta,\Delta'}{\Gamma\vdash\Delta, A &

1468: \Gamma',A\vdash\Delta'}}

1469: \end{center}

1470:

1471: This should be interpreted as saying that if one has derived

1472: the two sequents above the line, then one can infer the

1473: sequent below the line. Proofs in the system always begin

1474: with \textit{axioms}.  Axioms are of the form

1475: $A_1,A_2,\ldots,A_n\vdash B_1,B_2,\ldots,B_m$, where

1476: $A_1,A_2,\ldots, A_n$ are the incoming edges of some vertex

1477: in our dag, and

1478: $B_1,B_2,\ldots,B_m$ will be the outgoing edges. There will

1479: be one such  axiom for each vertex in our dag. For example,

1480: consider Figure~\ref{fig3}. Then we will have the following

1481: axioms:

1482: \[

1483:  a\stackrel{1}{\ent} c\;\;\;

1484:  b\stackrel{2}{\ent}  d,e,f\;\;\;

1485:  c,d\stackrel{3}{\ent}  g,h\;\;\;

1486:  e\stackrel{4}{\ent}  i\;\;\;

1487:  f,g\stackrel{5}{\ent}  j\;\;\;

1488:  h,i\stackrel{6}{\ent}  k

1489: \] where we have labelled each entailment symbol with the

1490: name of the corresponding vertex. The following is an

1491: example of a deduction in this  system of the sequent

1492: $a,b\vdash f,g,h,i$.

1493:

1494: \begin{center}

1495: \mbox{

1496: \infer{a,b\vdash f,g,h,i}

1497: {\infer{a,b\vdash e,f,g,h}{b\vdash

1498: d,e,f & \infer{a,d\vdash g,h}{a\vdash c & c,d\vdash g,h}}

1499: & e\vdash i}

1500: }

1501: \end{center}

1502: This deduction corresponds to the fact that in the free

1503: polycategory generated by this dag, one has a morphism

1504: $a,b\rarr f,g,h,i$. In fact, it is easy to see that there is

1505: a precise correspondence between deductions in  this logical

1506: system and nonidentity morphisms in the free polycategory.

1507:

1508: As a first attempt at capturing quantum evolution on a dag $G$

1509: axiomatically, one

1510: might consider taking a polyfunctor from $P(G)$ to

1511: $P(\mathsf{Hilb})$, where

1512: $\mathsf{Hilb}$ is the usual category of finite-dimensional

1513: Hilbert spaces with its usual tensor product. Note that such

1514: a polyfunctor must necessarily take a sequence of, say, incoming edges

1515: $A_1,A_2,\ldots,A_n$ to

1516: $\hi_1\ox\hi_2\ldots\ox\hi_n$ where $\hi_i$ corresponds to

1517: $A_i$. Then one would (tentatively) define a set

1518: $\Delta$ of edges to be {\it valid} if there is a

1519: deduction in the logic generated by $G$ of

1520: $\Gamma\vdash\Delta$ where

1521: $\Gamma$ is a set of initial edges. Equivalently there must

1522: be a morphism

1523: $\Gamma\rarr\Delta$ in $P(G)$. Then the polyfunctor would

1524: take this to a morphism of Hilbert spaces

1525: $T\colon\hi_\Gamma\rarr\hi_\Delta$. The initial density

1526: matrices would always be assumed to be given, and one would

1527: just apply $T$ to the appropriate initial density matrices

1528: to obtain the density matrix associated to $\Delta$. The locative

1529: slices are the ones on which density matrices can be obtained without

1530: the trace operation and we are looking to equate the notions of

1531: locative and valid for slices. This

1532: approach would be genuinely axiomatic, and would evidently

1533: be applicable to other situations by simply using a category

1534: other than Hilbert spaces as the target of the polyfunctor.

1535: Furthermore we would suggest that using logic as the means

1536: of calculating the matrices gives the approach a very

1537: canonical flavor.

1538:

1539: However, with this notion of validity, we would fail to

1540: capture all locative slices, and thus our tentative notion

1541: of validity will have to be modified.  For example, consider

1542: the dag underlying the system of Figure~\ref{figureD} shown

1543: in Figure~\ref{figure5}.

1544:

1545: \begin{figure}[htb]

1546: \begin{center}

1547: \input{figure5}

1548: \end{center}

1549: \caption{}

1550: \label{figure5}

1551: \end{figure}

1552:

1553: Corresponding to this dag, we get the following basic

1554: morphisms (axioms):

1555: \[a\vdash b,c \,\,\,\,\,\,\, b\vdash d\,\,\,\,\,\,\, c\vdash

1556: e\,\,\,\,\,\,\, d,e\vdash f.\] Evidently, the set $\{f\}$ is

1557: a locative slice, and yet  the sequent $a\vdash f$ is not

1558: derivable. The sequent $a\vdash d,e$ is derivable, and one

1559: would like to cut it against $d,e\vdash f$, but one is only

1560: allowed to cut a single formula. Such ``multicuts'' are

1561: expressly forbidden, as they lead to undesirable logical

1562: properties

1563: \cite{Blute93}.

1564:

1565: Physically, the reason for this problem is that the sequent

1566: $d,e\vdash f$ does not encode the information that the two

1567: states at $d$ and $e$ are correlated.  It is precisely the

1568: fact that they are correlated that implies that one would

1569: need to use a multicut. To avoid this problem, one must

1570: introduce some notation, specifically a syntax

1571: for specifying such correlations. We will

1572: use the {\it logical connectives} of the multiplicative

1573: fragment of {\it linear logic} \cite{Girard87,Girard95}

1574: to this end. The multiplicative disjunction of linear

1575: logic, denoted $\girpar$ and called the {\it par} connective,

1576: will express such nonlocal correlations.

1577: In our example, we will write the sequent corresponding to

1578: vertex $4$ as

1579: $d\girpar e\vdash f$ to express the fact that the subsystems associated

1580: with these two edges are possibly entangled through interactions in their

1581: common past.

1582:

1583: Note that whenever two (or more)

1584: subsystems emerge from an interaction, they are correlated.

1585: In linear logic, this is reflected by the following rule

1586: called the (right) \emph{Par rule}:

1587:

1588: \begin{center}

1589: \mbox{

1590: \infer[]{\Gamma\vdash \Delta,A\girpar

1591: B}{\Gamma\vdash \Delta,A, B} }

1592: \end{center}

1593: Thus we can always introduce the symbol for correlation in the right

1594: hand side of the sequent.

1595:

1596: Notice that we can cut along a compound formula without

1597: violating any logical rules.  So in the present

1598: setting, we would have the following deduction:

1599: \begin{center}

1600: \mbox{

1601: \infer{a\vdash f}

1602:    {\infer{a\vdash d\girpar e}{\infer{a\vdash d, e}

1603: {\infer{a\vdash c,d}{a\vdash b,c & b\vdash d}  & c\vdash e}}

1604: & d\girpar e\vdash f} }

1605: \end{center} All the cuts in this deduction are legitimate;

1606: instead of a multicut we are cutting along a compound

1607: formula in the last step. So the first step in modifying our

1608: general prescription is to extend our polycategory logic, which originally

1609: contained only the cut rule, to

1610: include the connective rules of linear logic. These are described in

1611: the appendix.

1612:

1613: The above logical rule determines how one introduces a par connective

1614: on the righthand side of a sequent. For the lefthand side,

1615: one introduces pars in the axioms by the following general prescription.

1616: Given a vertex in a

1617: multigraph, we suppose that it has incoming edges

1618: $a_1,a_2,\ldots,a_n$ and outgoing edges $b_1,b_2,\ldots,b_m$.

1619: In the previous formulation, this vertex would have been

1620: labelled with the axiom $\Gamma=a_1,a_2,\ldots,a_n\vdash

1621: b_1,b_2,\ldots,b_m$. We will now introduce several pars

1622: ($\girpar$) on the lefthand side to indicate entanglements

1623: of the sort described above. Begin by defining a relation

1624: $\sim$ by saying $a_i\sim a_j$ if there is an initial edge

1625: $c$ and directed paths from $c$ to $a_i$ and from $c$ to

1626: $a_j$. This is not an equivalence relation, but one takes

1627: the equivalence relation generated by the relation $\sim$.

1628: Call this new relation $\cong$. This equivalence relation,

1629: like all equivalence relations, partitions the set $\Gamma$

1630: into a set of equivalence classes. One then "pars" together

1631: the elements of each equivalence class, and this determines

1632: the structure of the lefthand side of our axiom. For

1633: example, consider vertices 5 and 6 in Figure~\ref{fig3}.

1634: Vertex 5 would be labelled by $f\girpar g\vdash j$ and

1635: vertex 6 would be labelled by $h\girpar i\vdash k$. On the

1636: other hand, vertex 3 would be labelled by $c,d\vdash g,h$.

1637:

1638: Just as the par connective indicates the existence of past

1639: correlations, we use the more familiar tensor symbol $\ox$,

1640: which is also a connective of linear logic, to indicate the lack of

1641: nonlocal correlation. This connective also has a logical rule:

1642:

1643:

1644: \begin{center}

1645: \mbox{

1646: \infer{\Gamma,\Gamma'\vdash \Delta,\Delta',A\ox B}

1647: {\Gamma\vdash \Delta,A & \Gamma'\vdash \Delta',B} }

1648: \end{center}

1649:

1650: But we note that unlike in ordinary logic, this rule can only

1651: be applied in situations that are physically meaningful. We will say

1652: that two deductions $\pi$ and $\pi'$ are {\it spacelike

1653: separated} if all the the vertices of $\pi$ and $\pi'$

1654: are pairwise spacelike separated. In the above formula, we

1655: require that the deductions of $\Gamma\vdash \Delta,A$ and

1656: $\Gamma'\vdash \Delta',B$ are spacelike separated.

1657: This restriction of application of

1658: inference rules is similar to the restrictions of

1659: {\it ludics} \cite{Girard01}.

1660: >From a categorical standpoint, the restrictions imply

1661: that the connectives are only partial functors, but this is

1662: only a minor issue.

1663:

1664: Summarizing, to every dag $G$ we associate its ``logic'',

1665: namely the edges are considered as formulas and vertices are

1666: axioms. We have the usual linear logical connective rules,

1667: including the cut rule which in our setting is interpreted physically

1668: as propagation. The par connective denotes correlation, and the tensor

1669: lack of correlation. Note that every deduction in our system

1670: will conclude with a sequent of the form $\Gamma\vdash\Delta$,

1671: where $\Gamma$ is a set of initial edges.

1672:

1673: Now one would

1674: like to modify the definition of validity to say that a set

1675: of edges $\Delta$ is {\it valid} if in our extended

1676: polycategory logic, one can derive a sequent

1677: $\Gamma\vdash\hat{\Delta}$ such that the list of edges

1678: appearing in $ \hat{\Delta}$ was precisely $\Delta$, and

1679: $\Gamma$ is a set of initial edges. However this is still

1680: not sufficient as an axiomatic approach to capturing all

1681: locative slices. We note the example in Figure~\ref{figure8}.

1682:

1683: \begin{figure}[htb]

1684: \begin{center}

1685: \input{figure8}

1686: \end{center}

1687: \caption{}

1688: \label{figure8}

1689: \end{figure}

1690:

1691: Evidently the slice $\{f,g\}$ is locative, but we claim that

1692: it cannot be derived even in our extended logic. To this

1693: directed graph, we would associate the following axioms:

1694:

1695: \[ a\vdash c,h \,\,\,\,\,\, b\vdash d,e\,\,\,\,\,\,

1696: c,d\vdash f\,\,\,\,\,\, h,e\vdash g\]

1697:

1698: Note that there are no correlations between $c$ and $d$ or

1699: between $h$ and $e$. Thus no $\girpar$-combinations can be introduced.

1700: Now if one attempts to derive $a,b\vdash f,g$, we proceed as

1701: follows:

1702:

1703: \begin{center}

1704: \mbox{

1705: \infer{a,b\vdash h,e,f}{\infer{a,b\vdash c\ox d,h,e}{a\vdash

1706: c,h & b\vdash d,e} & \infer{c\ox d\vdash f}{c,d\vdash f}}

1707: }\end{center}

1708:

1709: At this point, we are unable to proceed. Had we attempted

1710: the symmetric approach tensoring $h$ and $e$ together, we

1711: would have encountered the same problem.

1712:

1713: The problem is that our logical system is still missing one

1714: crucial aspect, and that is that correlations develop

1715: dynamically as the system evolves, or equivalently as the

1716: deduction proceeds. Thus our axioms must

1717: change dynamically as well. We give the following definition.

1718:

1719: \begin{defin}{\em

1720: Suppose we have a deduction $\pi$ of the sequent

1721: $\Gamma\vdash\Delta$ in the graph logic associated to the

1722: dag $G$, and that $T$ is a vertex in $G$ to the future or acausal

1723: to the edges of the set

1724: $\Delta$ with $a$ and $b$ among the incoming edges of $T$.

1725: Then $a$ and $b$ are {\em correlated} with respect to $\pi$

1726: if there exist outgoing edges $c$ and $d$ of the proof

1727: $\pi$ and directed paths from $c$ to $a$ and from $d$ to $b$.

1728: }\end{defin}

1729:

1730: So the point here is that when performing a deduction, one does

1731: not assign an axiom to a given vertex until it is necessary to use

1732: that axiom in the proof. Then one assigns that axiom using this new

1733: notion of correlation and the equivalence relation defined above.

1734: This prescription reflects the physical reality that entanglement of

1735: local quantum subsystems could develop as a result of a distant interaction

1736: between some other subsystems of the same quantum system.

1737: We are finally able to give the following crucial definition:

1738:

1739:

1740:

1741: \begin{defin}{\em

1742: A set

1743: $\Delta$ of edges in a dag $G$ is said to be {\em valid} if there is a

1744: deduction in the logic associated to $G$ of

1745: $\Gamma\vdash\hat{\Delta}$ where $\hat{\Delta}$ is a sequence of formulas

1746: whose underlying set of edges is precisely $\Delta$ and where

1747: $\Gamma$ is a set of initial edges, in fact the set of initial edges

1748: to the past of $\Delta$.}

1749: \end{defin}

1750:

1751: We are also ready to state the result relating the logical deduction and

1752: the dynamics of Section~\ref{dyna} in a graph.

1753:

1754: \begin{thm} A set of edges is valid if and only if it is locative. More

1755: specifically, if there is a deduction of $\Gamma\vdash\hat{\Delta}$

1756: as described above, then $\Delta$ is necessarily locative. Conversely,

1757: given any locative slice, one can find such a deduction.

1758: \end{thm}

1759: \begin{proof}\\ Recall that a locative slice $L$ is obtained from the set

1760: of initial edges in its past by an inductive procedure.

1761: At each step, we choose arbitrarily a minimal vertex $u$ in the past of $L$,

1762: remove the incoming edges of $u$ and add the outgoing edges. This step

1763: corresponds to the application of a cut rule, and the method we have used

1764: of assigning the par connective

1765: to the lefthand side of an axiom ensures that it is

1766: always a legal cut. The tensor rule is necessary in order to combine

1767: spacelike separated subsystems in order to prepare for the application of

1768: the cut rule.

1769: \end{proof}

1770:

1771: Thus we have successfully given an axiomatic logic-based approach

1772: to describing evolution. In summary, to find the density matrix

1773: associated to a locative slice $\Delta$, one finds a set

1774: of linear logic formulas whose underlying set of atoms is $\Delta$

1775: and a deduction of $\Gamma\vdash\hat{\Delta}$ where $\Gamma$ is

1776: as above. This deduction is interpreted as a morphism in the

1777: corresponding polycategory, and the polyfunctor to $\cP(\mathsf{Dio})$

1778: is applied to obtain a morphism in the category $\mathsf{Dio}$.

1779: (Note that in this context a polyfunctor is furthermore required to take

1780: any tensor or par connective in $\Gamma$ or $\hat{\Delta}$ to the

1781: usual tensor in $\mathsf{Dio}$.)

1782: One then plugs in the given initial data to obtain the density matrix

1783: corresponding to that slice. Given a nonlocative slice, one simply finds

1784: a locative slice containing it, repeats the above procedure and then

1785: traces out the extraneous edges.

1786:

1787: \section{Conclusions}\label{conc}

1788:

1789: We have presented an axiomatic system for the analysis

1790: of quantum evolution. The dynamics is local as to preserve causality, but at

1791: the same time entanglement of separated quantum systems is faithfully

1792: represented. One could apply these ideas

1793: to other situations

1794: by using a category other than the category of intervention

1795: operators

1796: as the target of the functor. An appropriate

1797: categorical structure for the target is the notion of

1798: a {\it traced monoidal category} \cite{Joyal96} or

1799: the notion of a {\it traced ideal} \cite{Blute99}.

1800: See also \cite{Blute00}.

1801: One particular situation which might be analyzed in this

1802: framework is the notion of classical probabilistic

1803: information. The paper \cite{Blute99} contains a category of

1804: {\it probabilistic relations} which might be of particular

1805: interest in this setting.

1806:

1807: Our work also suggests a natural

1808: extension of the notion of {\it consistent} or {\it

1809: decoherent histories} \cite{Gell-Mann93,Griffiths96}.

1810: Restricting the intervention operators at the vertices of our graph $G$

1811: to be projection operators we can consider $G$ to denote a particular

1812: history within a set of histories. This relaxes the usual linear ordering of

1813: events considered in the literature thus far.

1814: An exposition of histories on graphs is under preparation.

1815:

1816: \section*{Acknowledgements} The authors would like to thank

1817: NSERC for its financial support.  We would also like to

1818: thank Phil Scott and Jean-Yves Girard for inviting us to

1819: present this work at the joint SMF-AMS conference in Lyon.  The

1820: paper

1821: \cite{Markopoulou00}, which led to our initial consideration

1822: of these ideas, was pointed out to us by Ioannis Raptis.  We

1823: would like to especially thank Rafael Sorkin for a lengthy

1824: discussion on causal sets and related topics.  Finally, the

1825: second author would like to thank the University of Ottawa

1826: Department of Mathematics and Statistics for its support.

1827: \bibliography{causal}

1828:

1829:

1830: \appendix

1831: \section{Linear logic}

1832:

1833: This section can safely be skipped by logicians.

1834:

1835: Linear logic~\cite{Girard87} is a logic introduced by Girard

1836: in 1987 to allow a finer analysis of how ``resources'' are

1837: consumed in the course of a deduction. As already remarked

1838: in the text, the primary objects of study in logic and

1839: especially proof theory are {\it sequents}, and the

1840: constructors of sequents, the {\it inference rules}. Several

1841: examples have already been given such as the {\it cut rule}:

1842:

1843: \begin{center}

1844: \mbox{

1845: \infer[CUT]{\Gamma,\Gamma'\vdash\Delta,\Delta'}{\Gamma\vdash\Delta,

1846: A &

1847: \Gamma',A\vdash\Delta'}}

1848: \end{center}

1849:

1850: So typically an inference rule is a prescription for

1851: creating a more complex sequent from one or possibly several

1852: simpler ones. Two typical inference rules are the rules of

1853: {\it contraction} and {\it weakening}. These are as follows:

1854:

1855: \begin{center}

1856: \mbox{\infer[CONT]{\Gamma,A\vdash\Delta}

1857: {\Gamma,A,A\vdash\Delta}}

1858: \end{center}

1859:

1860: \begin{center}

1861: \mbox{\infer[WEAK]{\Gamma,A\vdash\Delta}

1862: {\Gamma\vdash\Delta}}

1863: \end{center}

1864:

1865: There are similar rules for the righthand side as well. These

1866: have long been standard in most logics, and indeed have a

1867: strong intuitive meaning. For example, contraction says that

1868: it is unnecessary to make the same assumption twice.

1869: However, in Girard's reexamination of the sequent calculus,

1870: he proposed an interpretation in which the formulas to the

1871: left of a sequent are resources to be consumed in the course

1872: of producing the output, i.e. the conclusions. From this

1873: perspective, the rules of contraction and weakening are

1874: quite dubious. The first step towards defining linear logic

1875: then is to recover these rules from the system. The result is

1876: a remarkably rich structure, the most notable aspect of

1877: which is that the usual connectives of logic, conjunction

1878: and disjunction, each split into two connectives. These

1879: connectives are naturally split into two classes, the {\it

1880: multiplicative} and the {\it additive} connectives. It is

1881: only the multiplicative connectives that will concern us

1882: here. Here are the rules for these connectives:

1883:

1884: \begin{center}

1885: \mbox{

1886: \infer[Right-\girpar]{\Gamma\vdash \Delta,A\girpar

1887: B}{\Gamma\vdash \Delta,A, B} }

1888: \end{center}

1889:

1890: \begin{center}

1891: \mbox{

1892: \infer[Left-\girpar]{\Gamma,\Gamma' A\wp B\vdash \Delta,\Delta'}

1893: {\Gamma,A\vdash \Delta & \Gamma',B\vdash \Delta'} }

1894: \end{center}

1895:

1896: \begin{center}

1897: \mbox{

1898: \infer[Right-\ox]{\Gamma,\Gamma'\vdash \Delta,\Delta',A\ox B}

1899: {\Gamma\vdash \Delta,A & \Gamma'\vdash \Delta',B} }

1900: \end{center}

1901:

1902: \begin{center}

1903: \mbox{

1904: \infer[Left-\ox]{\Gamma,A\ox B\vdash \Delta}{\Gamma,A,B\vdash \Delta} }

1905: \end{center}

1906:

1907:

1908: Categorically, the structure of linear logic has striking

1909: properties as well. As is traditional in categorical logic,

1910: one can form a category whose objects are formulas, and

1911: morphisms are proofs. This construction is described for

1912: example in \cite{Lambek69,Lambek86}. When one applies this

1913: construction to (multiplicative) linear logic ({\bf MLL}),

1914: one obtains a special class of symmetric monoidal closed

1915: categories called {\it $*$-autonomous}. These were defined

1916: by Barr in \cite{Barr79}.

1917:

1918: Subsequently it was demonstrated that the correspondence

1919: between proofs in {\bf MLL} and morphisms in the free

1920: $*$-autonomous category is quite sharp.  See

1921: \cite{Blute93,Blute96}.   This correspondence between

1922: morphisms and proofs is best expressed using {\it proof

1923: nets}, a graph-theoretic system for representing {\bf MLL}

1924: proofs \cite{Girard87}. Proof nets had already been seen to

1925: be a remarkable deductive system, exhibiting properties of

1926: great importance in the analysis of computation, especially

1927: concurrent computation.  The precise connection between

1928: proof nets and free $*$-autonomous categories

1929: provides further evidence of their great utility.

1930:

1931:

1932:

1933:

1934: \end{document}

1935:

1936:

1937:

1938:

1939:

1940:

1941:

1942:

1943:

1944:

1945:

1946:

1947:

1948:

1949:

1950:

1951:

1952:

1953:

1954:

1955: