0401:q-bio0401005/dma.tex

1: %\documentclass{article}

2: \documentclass[twocolumn,showpacs,preprintnumbers,amsmath,amssymb,rmp]{revtex4}

3:

4: \usepackage{latexsym}

5: \usepackage{graphicx}

6:

7: \begin{document}

8: \title{How to find decision makers in neural circuits?}

9: \author{Alexei A. Koulakov$^{1}$, Dmitry Rinberg$^{2}$, and Dmitry N. Tsigankov$^{1}$}

10:

11: \address{

12: \protect{$^1$}\hspace{-.0in}Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, USA \\

13: \protect{$^2$}Monell Chemical Senses Center, Philadelphia, PA, USA

14: }

15:

16: \begin{abstract}

17: Neural circuits often face the problem of classifying stimuli into discrete

18: groups and making decisions based on such classifications. Neurons of these

19: circuits can be distinguished according to their correlations with different

20: features of stimulus or response, which allows defining sensory or motor

21: neuronal types. In this study we define the third class of neurons, which is

22: responsible for making decision. We suggest two descriptions for contribution of

23: units to decision making: first, as a spatial derivative of correlations between

24: neural activity and the decision; second, as an impact of variability in a given

25: neuron on the response. These two definitions are shown to be equivalent, when

26: they can be compared. We also suggest an experimental strategy for determining

27: contributions to decision making, which uses electric stimulation with time-

28: varying random current.

29: \end{abstract}

30:

31: \pacs{n/a}

32:

33: \maketitle

34:

35: \section{INTRODUCTION}\label{introduction}

36:

37: Nervous system is continuously confronted by megabytes of information,

38: representing light, sound, smell, etc. This information is compiled by the

39: brain into a set of decisions, representing behaviors of living organisms.

40: The mechanisms involved in this reduction have been under investigation for

41: many years (Glimcher, 2003; Romo and Salinas,

42: 2003). In this study we address a question complimentary to the issue of

43: decision making (DM) mechanisms. We define neuronal units involved in making

44: perceptual decisions. For this purpose we determine DM activity in surrogate

45: networks, defined mathematically, in which a complete control is present

46: over stimuli, mechanisms, and responses. Such decision making analysis (DMA)

47: has practical significance, since once units involved in making particular

48: decision are located, further efforts could be concentrated on uncovering

49: the underlying mechanisms.

50:

51: In this study DM task is defined as evaluation of a function in the

52: multidimensional stimulus space (Figure 1A). This function has a discrete set

53: of values, representing the repertoire of responses available to the

54: organism. The decisions may, of course, be stochastic, to reflect the

55: uncertainty, pertinent to behavior. This definition is suitable for

56: experiments where subjects perform poly-alternative forced-choice tasks,

57: such as saccadic response to the direction of stimulus motion

58: (Shadlen and Newsome, 2001).

59:

60:

61:

62:

63:

64:

65: \begin{figure}[htbp]

66: \centerline{\includegraphics[width=3.0in]{dma1.eps}}

67: \caption{\textbf{A,} Definition of decision making task. Nervous system

68: evaluates a function, whose values represent discrete decisions, in the

69: many-dimensional sensory space. \textbf{B,} Some of the visual areas

70: involved in motion-discrimination task. The areas on the left are more

71: sensory (response is correlated with the sensory input), while those of the

72: right are more motor (correlated with the response).

73: }

74: \label{fig1}

75: \end{figure}

76:

77:

78: Let us consider motion-discrimination task in more detail. Figure 1B lists

79: some visual areas, which are involved in this task. The areas are arranged

80: along a rough sensory-motor axis, so that the areas on the left are more

81: ``sensory'', while those on the right are more ``motor''. This implies that

82: the responses in these areas are more correlated with stimulus or response

83: respectively. Where on this sensory-motor axis one should position the DM

84: elements? One could argue that the elements most correlated with the

85: decision itself are the decision makers, following the analogy with the

86: definition of sensory and motor elements. It is, however, difficult, if not

87: impossible, to distinguish such definition from the definition of purely

88: motor units (Shadlen and Newsome, 2001). The latter

89: relay the results of decision making process, without involvement in the

90: formation of the decision. An alternative approach is therefore needed to

91: define the DM units.

92:

93: The DM components may be surmised to be located on the interface between

94: sensory and motor areas. More precisely, the \textit{first} element in the sensory-motor

95: chain, which carries significant correlation with the response, may be

96: identified as the decision maker. In this study we develop this idea into

97: rigorous mathematical formulation and find a special correlation function,

98: which determines contributions of units to DM. This

99: formalism allows us to answer two questions pertaining to the identities of

100: DM units. First, we consider the case when not one but \textit{several} elements are

101: involved in the same decision simultaneously. Our approach allows us to

102: evaluate relative importance of various units in such a distributed DM.

103: Second, we consider the systems with loops in connectivity. For such systems

104: the concept of `the first element' becomes more arbitrary and one has to

105: proceed more carefully in defining contributions to DM. We succeed in doing

106: so for our surrogate networks and define DM units for recurrent networks in

107: a way, which is consistent with the linear sensory-motor chains, thus

108: satisfying the requirement of the correspondence principle.

109:

110: This paper is organized as follows. We first analyze simple linear chain models,

111: and networks, such as trees, which have similar properties. We then

112: use this analysis to define decision makers in networks of arbitrary

113: connectivity. Finally, we extend our study to the cases, when electric

114: stimulation can be applied to units, and show that DM components can be

115: identified in a way consistent with our preceeding analyses.

116:

117: \section{LINEAR CHAINS AND THEIR DERIVATIVES}

118: \label{linearchain}

119:

120: The goal of this section is to formulate quantitative principles by which DM network elements can be identified.

121: We approach this task by analyzing simple cases, which can be solved exactly without the use of computer, and in which the identities of DM elements are clear.

122: These cases allow us to emphasize the properties of DM task we are attempting to describe.

123: We proceed therefore to the analysis of the simplest network capable of making decisions.

124:

125:

126: \subsection{The `nematode' network}

127: \label{nematode}

128:

129: In this subsection we consider the network, which we call `nematode', because of its resemblance to simple biological organisms, both in the layout and in the fundamental significance.

130: We first define the model; then show that it can make simple decisions; and, finally, define the positions of decision makers in the network.

131:

132:

133: \begin{figure}[htbp]

134: \centerline{\includegraphics[width=3.0in]{dma2.eps}}

135: \caption{\textbf{A,} a simple `nematode' network consists of a linear chain

136: of units. All units in the chain, but

137: the last, are linear. The last unit, shown by the square, is non-linear and

138: returns zero or one depending on the sign of the response of the preceding

139: unit. \textbf{B,} The average input-output relationship for the `nematode'

140: is given by the sigmoid function (error function). The spread of the sigmoid

141: is determined by the net noise in the chain.

142: }

143: \label{fig2}

144: \end{figure}

145:

146:

147: Consider a linear chain of units, whose response is characterized by a set

148: of real numbers $x_i$, where $i=1...N$ is the position of the element in

149: the chain (Figure 2). Response of each element does not depend on time. This

150: model is therefore static. This assumption is introduced here to simplify

151: the analysis and can be relaxed as described below (section~\ref{trees}). Each unit

152: performs a simple linear transformation between the unit's input and the

153: output. Thus, for element number $i$

154: \begin{equation}

155: \label{eq1}

156: x_i =x_{i-1} +\eta _i

157: \end{equation}

158: Here $\eta _i $ is noise associated with the element. In this work we assume

159: that noise has zero mean, is individual to each unit, and, therefore, is

160: uncorrelated between units, i.e.

161: \begin{equation}

162: \label{eq2}

163: {\begin{array}{*{20}c}

164:  {\overline {\eta _i } =0,} \hfill & \hfill \\

165: \end{array} }\overline {\eta _i \eta _j } =\left\{ {{\begin{array}{*{20}c}

166:  {\overline {\eta _i^2 } ,} \hfill & {i=j} \hfill \\

167:  {0,} \hfill & {i\ne j} \hfill \\

168: \end{array} }} \right.

169: \end{equation}

170: We further assume that noise has a Gaussian distribution. The chain of

171: linear elements is thus completely specified by a set of noise variances

172: $\overline {\eta _i^2 } $. The model described by (\ref{eq1}) and (\ref{eq2}) yields the

173: following solution for the response of the last element in the chain

174: \begin{equation}

175: \label{eq3}

176: x_N =x_0 +\eta _1 +\eta _2 +...+\eta _{N-1} +\eta _N .

177: \end{equation}

178: Thus, the response of the last element is just a sum of the input into

179: network $x_0$ and noise contributions from all units, independently on the

180: order of unit in the chain.

181:

182: The last element in the chain has non-linear response properties. Its

183: response is defined by

184: \begin{equation}

185: \label{eq4}

186: d=H(x_N ),

187: \end{equation}

188: where $H(x)$ is the Heaviside~step~function, which is equal to one/zero if

189: the argument is positive/negative. It follows then that our `nematode'

190: network is capable of making decisions based on the values of input

191: variable $x_0$. This is if we interpret variable $d$, which is equal either 0

192: or 1, as the result of DM process, as defined in Figure 1A. The decisions

193: are made stochastically and are dependent upon the instantiations of random

194: variables $\eta _i $, which vary from trial to trial.

195:

196: \begin{figure}[htbp]

197: \centerline{\includegraphics[width=3.0in]{dma3.eps}}

198: \caption{

199: \textbf{A,} If signal-to-noise ratio is high, responses of all

200: units are well correlated with the output, as shown on the right by the

201: mutual information between the response of given unit and the output.

202: \textbf{B,} For the case of low signal-to-noise ratio, the output is more

203: correlated with the motor units (right) than with the sensory ones (left).

204: }

205: \label{fig3}

206: \end{figure}

207:

208:

209: Our model is completely defined by the set of noise variances, pertinent to

210: each unit $\overline {\eta _i^2 }$. Although decisions made by this chain

211: are quite simple, the identities of decision makers are not so easy to find.

212: The distribution of impact to DM along the chain should depend upon the

213: distribution of noise variables $\overline {\eta _i^2 } $. Our next goal is

214: to develop a sensible definition of contributions to DM based on the vector

215: of variances $\overline {\eta _i^2 } $. Before doing so we describe general

216: input-output properties of the chain.

217:

218: Since decision made by the network varies from trial to trial, one can

219: define averaged over trials response of the system $\overline {d(x_0 )} $.

220: As shown in Figure 2B it has a sigmoid shape, smeared by the total amount of

221: noise in the system. One can, therefore, consider two cases, depending on

222: whether the signal-to-noise ratio for the chain is large or small. These two

223: regimes are shown in Figure 3A and B respectively.

224:

225: To analyze responses of units in these two cases we define their correlation

226: with the decision. This correlation is defined for each element in the chain

227: (Figure 3, right). As a measure of correlation we choose mutual information

228: (MI) between response of the $i-$th unit, $x_{i}$, and the decision, $d$. MI has an

229: advantage of being unitless (it is measured in bits) and having clear

230: intuitive properties, as described below. We will also show below in this

231: section that MI has limitations as a measure of DM.

232:

233: MI describes the information transmission from the $i-$th unit to the output of

234: the system. Since the output can only have values 0 or 1, MI cannot exceed

235: the value of one bit. We now consider two cases, depending on the network's

236: signal-to-noise ratio. If network input $\left| {x_0 } \right|$ is large, as

237: in Figure 3A, response of the system is well correlated with the input.

238: Hence, activities of all units are well correlated with both input and

239: output, and $MI(x_i ,d)\approx 1$ for all of the units. In the opposite

240: limit, when the signal-to-noise ratio is small, $\left| {x_0 } \right|$ is

241: smaller than noise, and the system's response is weakly correlated with the

242: input (Figure 3B). In this case MI as a function of unit's position displays a

243: structure, shown in Figure 3B (right). This structure, as shown below, has a

244: key to the definition of DM components and is qualitatively discussed here.

245: The units, which are close to the exit from the network, show strong

246: correlation with the decision, similarly to the high signal-to-noise ratio case.

247: Their MI is therefore close to 1 bit. On the

248: other hand, more `sensory' units, in the beginning of the chain are strongly

249: correlated with the input. Since input-output correlation is weak in low

250: signal-to-noise ratio case, the `sensory' units display virtually \textit{no} relation

251: to the output and $MI(x_i ,d)\approx 0$ for such units (Figure 3B, right).

252: Thus, MI, as a function of $i$ displays a transition from 0 to 1 in the

253: low signal-to-noise ratio case.

254:

255: How could one deduce identities of decision makers from these dependencies

256: (Figure 3A and B)? One could suggest that the elements perfectly correlated

257: with the output of the system, such as exit elements from the chain, are the

258: ones that make the decision. However, such elements may be just the relay or

259: `motor' units, in which case their contribution to DM is small. Indeed, when

260: we type, our decisions are perfectly correlated with activities of finger

261: muscles; but one could hardly blame our fingers for the content of the

262: typing. Thus, despite their high correlation with the output, exit elements

263: could not be called decision makers. Input elements, having no correlation

264: with the decision, are responsible for DM in even lesser degree. We thus

265: need to analyze the dependence of MI on position in more detail and suggest

266: another scheme for defining DM units.

267:

268: Our discarding of motor units as decision makers can be further extended

269: onto the entire high signal-to-noise ratio case (Figure 3A). We suggest that

270: the deterministic regime is not descriptive from the point of view of DM

271: analysis. First, in this regime all units become indistinguishable from

272: motor. The latter are not decision makers, as suggested above. Second, the

273: dependence shown in Figure 3A (right) does not reveal the contributions of

274: individual units to the decision. Since all units have the same correlation,

275: it is hard, if not impossible, to differentiate them and assign different

276: contributions. Third, the responses of units in this case are

277: deterministically related to the input. Hence, units act as relays,

278: passively transmitting information along the chain. It can be argued that

279: the external environment, providing the input variable $x_0$, acts as the

280: decision maker. We conclude that to find decision making activity one has to

281: concentrate on the low signal-to-noise ratio case.

282:

283: We show below that the identities of decision making units can be deduced

284: from the shape of transition in Figure 3B (right). To this end we analyze a

285: set of examples of networks with various distributions of noise $\overline

286: {\eta _i^2 } $. We start from the simplest example of a single noisy unit.

287:

288: \subsubsection{Example 1: 'Noisy' neuron.}

289:

290: Consider a chain in which noise is absent from all units but one, whose

291: order number in the chain is $n$ (Figure 4). Since, according to our

292: previous discussion, we need to consider the low signal-to-noise ratio case,

293: we will assume that

294: \begin{equation}

295: \label{eq5}

296: x_0 =0,

297: \end{equation}

298: i.e. network receives no input. Making the decision in this case is still possible,

299: based on the values of noise inside the network. Since noise is only present in one neuron, from

300: (\ref{eq3}) we conclude that

301: \begin{equation}

302: \label{eq6}

303: x_N =\eta _n .

304: \end{equation}

305: The decision made by the network is

306: \begin{equation}

307: \label{eq7}

308: d=H(\eta _n ).

309: \end{equation}

310: Thus, decision is causally linked to the processes controlling unit number

311: $n$, which leads us to conclusion that this neuron is the decision maker.

312:

313: Paradoxically, the noisiest unit in this simple formulation makes the

314: largest impact. All noiseless elements, even nonlinear, are deterministic,

315: and work as simple relays which transmit information from the previous node

316: to the next one. The output of the circuit is linked to the processes

317: controlling noise in neuron number $n$, rather that in any other neuron in

318: the network.

319:

320: One would be tempted to conclude that the non-linear element is actually the

321: decision maker in this case. We deduce that the non-linear element does

322: not have a causal effect on output from the circuit; therefore its role is

323: just to relay response from neuron $n$ to the output. In this respect the

324: non-linear element is not different from other noiseless elements.

325:

326: To link this example to our previous discussion (Figure 3B) we plot MI as a

327: function of position in the chain in Figure 4 (top). As we discussed, MI is

328: high for exit (`motor') units and low for input (`sensory') elements. Figure

329: 4 also shows the derivative of MI with respect to position in the chain. It

330: is clear that this derivative represents the decision making element. Thus,

331: we conclude that not correlation with the decision but the \textit{rate of change} of the latter

332: along the network is the indicator of DM.

333:

334: \begin{figure}[htbp]

335: \centerline{\includegraphics[width=3.0in]{dma4.eps}}

336: \caption{

337: The example of 'noisy' neuron (marked by asterisk). Top

338: panel, mutual information between given unit and the decision. Bottom panel,

339: derivative of mutual information. The derivative represents the

340: decision making unit in this case.

341: }

342: \label{fig4}

343: \end{figure}

344:

345:

346: \subsubsection{Example 2: Uniformly distributed noise.} Our next example shows that the conclusion about derivative of MI is

347: basically correct, but has to be slightly amended to be numerically precise.

348: Consider the chain in which all elements are noisy and the variance of noise

349: is the same for each element. In this case

350: \begin{equation}

351: \label{eq8}

352: x_N =\eta _1 +\eta _2 +...+\eta _{N-1} +\eta _N

353: \end{equation}

354: i.e. all units contribute to decision \textit{equally}. This is because Eq. (\ref{eq8}) does not

355: distinguish the order in which contributions from the units are added, and

356: all contributions are of equal strength on average. Can this conclusion be

357: confirmed by the derivative of MI?

358:

359: Figure 5A shows MI as a function of position in the chain for this case.

360: This dependence is obtained in Appendix A. It increases smoothly from 0 to 1

361: resulting in a non-zero derivative at all units. This is consistent with

362: (\ref{eq8}) and the notion that all units participate in the decision. However,

363: (\ref{eq8}) suggests that all units participate in decision \textit{equally}. The derivative of MI

364: turns out to be slightly non-uniform, as seen in Figure 5A. This can be

365: corrected if not MI itself but a non-linear function of MI, denoted $F(MI)$,

366: is considered. This non-linear function is calculated in Appendix A and is

367: shown in Figure 5B. The new correlator $F(MI)$ has the same basic properties

368: as the MI. It rises from 0 to 1 monotonously when passing through the array

369: (Figure 5C). But, in addition, its derivative turns out to be \textit{uniform}, as shown in

370: Figure 5C (bottom). This is consistent with equal participation of all units

371: in DM in the uniformly distributed noise case and Eq.~(\ref{eq8}). Thus, we

372: conclude that for this case the contributions to DM are given by the rate of

373: increase of $F(MI)$ when moving through the array

374: \begin{equation}

375: \label{eq9}

376: DM_i =F\left( {MI_i } \right)-F\left( {MI_{i-1} } \right).

377: \end{equation}

378: Here $i$ is the index along the chain. Eq.~(\ref{eq9}) is the main result of this

379: paper. It represents our definition of contributions to DM for networks of

380: simple connectivity, such as chains.

381:

382: Three points should be made about the definition (\ref{eq9}). First, it reproduces

383: the result obtained in the previous example of 'noisy' neuron. Indeed, the

384: mutual information rises from 0 to 1 on the 'noisy' neuron in Figure 4. But

385: $F(MI)$ coincides with MI at these values, as follows from its plot in

386: Figure 5B. Thus, the derivative of $F(MI)$ is also given by a single spike

387: at the position of 'noisy' neuron, as in Figure 4 (bottom). Second, Eq. (\ref{eq9})

388: implies that, from point of view of DM, not mutual information, but another

389: correlator, given by $F(MI)$, is more relevant. Function $F$ deviates from

390: linear function only slightly (Figure 5B), and for practical purposes the

391: distinction between the MI and $F(MI)$ could be ignored. However, we retain

392: it throughout the manuscript to ensure mathematical rigor. Third, when

393: deducing (\ref{eq9}) we did not postulate that contributions to DM are

394: proportional to the variance of noise. Instead, we suggested that Eq. (\ref{eq8})

395: implies that all units contribute equally, independently on the order in the

396: chain. This simple qualitative statement is powerful enough to constrain our

397: quantitative reasoning and lead to a measure of DM in form of function

398: $F(MI)$ and definition (\ref{eq9}). We do not know yet if the derivative of

399: $F(MI)$ is proportional to the variance of noise, square root of this

400: variance, or any other characteristic of noise in each element. All of these

401: parameters give the same results in the uniform noise case. We need to have

402: a difference between units to measure relative strength of their

403: contributions. This is achieved by the next example.

404:

405:

406: \begin{figure}[htbp]

407: \centerline{\includegraphics[width=3.0in]{dma5.eps}}

408: \centerline{\includegraphics[width=3.0in]{dma6.eps}}

409: \caption{

410: The example with uniformly distributed noise.

411: \textbf{A}, mutual information between response of given unit and the

412: decision. The dependence has a non-uniform increase, suggesting that mutual

413: information is not a good measure of decision making. \textbf{B}, if one

414: applies a non-linear function (solid curve) to the mutual information in

415: \textbf{A}, one obtains a uniformly increasing correlator in \textbf{C}.

416: This non-linear function, called $F(MI)$, is close to linear, shown by the

417: dotted line. \textbf{C}, the new correlator $F(MI)$ (top panel) has a

418: uniform derivative (bottom panel). Thus, derivative of $F(MI)$ is a sensible

419: measure of decision making in the case of uniform noise.

420: }

421: \label{fig5}

422: \end{figure}

423:

424:

425: \subsubsection{Example 3: `Loud' neuron.} In this example the variances of noise on all neurons are the same,

426: similarly to the previous case. However, here we amend the network

427: definition given by (\ref{eq1}). We do so for only one neuron. We assume that the

428: link between units 5 and 6 is characterized by a very large strength $K>>1$.

429: Thus, for neuron number 6 (Figure 6) instead of (\ref{eq1}) we have

430: \begin{equation}

431: \label{eq10}

432: x_6 =Kx_5 +\eta _6

433: \end{equation}

434: Therefore this example is the same as the previous, except that the single

435: network connection is changed. What are the DM units in this case?

436:

437: The network's output is given by

438: \begin{equation}

439: \label{eq11}

440: x_N =K(\eta _1 +\eta _2 +...+\eta _5 )+\eta _6 +...+\eta _{11}

441: \end{equation}

442: Thus, units 1 through 5 contribute equally to decision. In addition, their

443: contributions are multiplied by a large factor $K$. Units 6 through 11 also

444: contribute equally, but their contribution is much smaller than that of the

445: former group. We conclude that units 1 through 5 are much stronger

446: decision makers than units 6 through 11. This conclusion is supported by the

447: derivative of $F(MI)$, as shown in Figure 6 (bottom).

448:

449: \begin{figure}[htbp]

450: \centerline{\includegraphics[width=3.0in]{dma7.eps}}

451: \caption{

452: The `loud' neuron example. The link between units 5 an 6

453: is strengthened. Compare to Figure 5A.

454: }

455: \label{fig6}

456: \end{figure}

457:

458:

459: Thus, changing one link in the chain produces large effect on the

460: distribution of DM. The units downstream from the link contribute less to

461: decisions, while the units upstream contribute a lot. What is the measure of

462: decision making, which could differentiate these two types of units?

463:

464: Calculations in Appendix A show that derivative of $F(MI)$ is proportional

465: to $K^2$ for units 1 through 5. This is easy to understand qualitatively,

466: since MI increases along the chain even for negative ($K<0)$ links. This

467: is \textit{not} possible if contribution from units 1 to 5 are multiplied by $K$ for

468: example. Thus, an even power of $K$ is required, which is shown in Appendix

469: A to be $K^2$.

470:

471: \subsubsection{Alternative definition of DM.} So far we have used definition (\ref{eq9}), which is quite complex, since it

472: involves calculation of a nonlinear function $F(MI)$. Is it possible to

473: reproduce the results derived above in a simpler way? It turns out that the

474: role of given unit in DM is proportional to its contribution to the

475: variability of the output $\overline {x_N^2 } $. This leads us to an

476: alternative to (\ref{eq9}) definition of DM.

477:

478: Let us introduce the new definition using the examples, considered above.

479: From (\ref{eq11}) in the `loud' neuron case we derive

480: \begin{equation}

481: \label{eq12}

482: \overline {x_N^2 } =K^2(\overline {\eta _1^2 } +...+\overline {\eta _5^2 }

483: )+\overline {\eta _6^2 } +...+\overline {\eta _{11}^2 } .

484: \end{equation}

485: We could conjecture that the contributions to DM from different units are

486: weighted proportionally to the corresponding summands in (\ref{eq12}). Indeed, if

487: we assume

488: \[

489: DM_{1..5} =\overline {\eta _{1..5}^2 } K^2

490: \]

491: \begin{equation}

492: \label{eq13}

493: DM_{6..11} =\overline {\eta _{6..11}^2 } ,

494: \end{equation}

495: by choosing appropriate values of variance of noise and gain, we can

496: reproduce the results of all three of our previous examples. Thus, in the

497: case of 'noisy' neuron the variance of noise is only present in one unit,

498: rendering this unit decision maker, according to (\ref{eq13}). In the case of

499: uniform noise, when $K=1$ and all $\overline {\eta _{1..11}^2 } $ are the

500: same, (\ref{eq13}) gives uniform contributions to DM. In the case of `loud' neuron,

501: (\ref{eq13}) gives the correct factor $K^2$ describing the advantage of upstream

502: neurons. Thus, the contributions to DM are proportional to the variance of

503: noise on given element, multiplied by the square of the gain from this

504: element to the output. We can rewrite (\ref{eq13}) in a more compact form to

505: emphasize this latter statement

506: \begin{equation}

507: \label{eq14}

508: DM_i =\overline {\eta _i^2 } \frac{d\overline {x_N^2 } }{d\overline {\eta

509: _i^2 } }.

510: \end{equation}

511: One could verify (\ref{eq14}), by applying it to (\ref{eq12}) and obtaining

512: relationships (\ref{eq13}). This justifies (\ref{eq14}) in the three examples considered

513: above.

514:

515: Eq. (\ref{eq14}) also applies to linear chains in general. In Appendix A we derive

516: (\ref{eq14}) from previous definition (\ref{eq9}) for arbitrary distribution of

517: connection strengths and noise. Thus, (\ref{eq14}) can be considered an

518: alternative definition to (\ref{eq9}). The equivalence between (\ref{eq9}) and (\ref{eq14}) is

519: demonstrated graphically in Figure 7.

520:

521: Why should one consider an alternative definition? This is because (\ref{eq9})

522: cannot be applied to networks of arbitrary connectivity, such as circuits

523: containing loops. Definition (\ref{eq14}) however applies to all topologies,

524: including the linear chain examples, considered here.

525:

526: \begin{figure}[htbp]

527: \centerline{\includegraphics[width=3.0in]{dma8.eps}}

528: \caption{

529: Equivalence of two definitions. The top panel shows

530: distribution of noise variance (asterisk diameter) and of $F(MI)$ (bars).

531: The bottom panel displays the derivative of $F(MI)$, defined by (\ref{eq9}). The

532: derivative is numerically the same as the variance of noise. Both can be

533: used as measures of decision making.

534: }

535: \label{fig7}

536: \end{figure}

537:

538:

539: \subsection{Conclusions from `nematode' study}

540: \label{conclusions}

541:

542: Let us review our findings. First, we arrived to the definition of DM

543: activity using the information-theoretical approach (\ref{eq9}). According to this

544: definition, DM is the rate of change of correlation with decision along the

545: chain. In other words, the \textit{first} element or elements, which correlate with the

546: decision, are the decision makers. This approach has its pros and contras.

547: Indeed, the viewpoint expressed by (\ref{eq9}) has a potential to be transferred

548: to other systems, which contain non-linear elements. Eq. (\ref{eq9}) has an

549: information-theoretical origin; hence its applicability may be broader than

550: our simple system.

551: %We emphasize, however, that this point is not confirmed

552: %here, it will be further explored elsewhere.

553: Another advantage of (\ref{eq9}) is

554: that it relies on the characteristics measurable in single-electrode

555: recording experiments, such as response of single unit and its correlation

556: with behavioral decision. Thus, (\ref{eq9}) could be used experimentally. The

557: disadvantage of the information-theoretical approach is that it is not clear

558: how to apply it to the systems with loops, as we have mentioned above. Since

559: biological networks almost always contain loops this significantly limits

560: the applicability of information-theoretical formula (\ref{eq9}).

561:

562: Our second step was to derive an alternative definition (\ref{eq14}). The latter

563: is \textit{equivalent} to the former definition (\ref{eq9}) for linear-chain (`nematode') example, as

564: we have demonstrated on simple examples and have shown more rigorously in

565: Appendix A. The alternative definition (\ref{eq14}) can be understood on the basis

566: of the following two observations. First, the example of `noisy' neuron

567: shows that the variability is the source of decisions. Thus,

568:

569: \underline {\textbf{Conclusion 1:}} Under fixed other conditions, an

570: increase in variability and noise in a single unit leads to a larger

571: contribution to DM from this unit.

572: \begin{equation}

573: \label{eq15}

574: DM_i \sim \overline {\eta _i^2 }

575: \end{equation}

576:

577: Second, the example of `loud' neuron shows that not only variability and

578: noise are important but also how much of this variability reaches the motor

579: units. DM is hence a property of network connectivity too. Thus, we arrive

580: to the next rule

581:

582: \underline {\textbf{Conclusion 2:}} The stronger is the pathway from given

583: unit to the motor output, the larger is the contribution of this unit to DM.

584: \begin{equation}

585: \label{eq16}

586: DM_i \sim \frac{d\overline {x_N^2 } }{d\overline {\eta _i^2 } }

587: \end{equation}

588: These two rules are combined into the definition (\ref{eq14}). Although (\ref{eq14}) and

589: (\ref{eq16}) assume that the output element is unique, this requirement will be

590: removed below, when we consider arbitrary topology networks.

591:

592: What are the features of (\ref{eq14})? It could be used for an arbitrary topology

593: network, since it does not contain derivative along the chain, as (\ref{eq9})

594: does. Definition (\ref{eq14}) can also be used operationally to measure the

595: contribution of each neuron to the decision experimentally. To do that one

596: needs to vary noise at the given unit and measure the variability of the

597: responses. The details are discussed in section ~\ref{stimulation}

598: below.

599:

600: A special note should be made about normalization in (\ref{eq14}). Throughout this

601: work we adopt the convention that DM contributions are evaluated for all

602: units and then normalized proportionally to (\ref{eq14}), so that the total sum of

603: all contributions is equal to one (or 100{\%}). We will assume this to hold

604: below without explicitly mentioning. Finally, we give another

605: definition of DM contributions, which could be useful when noise in the

606: system is the same for all units. In this case the only difference between

607: units is due to difference in their position in the network. We therefore

608: call such quantity \textit{topological} DM.

609: \begin{equation}

610: \label{eq17}

611: TDM_i =\frac{\partial \sigma ^2(x_N )}{\partial \overline {\eta _i^2 } }

612: \end{equation}

613: As seen from e.g. (\ref{eq12}) it does not depend on the levels of noise, and can

614: be obtained from (\ref{eq14}) by assuming that $\overline {\eta _i^2 } =1$ for all

615: units. It therefore describes how strongly each elements of the circuit

616: affects the output. This quantity is sometimes helpful in describing the

617: network's topology.

618:

619: Lastly, we discuss the notion of noise and variability in our approach. Is

620: this really noise, which leads networks to decisions? Not necessarily.

621: Imagine that we have studied a chain-like network (Figure 8A) and performed

622: the DM analysis, described above. We found that the network contains two

623: decision makers, which are equally important. A more thorough investigation

624: may suggest that these units are inputs from external network, which in

625: effect is responsible for DM. For example, these hidden pathways may be

626: inputs from other sensory modalities or regulatory inputs of other type.

627: Thus, DMA may help identify entry points from other, less studied, parts of

628: the network.

629:

630: \begin{figure}[htbp]

631: \centerline{\includegraphics[width=4.0in]{dma10.eps}}

632: \caption{

633: Hidden pathway. The intensity of red shows

634: contribution to decision making for each unit. \textbf{A,} analysis for an

635: incomplete connectivity reveals two decision makers. \textbf{B,} a more

636: thorough study may show that this results from other inputs to the network.

637: }

638: \label{fig8}

639: \end{figure}

640:

641:

642: \subsection{Trees}

643: \label{trees}

644:

645: Our studies indicate that information-theoretical analysis [definition (\ref{eq9})] can be

646: further extended to tree-like topologies (Figure 9). To this end we define

647: column-vector

648: $\mathord{\buildrel{\lower3pt\hbox{$\scriptscriptstyle\rightharpoonup$}}\over

649: {f}}$ such that $f_i =F(MI_i )$. Then (\ref{eq9}) is equivalent to

650: \begin{equation}

651: \label{eq18}

652: \overrightarrow {DM} =(\hat {I}-\hat {S})\vec {f}.

653: \end{equation}

654: Here $\hat {S}$ is the structure matrix defined as follows. An element

655: $S_{ij} $ of the structure matrix is equal to 1 if there is a connection

656: from unit number $i$ to $j$. Matrix $\hat {I}-\hat {S}$ thus implements

657: evaluating differences between connected elements in (\ref{eq9}). Structure matrix

658: is related to connectivity matrix, containing network's weights through

659: $S_{ij} =\left| {sign(C_{ij} )} \right|$. Connectivity matrices for some

660: networks are shown in Figures 9 and 10.

661:

662: \begin{figure}[htbp]

663: \centerline{\includegraphics[width=4.0in]{dma11.eps}}

664: \caption{

665: Mutual information approach can be extended to

666: connectivities other than linear chains (\textbf{A}). Thus, decision makers

667: on trees (\textbf{B}) can also be found. Arbitrary network can be specified

668: by connectivity matrices, which are provided for illustration purposes. The

669: non-zero entries in a connectivity matrix indicate a connection between two

670: elements numbered on the left. An entry value describes the strength of

671: connection and does not have to be unitary or positive.

672: }

673: \label{fig9}

674: \end{figure}

675:

676:

677: Information-theoretical approach can be even further extended on the cases, when signals

678: propagate along the network in time, therefore resulting in delays between

679: signal and response. In this case by

680: $\mathord{\buildrel{\lower3pt\hbox{$\scriptscriptstyle\rightharpoonup$}}\over

681: {f}} $ one should understand a sum of correlations over all times preceding

682: the decision. This compensates for the presence of delays. So far there is

683: no understanding if Eq.~(\ref{eq18}) [or (\ref{eq9})] can be used for topologies other

684: than trees. Definition (\ref{eq14}), however, can be used with networks of

685: arbitrary connectivity. This is the topic of the next section.

686:

687: \section{DYNAMIC MODELS}

688: \label{dynamic}

689:

690: All previous examples, except the one mentioned at the end of the last

691: session, were static, i.e. variables did not depend on time. The deficiency

692: of this approach is that it is not clear how to treat networks with loops.

693: To apply our analysis to the cases with loops, and, in general, to

694: networks with \textit{arbitrary} connectivity (Figure 10), we consider time-dependent models

695: here. This allows us to observe propagation of noise around the loop

696: explicitly and to make accurate conclusions about contributions to DM.

697:

698: We limit ourselves to linear dynamical systems, where the single nonlinear

699: element is the last one, transforming an analog system output to a binary

700: response. As the first step we consider temporal dynamics in the

701: discrete-time approximation, which contains all essential features of our

702: approach. Later in the section we extend discrete model to the

703: continuous-time case and show their equivalence.

704:

705: \begin{figure}[htbp]

706: \centerline{\includegraphics[width=4.0in]{dma12.eps}}

707: \caption{

708: The network topologies considered by the dynamic models.

709: Arbitrary connectivities, such as cycles (left) or feedforward networks

710: (right) can be considered.

711: }

712: \label{fig10}

713: \end{figure}

714:

715:

716: \subsection{Discrete-time model}

717: \label{discrete}

718:

719: In this section we consider a system of $N$ elements, whose activity at each

720: instant is described by an $N$-dimensional column-vector $\vec {x}(t)$. Time

721: has discrete values separated by an interval $\tau $. Therefore this model

722: is called the discrete-time model. The values of activity at two neighboring

723: time-slices are related by the connection matrix $\hat {C}$

724: \begin{equation}

725: \label{eq19}

726: \vec {x}(t+\tau )=\hat {C}\vec {x}(t)+\vec {\eta }(t)+\vec {s}(t)

727: \end{equation}

728: Here $\vec {\eta }(t)$ is the vector describing noise added to activity

729: vector on each time-slice. The variable $\vec {s}(t)$ describes sensory

730: input into the system. The rules of temporal evolution of activities

731: described by this equation are general enough to include almost all

732: interesting phenomena and mimic modeling of real systems on digital

733: computers. In appendix B we will prove that this model is equivalent to systems with

734: continuously defined time.

735:

736: Noise is specified by the parameter $\vec {\eta }(t)$, which has a zero mean

737: and is defined by the correlation matrix

738: \begin{equation}

739: \label{eq20}

740: \overline {\eta _i (t_1 )\eta _j (t_2 )} ={\cal N}_{ij} \delta _{t_1 ,t_2 }

741: \end{equation}

742: We assume here that neighboring in time values on noise are not correlated,

743: implying that we consider a system with white noise. This assumption can be

744: easily relaxed and is used here to simplify the analysis. It becomes

745: rigorously valid when time-interval $\tau $ is longer than the correlation

746: time of noise. Further, if noise is specific to each neuron, the same-time

747: correlation matrix $\hat {{\cal N}}$ is diagonal

748: \begin{equation}

749: \label{eq21}

750: {\cal N}_{ij} =\overline {\eta _i^2 } \delta _{ij}

751: \end{equation}

752: This takes place i.e. when stochasticity is induced by probabilistic nature

753: of synaptic vesicle release, in which case every two neurons receive

754: uncorrelated fluctuating inputs.

755:

756: Some time after presentation of the stimulus [$\vec {s}(t)\ne 0$] the system

757: is forced to make a decision through the following process. First, a scalar

758: quantity

759: \begin{equation}

760: \label{eq22}

761: y=\vec {v}^T\cdot \vec {x}(t)

762: \end{equation}

763: is evaluated. Here time corresponds to the instant, when the choice is to be

764: made. The output metrics vector $\vec {v}$ describes the way in which

765: system's activity affects motor response. In the simplest case, which was

766: considered in the previous section, when a single element number $n$ evokes

767: responses, $v_i =\delta _{in} $. In a more complex situation, when multiple

768: areas/neurons have direct influence on decision, vector $\vec {v}$ has more

769: than one non-zero element. On the second step, decision is made based on the

770: sign of $y$

771: \begin{equation}

772: \label{eq23}

773: d=H(y)

774: \end{equation}

775: Thus, this model describes a two-alternative forced-choice task.

776:

777: Our system is completely defined by the following set of parameters: $\hat

778: {C}$, $\vec {s}(t)$, $\hat {{\cal N}}$, and $\vec {v}$. As we have shown in

779: the previous section, the presence of the stimulus is not required to define

780: DM elements [Eq. (\ref{eq5})]. We therefore set $\vec {s}(t)$ to zero and are left

781: with three parameters $\hat {C}$, $\hat {{\cal N}}$, and $\vec {v}$. We now

782: are ready to determine DM elements in our simple model.

783:

784: To find decision makers we will use Eq. (\ref{eq14}). In this case it becomes

785: \begin{equation}

786: \label{eq24}

787: DM_i ={\cal N}_{ii} \frac{\partial \sigma ^2(y)}{\partial {\cal N}_{ii} }

788: \end{equation}

789: Therefore, we need to evaluate the variability on the output from the system

790: $\sigma ^2(y)$. This is accomplished if we notice that $y=\vec {v}^T\cdot

791: \vec {x}(t)=\vec {x}^T(t)\cdot \vec {v}$ and

792: \begin{equation}

793: \label{eq25}

794: \sigma ^2(y)=\vec {v}^T\cdot \overline {\vec {x}(t)\vec {x}^T(t)} \cdot \vec

795: {v}=\vec {v}^T\hat {X}(t,t)\vec {v}

796: \end{equation}

797: Here we introduced the cross-correlation matrix defined as follows

798: \begin{equation}

799: \label{eq26}

800: \hat {X}(n,k)\equiv \overline {\vec {x}(n)\vec {x}^T(k)} =\overline {\left(

801: {{\begin{array}{*{20}c}

802:  {x_1 } \hfill \\

803:  \vdots \hfill \\

804:  {x_N } \hfill \\

805: \end{array} }} \right)\left( {{\begin{array}{*{20}c}

806:  {x_1 } \hfill & \cdots \hfill & {x_N } \hfill \\

807: \end{array} }} \right)}

808: \end{equation}

809: We replace here the time variable by the integers, specifying the time-slice

810: number. The averaging in (\ref{eq25}) and (\ref{eq26}) is assumed over different

811: instantiations of noise (trials).

812:

813: Due to the properties of noise in our model, this correlator does not depend

814: on the absolute values of time ($n$ and $k)$, but only on the difference ($n - k)$. As

815: follows from (\ref{eq25}), of particular interest is the same-time correlator $\hat

816: {X}_0 \equiv \hat {X}(n,n)$, which determines fluctuations in $y$. We now

817: derive equation for same-time correlator $\hat {X}_0 $.

818:

819: Using (\ref{eq19}) we obtain

820: \begin{equation}

821: \label{eq27}

822: \begin{array}{l}

823:  \hat {X}_0 =\overline {\vec {x}(n+1)\vec {x}^T(n+1)} = \\

824:  =\overline {\left[ {\hat {C}\vec {x}(n)+\vec {\eta }(n)} \right]\left[

825: {\vec {x}^T(n)\hat {C}^T+\vec {\eta }^T(n)} \right]} \\

826:  \end{array}

827: \end{equation}

828: We then notice that the correlator $\overline {\vec {x}(n)\vec {\eta }^T(n)}

829: $ is identically zero, since $\vec {x}(n)$ is a linear combination of values

830: of noise at times $k<n$ [see Eq. (\ref{eq19})]. We thus deduce from Eq. (\ref{eq27}) that

831: \[

832: \hat {X}_0 =\overline {\hat {C}\vec {x}(n)\vec {x}^T(n)C^T} +\overline {\vec

833: {\eta }(n)\vec {\eta }^T(n)} ,

834: \]

835: which leads us, finally, to

836: \begin{equation}

837: \label{eq28}

838: \hat {X}_0 -\hat {C}\hat {X}_0 \hat {C}^T=\hat {{\cal N}}

839: \end{equation}

840: This equation allows us to determine the same-time correlator $\hat {X}_0 $

841: from connectivity and noise cross-correlogram, defined in (\ref{eq20}), which is a

842: diagonal matrix.

843:

844: We would like to pause here and describe the properties of this equation.

845: First of all, in the most generic case (\ref{eq28}) allows us to determine $\hat

846: {X}_0 $ from $\hat {C}$ and $\hat {{\cal N}}$ uniquely. Indeed, (\ref{eq28}) is a

847: system of $N^2$ linear equations for $N^2$ unknowns $\hat {X}_0 $, arranged

848: in the matrix form. Hence, this system, in most cases, can be solved

849: uniquely. On the other hand, with one exception, $\hat {X}_0 $ cannot

850: be expressed explicitly in terms of matrices $\hat {C}$ and $\hat {{\cal

851: N}}$. Thus, one has to either appeal to the representation of $\hat {X}_0 $

852: in terms of eigenvectors and eigenvalues of $\hat {C}$, or use computer to

853: arrange elements of matrix $\hat {X}_0 $ in vector form and solve resulting

854: linear system.

855:

856: The contribution to DM from a given element can be determined from Eq. (\ref{eq25})

857: \begin{equation}

858: \label{eq29}

859: DM_i ={\cal N}_{ii} \frac{\partial \sigma ^2(y)}{\partial {\cal N}_{ii}

860: }=\vec {v}^T\frac{\partial \hat {X}_0 }{\partial \ln {\cal N}_{ii} }\vec

861: {v}

862: \end{equation}

863: The topological DM contributions are

864: \begin{equation}

865: \label{eq30}

866: TDM_i =\vec {v}^T\frac{\partial \hat {X}_0 }{\partial {\cal N}_{ii} }\vec

867: {v}

868: \end{equation}

869: Using Eqs. (\ref{eq28}) and (\ref{eq29}) one can analyze a variety of network

870: connectivities. Some new effect emerging for non-tree systems are described

871: next.

872:

873: \subsection{Case 1: fan-out hub effect}

874: \label{fanout}

875:

876: We now consider network shown in Figure 11A, in which all elements have the

877: same variance of noise and all connections have unitary strength. Figure 11A

878: shows two pathways from unit \textbf{2} to the exit unit, \textbf{6}. The

879: resulting network gain from unit \textbf{2} to unit \textbf{6} is thus equal

880: to two. All other units' gain at the exit is one. The contribution to DM

881: from unit \textbf{2 }is thus four times larger that from other units. This

882: is because noise at this unit is multiplied by a factor of two, and the

883: variance of noise, by a factor of four. We conclude that there may be some

884: special elements in network, which occupy hub-like positions, gaining large

885: influence due to abundance of their outputs. It should be noted that fan-in

886: hubs are not special from the point of view of DM in any way.

887:

888:

889:

890: \begin{figure}[htbp]

891: \centerline{\includegraphics[width=4.0in]{dma13.eps}}

892: %\centerline{\includegraphics[width=1.6in]{dma14.eps}}

893: \caption{

894: Two cases, in which the identities of decision makers

895: can be found using discrete-time approach. Variance of noise on all elements

896: is the same; all network links have unitary strength. The degree of

897: decision making is shown by the intensity of red. \textbf{A}, The fan-out

898: effect. \textbf{B}, the temporal integrator.

899: }

900: \label{fig11}

901: \end{figure}

902:

903:

904: \subsection{Case 2: temporal integrator}

905: \label{integrator}

906:

907: Let us now examine the network with a loop. Figure 11B shows such an example

908: with unitary link strength and uniform noise variance, as in previous case.

909: The presence of loop affects DM drastically: our discrete-time model marks

910: units belonging to the loop as decision makers\footnote{ Rigorously

911: speaking, the set of equations (\ref{eq28}) and (\ref{eq29}) does not have a valid

912: solution for the loop with all connection equal to unity. One needs to set

913: one of the connection as a parameter, $\alpha <1$, solve the equations, and

914: consider the limit $\alpha \to 1 .$ }. This is easy to understand, since

915: noise, generated by each unit on each time-step, cannot leave the loop and,

916: therefore, builds up there without limits. Therefore the variance of noise

917: in the output of element number three grows proportionally to time

918: $\overline {\left[ {x_3 (t)-\overline {x_3 } (t)} \right]^2} =\overline

919: {\eta ^2} t\to \infty $. Here averaging is assumed over instantiations of

920: noise (trials). Thus, loop becomes the crucial decision maker. This case is

921: somewhat analogous to our previous `noisy' neuron example.

922:

923: What is the possible role of loops in biological networks? Why would one

924: introduce such unreliable components? Loops, similar to shown in Figure 11B,

925: have many useful properties. For instance, they can act as parametric memory

926: systems. Indeed, imagine that responses of all units in the loop have the

927: same values, equal to $x$. This could be accomplished by manipulating the

928: sensory inputs. Assume that no more inputs are received from the outside of

929: the system. It follows that, in the absence of noise on each element, this

930: value of response will reverberate around the loop forever. This is because

931: all links have unitary strength. Loops can thus memorize a graded value,

932: such as $x$, functioning as parametric memory elements.

933:

934: Suppose, in addition, that a non-zero input $s$ is applied to element number

935: \textbf{1} at all times. Since this element acts as a summator, its response

936: on the next step is $x_1 (1)=x+s$. The signal $s$ propagates around the

937: loop, and in four steps it reaches the first element again, at which time

938: its response is $x_1 (5)=x+2s$. In four more steps $x_1 (9)=x+3s$. Thus, not

939: only noise, but also signal can build up in the system. Therefore, a loop

940: can operate as a temporal integrator. The integration is not perfect if one

941: of the links has a non-unitary strength, in which case integrator becomes

942: leaky (Robinson, 1989).

943:

944: Temporal integrators play special role in DM, since they act as accumulators

945: of sensory information, which puts them into special position with respect

946: to other areas (Gold and Shadlen, 2002). As an example,

947: such is area LIP in primate visual cortex, which is involved in DM in

948: direction-discrimination task

949: (Shadlen and Newsome, 2001; Roitman and Shadlen, 2002; Mazurek et al., 2003).

950:

951: \subsection{Continuous-time model}

952: \label{continuous}

953:

954: We finally consider a model, in which time runs continuously. This model has

955: potential relevance to real-life networks. The responses of units satisfy

956: the following equation

957: \begin{equation}

958: \label{eq31}

959: \frac{d\vec {x}(t)}{dt}=-\hat {A}\vec {x}(t)+\vec {\eta }(t)+\vec {s}(t).

960: \end{equation}

961: The network connectivity matrix $\hat {A}$ can be related to connection

962: matrix from the discrete-time model in (\ref{eq19}) through $\hat {C}=e^{-\hat

963: {A}\tau }$ (see Appendix B). Noise is defined by its cross-correlation

964: \begin{equation}

965: \label{eq32}

966: \overline {\eta _i (t_1 )\eta _j (t_2 )} ={\cal N}_{ij} \delta (t_1 -t_2 ),

967: \end{equation}

968: where

969: \begin{equation}

970: \label{eq33}

971: \hat {{\cal N}}=\left( {{\begin{array}{*{20}c}

972:  {\overline {\eta _1^2 } } \hfill & \hfill & \hfill & 0 \hfill \\

973:  \hfill & {\overline {\eta _2^2 } } \hfill & \hfill & \hfill \\

974:  \hfill & \hfill & {...} \hfill & \hfill \\

975:  0 \hfill & \hfill & \hfill & {\overline {\eta _N^2 } } \hfill \\

976: \end{array} }} \right)

977: \end{equation}

978: is a diagonal cross-correlogram of noise. Eqs. (\ref{eq31})-(\ref{eq33}) are analogous

979: to the discrete-time case (\ref{eq19})-(\ref{eq21}). Similarly, we define the output

980: scalar and the decision variable

981: \[

982: y=\vec {v}^T \cdot \vec {x}(t)

983: \]

984: \begin{equation}

985: \label{eq34}

986: d=H(y).

987: \end{equation}

988: Here $t$ is time when the system makes the decision.

989:

990: Our model is thus defined by Eqs. (\ref{eq31})-(\ref{eq34}). We will now use definition

991: (\ref{eq29}) to find decision makers. As in discrete-time case we need to know the

992: variance of the output variable, $\sigma ^2(y)$, after which (\ref{eq29}) leads to

993: \begin{equation}

994: \label{eq35}

995: DM_i =\overline {\eta _i^2 } \frac{\partial \sigma ^2(y)}{\partial \overline

996: {\eta _i^2 } }

997: \end{equation}

998: Important for us is the time-dependent correlator

999: \begin{equation}

1000: \label{eq36}

1001: \hat {X}(t_1 ,t_2 )=\overline {\vec {x}(t_1 )\vec {x}^T(t_2 )} ,

1002: \end{equation}

1003: which we now evaluate. Solution of (\ref{eq31}) is obtained using matrix

1004: exponentials

1005: \begin{equation}

1006: \label{eq37}

1007: \vec {x}(t)=\int\limits_{-\infty }^t {dt'e^{\hat {A}(t'-t)}\left[ {\vec

1008: {\eta }(t)+\vec {s}(t)} \right]}

1009: \end{equation}

1010: If external stimulus is zero or a constant in time, due to (\ref{eq5}), the

1011: correlator at $t_1 >t_2 $

1012: \begin{equation}

1013: \label{eq38}

1014: \hat {X}(t_1 ,t_2 )=\int\limits_{-\infty }^{t_2 } {dt'e^{\hat {A}(t'-t_1

1015: )}\hat {{\cal N}}e^{\hat {A}^T(t'-t_2 )}}

1016: \end{equation}

1017: We seek $\hat {X}(t_1 ,t_2 )$ in the form

1018: \begin{equation}

1019: \label{eq39}

1020: \hat {X}=e^{\hat {A}(t_2 -t_1 )}\hat {X}_0 ,

1021: \end{equation}

1022: where $\hat {X}_0 $ is equal-time cross-correlation. To find equation for

1023: $\hat {X}_0 $ we differentiate (\ref{eq38}) as follows

1024: \begin{equation}

1025: \label{eq40}

1026: \begin{array}{l}

1027:  \frac{\partial \hat {X}}{\partial t_2 }=\hat {A}e^{\hat {A}(t_2 -t_1 )}\hat

1028: {X}_0 = \\

1029:  =e^{\hat {A}(t_2 -t_1 )}\hat {{\cal N}}-\hat {X}\hat {A}^T \\

1030:  \end{array}

1031: \end{equation}

1032: We arrive thus to the following equation for $\hat {X}_0 $

1033: \begin{equation}

1034: \label{eq41}

1035: \hat {A}\hat {X}_0 +\hat {X}_0 \hat {A}^T=\hat {{\cal N}}

1036: \end{equation}

1037: This equation is the central tool for the continuous-time theory. The

1038: contributions to DM from each unit are found by differentiating $\sigma

1039: ^2(y)=\vec {v}^T\hat {X}_0 \vec {v}$ with respect to noise, as in Eq. (\ref{eq29})

1040: \begin{equation}

1041: \label{eq42}

1042: DM_i ={\cal N}_{ii} \vec {v}^T\frac{\partial \hat {X}_0 }{\partial {\cal

1043: N}_{ii} }\vec {v}

1044: \end{equation}

1045: Once the same-time correlation matrix $\hat {X}_0 $ is found from Eq.

1046: (\ref{eq41}), cross-correlation for arbitrary time is

1047: \begin{equation}

1048: \label{eq43}

1049: \hat {X}(t_1 ,t_2 )=\left\{ {{\begin{array}{*{20}c}

1050:  {e^{\hat {A}(t_2 -t_1 )}\hat {X}_0 ,} \hfill & {t_1 \ge t_2 } \hfill \\

1051:  {\hat {X}_0 e^{\hat {A}^T(t_1 -t_2 )},} \hfill & {t_1 <t_2 } \hfill \\

1052: \end{array} }} \right.

1053: \end{equation}

1054: This equation suggests a helpful strategy for determining noise matrix $\hat

1055: {{\cal N}}$. Indeed, (\ref{eq41}) and (\ref{eq43}) imply that

1056: \begin{equation}

1057: \label{eq44}

1058: \hat {{\cal N}}=\left. {\frac{\partial \hat {X}(t_1 ,t_2 )}{\partial t_1 }}

1059: \right|_{t_1 =t_2 -\varepsilon } \left. {-\frac{\partial \hat {X}(t_1 ,t_2

1060: )}{\partial t_1 }} \right|_{t_1 =t_2 +\varepsilon }

1061: \end{equation}

1062: Here $\varepsilon $ is infinitesimally small positive number. In other

1063: words, noise matrix is equal to discontinuity in time-derivative of

1064: cross-correlation at $t_1 =t_2 $. Since noise correlation matrix is

1065: diagonal, the non-zero elements are

1066: \begin{equation}

1067: \label{eq45}

1068: \overline {\eta _i^2 } =\left. {\frac{\partial X_{ii} (t_1 ,t_2 )}{\partial

1069: t_1 }} \right|_{t_1 =t_2 -\varepsilon } \left. {-\frac{\partial X_{ii} (t_1

1070: ,t_2 )}{\partial t_1 }} \right|_{t_1 =t_2 +\varepsilon }

1071: \end{equation}

1072: Two comments are in order here. First, noise term $\vec {\eta }(t)$ plays

1073: the role of input noise in (\ref{eq31}). It cannot be measured directly. Equation

1074: (\ref{eq44}) provides a way to single it out. Second, (\ref{eq44}) does not apply to the

1075: discrete-time model. Indeed, in the latter we either have $t_1 =t_2 $, or

1076: $t_1 =t_2 \pm 1$, etc., i.e. the condition $t_1 =t_2 \pm \varepsilon $ with

1077: $\varepsilon $ infinitesimally small is hard to enforce. It may happen that

1078: $\varepsilon \approx 1$ is acceptable due to presence of slow components in

1079: the circuit, such as temporal integrators. However, in general case (\ref{eq44})

1080: cannot be applied to the discrete-time case. For instance, it fails

1081: dramatically for the case of `nematode' chain considered above.

1082:

1083: Equations (\ref{eq41}), (\ref{eq42}), and (\ref{eq44}) represent a useful set of tools to find

1084: DM components for various connectivities. We present here two possible

1085: cases, in which decision makers can be found. They differ in what is known

1086: about the system.

1087:

1088: \underline {\textbf{Scenario 1:}} Assume we know the network connectivity

1089: $\hat {A}$, output metrics vector $\vec {v}$, and autocorrelation for each

1090: unit $X_{ii} (t_1 ,t_2 )$. The steps below allow finding the

1091: decision makers.

1092:

1093: \begin{enumerate}

1094: \item Since noise matrix is diagonal, as per (\ref{eq33}), it can be found from autocorrelation using (\ref{eq44}).

1095: \item Solving (\ref{eq41}) allows determining $\partial \hat {X}_0 /\partial {\cal N}{ }_{ii}$, the derivative of equal-time crosscorrelation with respect to noise in each element.

1096: \item Decision makers are found from (\ref{eq42}).

1097: \item Normalize contributions to DM so that $\sum\limits_i {DM_i =1} $.

1098: \end{enumerate}

1099:

1100: Scenario 1 does not require simultaneous measurements from all units. It

1101: requires the knowledge of the network connectivity however. The next

1102: scenario is complimentary in this respect.

1103:

1104: \underline {\textbf{Scenario 2:}} Suppose we have measured the full

1105: crosscorrelation matrix $\hat {X}(t_1 ,t_2 )$ by simultaneous recordings

1106: from all units. Suppose also that we know how the output of the system is

1107: evaluated (vector $\vec {v})$. These are the steps to determine DM units.

1108:

1109: \begin{enumerate}

1110: \item Use (\ref{eq44}) to find noise matrix $\hat {{\cal N}}$.

1111: \item Use (\ref{eq41}) to find the connection matrix $\hat {A}$.

1112: \item Solve (\ref{eq41}) to calculate $\partial \hat {X}_0 /\partial {\cal N}{ }_{ii}$ for each element.

1113: \item Use (\ref{eq42}) to find decision makers.

1114: \item Normalize contributions to DM so that $\sum\limits_i {DM_i =1} $.

1115: \end{enumerate}

1116:

1117: Both scenarios use extensive knowledge about the system, which renders them

1118: useless in experimental conditions. In the next subsection we discuss a way

1119: to bypass these limitations.

1120:

1121: Finally, we would like to provide solution to (\ref{eq41}) using eigenbasis of

1122: matrix $\hat {A}$. Since $\hat {A}$ is not necessarily symmetric, a

1123: distinction should be made between right and left eigenvectors. The latter

1124: turn out to be useful for our purposes. They are defined by

1125: \begin{equation}

1126: \label{eq46}

1127: \vec {\xi }_\alpha ^+ \hat {A}=\lambda _\alpha \vec {\xi }_\alpha ^+ .

1128: \end{equation}

1129: Here and below Greek indexes denote numbers of eigenvalues, while Latin ones

1130: label spatial components of vectors and matrices. Solution of (\ref{eq41}) is

1131:

1132: \begin{equation}

1133: \label{eq46_5}

1134: %X_{0ij} =\sum\limits_{{\begin{array}{*{20}c}

1135: % {\alpha \beta \gamma \delta } \hfill \\

1136: % {mn} \hfill \\

1137: %\end{array} }} {\frac{\xi _{i\alpha } \xi _{j\beta }^\ast \xi _{m\gamma

1138: %}^\ast \xi _{n\delta } }{\lambda _\gamma +\lambda _\delta ^\ast }} \left(

1139: %{G^{-1}} \right)_{\alpha \gamma } \left( {G^{-1}} \right)_{\beta \delta

1140: %}^\ast {\cal N}_{mn} ,

1141: X_{0ij} =\sum\limits_{\alpha \beta \gamma \delta \\ \\mn} {\frac{\xi _{i\alpha } \xi _{j\beta }^\ast \xi _{m\gamma

1142: }^\ast \xi _{n\delta } }{\lambda _\gamma +\lambda _\delta ^\ast }} \left(

1143: {G^{-1}} \right)_{\alpha \gamma } \left( {G^{-1}} \right)_{\beta \delta

1144: }^\ast {\cal N}_{mn} ,

1145: \end{equation}

1146: where

1147: \begin{equation}

1148: \label{eq47}

1149: G_{\alpha \beta } =\sum\limits_i {\xi _{i\alpha }^\ast \xi _{i\beta } }

1150: \end{equation}

1151: is the Gram matrix of eigenvectors. Eq. (\ref{eq46_5}) is valid if the eigenvectors form

1152: a complete basis in the $N$-dimensional space. As follows from (\ref{eq46_5}), eigenvalues of

1153: $\hat {A}$ with small real part contribute to DM in a large degree. This

1154: justifies the use of principal component analysis when such eigenvalues are

1155: present. An example of such principal component is the temporal integrator

1156: loop in Figure 11B, which has vanishing $\lambda $.

1157:

1158: In case if matrix $\hat {A}$ is symmetric, its eigenvalues are real and

1159: eigenvectors are orthogonal. This leads to a unit Gram matrix. Then, Eq.

1160: (3.29) becomes more compact

1161: \begin{equation}

1162: \label{eq48}

1163: X_{0ij} =\sum\limits_{

1164:  {\alpha \beta }

1165:  {mn} } {\frac{\xi _{i\alpha } \xi _{j\beta }^\ast \xi _{m\alpha

1166: }^\ast \xi _{n\beta } }{\lambda _\alpha +\lambda _\beta }} {\cal N}_{mn}

1167: \end{equation}

1168: Similar equations, called Kubo formulas, are obtained for various

1169: correlators in case of diffusion of particles in random media

1170: (Efetov, 1997). The distinguishing feature of (\ref{eq48}) is that

1171: a product of four eigenvectors enters the expression. Thus, propagation of

1172: noise in this case can be accompanied by interference between different

1173: pathways. An example of destructive interference of this kind is given

1174: below, in section \ref{stimulation}.

1175:

1176: Eq. (\ref{eq48}) can be further simplified. Indeed, our model uses diagonal noise

1177: matrices, i.e. $n=m$ in (\ref{eq48}). Suppose also that the output from the

1178: network occurs through one exit element number $i$, which is specified by

1179: taking $\vec {v}=\hat {e}_i $. In this case the use of Eq. (\ref{eq42}) gives

1180: \begin{equation}

1181: \label{eq49}

1182: DM_i ={\cal N}_{nn} \sum\limits_{\alpha \beta } {\frac{\xi _{i\alpha } \xi

1183: _{i\beta }^\ast \xi _{n\alpha }^\ast \xi _{n\beta } }{\lambda _\alpha

1184: +\lambda _\beta }} .

1185: \end{equation}

1186: From this equation we conclude that for element $n$ to contribute to DM, an

1187: eigenvector should exist, which is non-zero on both unit number $n$ and exit

1188: unit $i$. Thus, we conclude that eigenvectors of $\hat {A}$ should be

1189: delocalized for broader impact of elements on the decision. This is not

1190: surprising in view of the mentioned analogy with the diffusion problem. In

1191: case if matrix $\hat {A}$ is not symmetric, the Gram matrix may be non-diagonal

1192: and (\ref{eq49}) cannot be used. However, the off-diagonal elements of $\hat {G}$

1193: are usually smaller than diagonal ones, due to uncorrelated sign changes,

1194: when (\ref{eq47}) is computed with $\alpha \ne \beta $. Therefore, (\ref{eq49}) may

1195: apply approximately.

1196:

1197: \section{Analysis using stimulation}

1198: \label{stimulation}

1199:

1200: Stimulations with electric current add a new degree of freedom to DMA, thus

1201: leading to more effective ways of finding decision makers. There are two

1202: great advantages of the stimulation method. First, it only involves

1203: stimulation of a single neuron, therefore no simultaneous multiple-electrode

1204: measurements are required. Second, the knowledge of network connectivity is

1205: not needed to solve the problem. In this section we study our simple

1206: networks and find what stimulation strategies are consistent with our

1207: earlier definitions, such as Eq. (\ref{eq14}).

1208:

1209: We will use continuous model for concreteness (section \ref{continuous}). Consider the

1210: output variable $y$. It is a linear function of the inputs. It is also a

1211: function, which contains noise components, variable from trial to trial. The

1212: noise components were acquired from all units in different degree. Since

1213: noise in each unit is gaussian, the output variable is described by gaussian

1214: distribution too

1215: \begin{equation}

1216: \label{eq50}

1217: \rho (y)=\frac{1}{\sqrt {2\pi \sigma ^2(y)} }

1218: e^{-(y-\bar {y}(s))^2/2\sigma ^2(y)}

1219: \end{equation}

1220: In each trial a random value of $y$ is obtained, according to distribution

1221: (\ref{eq50}). The response of the system is equal to 1 if $y$ is positive, and

1222: 0 otherwise. The probability to obtain response equal to 1 to given

1223: stimulus $s$ is given by the error function (Abramowitz and

1224: Stegun, 1972)

1225: \begin{equation}

1226: \label{eq50_5}

1227: p_1 (s)=\int\limits_0^\infty {\rho (y)dy} =\frac{1}{2}\left[

1228: {1+\mbox{erf}\left( {\frac{\bar {y}(s)}{\sigma (y)\sqrt 2 }} \right)}

1229: \right],

1230: \end{equation}

1231: whereas the probability of zero response is

1232: \begin{equation}

1233: \label{eq51}

1234: p_0 (s)=\int\limits_{-\infty }^0 {\rho (y)dy} =\frac{1}{2}\left[

1235: {1-\mbox{erf}\left( {\frac{\bar {y}(s)}{\sigma (y)\sqrt 2 }} \right)} \right].

1236: \end{equation}

1237: Both probabilities depend upon the mean response to stimulus $\bar {y}(s)$

1238: and the standard deviation $\sigma (y)$. Therefore the electric stimulation

1239: strategies may be based on affecting either the former or the latter. We now

1240: consider both of these strategies and show that affecting the mean response

1241: may provide misleading results, while changing the variance of response

1242: allows estimating contributions to DM consistently with our previous

1243: definitions. Thus, strategies of stimulation based on standard deviation of

1244: the output variable are \textit{always} correct in our simple model, independently on the

1245: topology of the network. This may seem a trivial consequence of definition

1246: (\ref{eq14}), but we will discuss it here for the sake of comparison of two

1247: strategies and optimizing them.

1248:

1249: We start with the strategies of stimulation, which affect the mean response

1250: $\bar {y}(s)$. In our simple model this may be accomplished by injecting a

1251: tonic input current into a unit number $i$. Mathematically it is

1252: accomplished by adding extra stimulus $s_i $ to this unit in Eq. (\ref{eq31}).

1253: Note that in biological systems the stimulating current is alternating with

1254: constant amplitude (Salzman et al., 1992). The

1255: mean response is shifted by the stimulation, i.e.

1256: \begin{equation}

1257: \label{eq52}

1258: \Delta \bar {y}=\frac{\partial \bar {y}}{\partial s_i }s_i ,

1259: \end{equation}

1260: where $s_i $ is the magnitude of injected tonic current. This leads to

1261: observable changes in the probability $p_1 $

1262: \begin{equation}

1263: \label{eq53}

1264: \Delta p_1 (i)=\frac{\partial p}{\partial \bar {y}}\frac{\partial \bar

1265: {y}}{\partial s_i }s_i .

1266: \end{equation}

1267: Here $\Delta p_1 (i)$ is the change in probability of correct responses

1268: after unit number $i$ is electrically stimulated. Can $\Delta p_1 (i)$ be a

1269: measure of DM?

1270:

1271: We notice that $\Delta p_1 (i)$ can be either positive or negative. This

1272: depends on the sign of derivative $\partial \bar {y}/\partial s_i $, which

1273: is positive for excitatory pathway from unit $i$ to the output and negative

1274: for inhibitory pathway. Since contribution to DM ought to be positive, we

1275: cannot assume simply that $DM_i \sim \Delta p_1 (i)$. The correct

1276: expression, which we provide here without derivation is

1277: \begin{equation}

1278: \label{eq54}

1279: DM_i \sim \overline {\eta _i^2 } \left[ {\Delta p_1 (i)} \right]^2.

1280: \end{equation}

1281: This equation is understood in proportional sense, since $DM_i $ should be

1282: normalized to ensure that $\sum\limits_i {DM_i } =1$. Our

1283: investigations show that this expression is accurate for trees and is

1284: consistent with both our earlier definitions (\ref{eq9}) or (\ref{eq14}). Remarkably, it

1285: employs quantities, which can be measured in a single-electrode experiment.

1286: Indeed, the amplitude of noise $\overline {\eta _i^2 } $ can be found from

1287: autocorrelation of unit's response, using (\ref{eq44}); and $\Delta p_1 (i)$ is

1288: determined from behavioral changes in response to single-unit stimulation.

1289: This equation thus provides an approach potentially useful in practice. Does

1290: this relationship work for networks of arbitrary connectivity?

1291:

1292: Figure 12B shows a counterexample, in which a unit is stimulated, which

1293: results in \textit{no} change in probability of correct response (we consider trials in which 1 is the correct response throughout this section).

1294: This is because there are two pathways, leading from this unit to the exit, one positive and one

1295: negative. They have equal strength, and, therefore, compensate each other.

1296: On the other hand, unit number one \textit{does} participate is DM, because if a

1297: non-stationary stimulation/stimulus is applied, its effect on the decision

1298: is not zero. Thus, (\ref{eq54}) and tonic stimulation method cannot be applied to

1299: arbitrary circuits, such as shown in Figure 12B, to accurately reveal

1300: decision makers.

1301:

1302:

1303: \begin{figure}[htbp]

1304: \centerline{\includegraphics[width=3.0in]{dma15.eps}}

1305: \centerline{\includegraphics[width=3.0in]{dma16.eps}}

1306: %\centerline{\includegraphics[width=2.67in]{dma17.eps}}

1307: %\centerline{\includegraphics[width=1.76in]{dma18.eps}}

1308: \caption{Finding decision makers using electric stimulation.

1309: \textbf{A} and \textbf{B}, tonic stimulation; \textbf{C}, random

1310: stimulation. \textbf{A}, tonic stimulation for trees results in shift in

1311: probability, which leads to correct estimation of decision making units.

1312: \textbf{B}, example of a circuit for which tonic stimulation leads to

1313: incorrect estimation of decision making, since it does not lead to the shift

1314: in probability. \textbf{C}, stimulation with a random current leads to

1315: correct estimate of decision making for networks with \textit{any} connectivity.

1316: \textbf{D}, for optimal performance in random stimulation paradigm, the task

1317: should be set so that the probability of correct responses is close to

1318: $p_{optimal} \approx 0.84$.

1319: }

1320: \label{fig12}

1321: \end{figure}

1322:

1323:

1324:

1325: Is there a stimulation method for finding decision making components in

1326: arbitrary networks? The method follows directly from the definition (\ref{eq14})

1327: [or (\ref{eq35}), which is equivalent]. Indeed, when stimulating current is a

1328: temporal white noise, the output variable $y$ acquires a larger variance

1329: (Figure 12C). Hence, the derivative of output variance, entering (\ref{eq35}) can

1330: be calculated operationally, by injecting a distracter current. More

1331: precisely, if the variance of stimulating current applied to unit $i$ is

1332: $\overline {s_i^2 } $ the derivative entering definition (\ref{eq35}) is

1333: \begin{equation}

1334: \label{eq55}

1335: \frac{\partial \sigma ^2(y)} {\partial \overline {\eta _i^2 } } =

1336: \frac{\Delta \sigma^2 (y)} {\overline {s_i^2 } }.

1337: \end{equation}

1338: In practice one has no access to the variable $y$, so one cannot measure

1339: directly the change in variance $\Delta \sigma^2 (y)$. Instead, one could

1340: measure the change in the probability of correct responses under the

1341: influence of distracting current. Indeed, from (\ref{eq50_5}) we obtain

1342: \begin{equation}

1343: \label{eq56}

1344: \Delta p_1 (i) = \frac{\partial p_1 } {\partial \sigma^2 (y) } \Delta \sigma^2 (y)

1345: \end{equation}

1346: Combining the last two equations we obtain for the important derivative

1347: \begin{equation}

1348: \label{eq57}

1349: \frac{\partial \sigma ^2(y)}{\partial \overline {\eta _i^2 } }=\frac{\Delta

1350: p_1 (i)}{\overline {s_i^2 } }\left( {\frac{\partial p_1 }{\partial \sigma

1351: ^2(y)}} \right)^{-1}

1352: \end{equation}

1353: Since the probability of correct responses always decreases under the

1354: influence of distracters, the derivative $\partial p_1 /\partial \sigma

1355: ^2(y)$ is a negative constant. It is the same for all units. We arrive

1356: therefore to the expression for contributions to DM, which follows from

1357: (\ref{eq35})

1358: \begin{equation}

1359: \label{eq58}

1360: DM_i \sim -\overline {\eta _i^2 } \frac{\Delta p_1 (i)}{\overline {s_i^2 }

1361: }

1362: \end{equation}

1363: Here $\Delta p_1 (i)$ is the decrease in probability of correct responses

1364: produced by electric stimulation with variance of the random current equal

1365: to $\overline {s_i^2 } $. The variance of noise on each unit $\overline {\eta

1366: _i^2 } $ can be found from autocorrelation using (\ref{eq45}). This procedure

1367: works for any topology in our simplified model. It should be noted here that

1368: if noise is not entirely white or cannot be considered white, (\ref{eq45}) cannot

1369: be used directly and should be replaced by an expression reflecting the

1370: spectral characteristics of noise appropriate for the system under

1371: investigation. Thus, if noise is provided by other parts of the network, its

1372: dynamic features may be more complex. Therefore, (\ref{eq45}) may not apply

1373: directly to the `hidden pathway' example given in the end of section \ref{conclusions}.

1374:

1375: The procedure, which we just described, permits further optimization.

1376: Indeed, imagine that the probability of correct responses is exactly

1377: $\raise.5ex\hbox{$\scriptstyle 1$}\kern-.1em/

1378: \kern-.15em\lower.25ex\hbox{$\scriptstyle 2$} $. Adding distracting

1379: stimulation current will not change this probability, i.e. $\Delta p_1

1380: (i)=0$ no matter what unit is stimulated. In the opposite limiting case when

1381: $p_1 \approx 1$, the effect of distracter on performance is exponentially

1382: small. Hence, behavioral response to stimulation has an optimum between $p_1

1383: =1/2$ and $1$. To find the optimum we observe from (\ref{eq56}) that $\Delta p_1 $

1384: is maximum for the same variation in $\Delta \sigma ^2(y)$ when $\partial

1385: p_1 /\partial \sigma ^2(y)$ is maximum. We therefore plot the latter

1386: derivative as a function of $p_1 $ in Figure 12D. We indeed observe a

1387: maximum at the value of probability of correct responses close to

1388: \begin{equation}

1389: \label{eq59}

1390: p_{optimal} \approx 0.841

1391: \end{equation}

1392: To summarize, the following scenario describes algorithm for finding

1393: contributions to DM using random stimulation.

1394:

1395: \underline {\textbf{Scenario 3:}} Assume that we \textit{do not know} the network connectivity

1396: $\hat {A}$ and output metrics vector $\vec {v}$; but we know autocorrelation

1397: for each unit $X_{ii} (t_1 ,t_2 )$. The steps below allow finding the

1398: decision makers.

1399:

1400: \begin{enumerate}

1401: \item Prepare stimulus so that the probability of correct responses is close to the value given by (\ref{eq59}).

1402: \item Stimulate one unit with random current, whose variance is $\overline {s_i^2 } $, and measure the decrease in probability of correct responses $\Delta p_1 $.

1403: \item Record autocorrelation and evaluate noise variance $\overline {\eta _i^2 } $ for this unit using (\ref{eq45}).

1404: \item Find contribution to DM for this unit using equation (\ref{eq58}).

1405: \item Repeat steps 1 through 4 for all units in the system.

1406: \item Normalize contributions to DM so that $\sum\limits_i {DM_i =1} $.

1407: \end{enumerate}

1408:

1409: \section{Discussion}

1410: \label{discussion}

1411:

1412: In this work we defined decision makers in networks, which behave in a well-defined fashion.

1413: As with any definition, there is certain degree of arbitrariness in our study,

1414: since this is the first mathematical study of this sort.

1415: We had to make choices about the features of decision making we

1416: were attempting to describe as well as about the way they were quantified.

1417: We demonstrated these features in a set of examples.

1418: Future studies will show if these features can be used as a basis of a more complete

1419: model-independent theory.

1420:

1421:

1422: In this study we postulated that variability and noise, causally linked to decisions,

1423: are the chief descriptors of DM. Although this point may seem paradoxical we

1424: suggest three arguments in its favor. First, variability may reflect

1425: additional information needed to make a decision in case of uncertainty.

1426: Such may be inputs from other modalities, memories, or some other relevant

1427: modulatory inputs, supplying e.g. emotional condition of the subject

1428: or changing utility values (Figure 8). Second, many behaviors, such as C-start

1429: escape responses in fish (Eaton and Emberley, 1991) and

1430: other organisms (Glimcher, 2003), have stochastic character.

1431: This makes the task of pursuer more difficult.

1432: Such unpredictable behaviors are reproduced in our model if the sensory input is weak

1433: or in the small signal-to-noise ratio case. Third,

1434: the goal of DM is to dissipate sensory information, as suggested in the

1435: introduction (Figure 1A), whereby an analog multi-dimensional stimulus space

1436: is reduced to a discrete space of several decisions. We argue that

1437: this transformation is facilitated by noise.

1438:

1439: We have studied the problem of finding decision making units in

1440: networks of various connectivities. This path took us from simple linear chains, for which

1441: the information-theoretical (IT) approach was found to be effective, to trees, and, finally to

1442: an alternative definition of decision makers, based on propagation of noise in networks.

1443: This latter definition is valid in networks of arbitrary topology.

1444: All these approaches are equivalent,

1445: when they can be compared, but include progressively broader classes of networks.

1446: As a practical application for the alternative definition we

1447: considered the problem of electric stimulation in the surrogate networks and

1448: showed a way of determining DM contributions for arbitrary networks using

1449: stimulation with random current. Our findings are summarized in Figures 13 and 14.

1450:

1451:

1452: \begin{figure}[htbp]

1453: \centerline{\includegraphics[width=3.2in]{dma19.eps}}

1454: \caption{

1455: The cluster of problems covered in this study.

1456: Solid/dashed arrows show the derivations performed here/yet to be confirmed

1457: or denied. IT stands for information-theoretical.

1458: }

1459: \label{fig13}

1460: \end{figure}

1461:

1462: Although we studied networks of complex connectivity,

1463: the model describing a single network element was quite simple.

1464: Not all of the units are linear, of course, since DM is a non-linear task

1465: (Figure 1A). However, our model is essentially based on linear elements. The

1466: motivation for this model is that it is easy to analyze. The study of

1467: simple models is a necessary step before analysis proceeds any further.

1468: Once the methodological issues are resolved for simpler models, complex

1469: non-linear systems can be studied in the same paradigm.

1470: One of important questions resolved here is that a completely linear element

1471: can be a decision maker, despite the presence of non-linear units in the

1472: network. Thus, nonlinearity is not a necessary attribute of DM. This question

1473: would be impossible to answer for more realistic system, since in practice

1474: all units contain nonlinearities.

1475:

1476: %In this study we obtained identities of decision makers using correlations

1477: %with the response. An alternative approach would be to consider correlations

1478: %with sensory inputs and to surmise that the points, where such correlations

1479: %disappear are responsible for making decisions. It is clear, however, that

1480: %some pathway between the unit and motor response is necessary for the unit

1481: %to impact DM. If such pathway exists, our analysis can be applied. If the

1482: %pathway does not exist, the approach, involving sensory correlations

1483: %produces misleading results. On the other hand, it is possible that a

1484: %combined sensory-motor approach would be more useful. This question warrants

1485: %further investigation.

1486:

1487: \begin{figure}[htbp]

1488: \centerline{\includegraphics[width=3.2in]{dma20.eps}}

1489: \caption{Comparison between different approaches studied here.}

1490: \label{fig20}

1491: \end{figure}

1492:

1493:

1494:

1495:

1496:

1497: Decision making task, as formulated in Figure 1A, is similar to general

1498: object discrimination task. Representation of motor response in our model is

1499: not distinguishable mathematically from the representation of abstract

1500: object/decision category (Horwitz and

1501: Newsome, 1998; Shadlen and Newsome, 2001). The latter does not necessarily

1502: lead to a motor command. Thus, our analysis may uncover the identities of

1503: units responsible for categorization of sensory inputs. In terms of this

1504: analysis we emphasize the distinction between units representing the object

1505: category and the units in which this representation is actually formed. The

1506: former are analogous to motor units in the decision task, while the latter

1507: are similar to decision makers. As follows from this study, the analysis is

1508: dependent upon the topology of the network involved. For simple linear

1509: sensory chains our conclusion is that the \textit{first} unit, spatially or temporally, in

1510: which the representation of the object is correlated with final outcome of

1511: the discrimination process, is responsible for casting the stimulus in one

1512: of the abstract classes. In case of recurrent networks a more detailed

1513: quantitative analysis is needed to draw conclusions about identities of

1514: categorizing units. Thus, DMA may find a broader use in identifying units

1515: representing abstract object's percepts.

1516:

1517: A special care should be taken in distinguishing the DM task from the

1518: sensory discrimination task. It may occur that in the same experiment these

1519: tasks are performed by different populations of neurons. An example is given

1520: by (Salinas and Romo, 1998). They discovered a population of M1

1521: neurons responding differentially to two categories of tactile stimuli. Some

1522: of these neurons did not respond, when the same behavior was guided by

1523: visual cues. This observation is consistent with these neurons performing

1524: sensory discrimination of tactile stimuli,

1525: while some other population making decisions about the actual motor response.

1526: Our mathematical analysis is general enough to include both

1527: of these functions. Thus, if correlations with motor response are

1528: studied, it will result in the decision makers; while when the correlations

1529: with percepts are investigated, DMA should provide the identities of

1530: discriminating elements.

1531:

1532: We suggest that DMA may be relevant to other biological systems.

1533: Possible applications may include the analysis of molecular networks, such

1534: as genetic regulatory or protein binding networks; finding decision makers

1535: in compartmental models of dentritic trees (Poirazi and Mel,

1536: 2001); studies of neural networks and structural networks of connectivities

1537: between different brain areas; and analysis of social networks.

1538:

1539:

1540:

1541:

1542: \section{Conclusion}

1543: \label{gen_conclusion}

1544:

1545: In this study we define network elements responsible for making decisions.

1546: We obtain two equivalent definitions. According to one, decisions are made

1547: by elements, in which correlations with the decision are first formed.

1548: According to the second definition, decision making activity is measured by

1549: the impact of variability in given unit on the response. We give examples of

1550: network motifs, especially potent from decision making prospective, such as

1551: fan-out hubs and recurrent loops. The latter can function as temporal

1552: integrators of sensory inputs. We also study how electric stimulations can

1553: reveal decision making components. We conclude that stimulations with

1554: time-varying random current produce correct results for all network

1555: topologies.

1556:

1557:

1558:

1559:

1560: \appendix

1561:

1562: \section{The linear chain model.}

1563: \label{appendixa}

1564:

1565: Here we solve a more general version of linear chain model than considered

1566: in the text. The responses of neighboring neurons are related linearly

1567: \begin{equation}

1568: x_i =C_{i-1} x_{i-1} +\eta _i

1569: \label{Eqa1}

1570: \end{equation}

1571: This is a generalization of (\ref{eq1}). The response of the $n$th unit is

1572: \begin{equation}

1573: x_n =\sum\limits_{i=1}^n {\alpha _{ni}\eta _i } +\alpha _{n0}x_0

1574: \label{Eqa2}

1575: \end{equation}

1576: where coefficients $\alpha _{ni} =C_{n-1} C_{n-2} \ldots C_i $, $\alpha

1577: _{nn} =1$. The external signal $x_0 $ is assumed to be zero in this

1578: appendix, due to (\ref{eq5}). For the last element in the chain we have

1579: \begin{equation}

1580: x_N =\sum\limits_{i=1}^N {\alpha _{Ni}\eta _i } .

1581: \label{Eqa3}

1582: \end{equation}

1583:

1584:

1585: Comparing (\ref{Eqa2}) and (\ref{Eqa3}) we conclude that

1586: \begin{equation}

1587: x_N =\alpha _{Nn} x_n +\xi ,

1588: \label{Eqa4}

1589: \end{equation}

1590: where $\xi $ is a variable, which describes noise in the networks downstream

1591: from unit $n$. It is, thus, uncorrelated with $x_n $. This is where

1592: tree-like topology enters our solution, since in case of loops, $x_n $ and

1593: $\xi $ are correlated. Our goal now is to calculate MI between the decision

1594: variable $d=H(x_N )$ and $x_n $. We will use the definition for MI

1595: \begin{equation}

1596: MI(d,x_n )=\sum\limits_{d=0,1} {\int\limits_{-\infty }^\infty {dx_n \rho

1597: \left( {d,x_n } \right)\log _2 \left[ {\frac{\rho \left( {d,x_n }

1598: \right)}{\rho (d)\rho (x_n )}} \right]} }

1599: \label{Eqa6}

1600: \end{equation}

1601: Here $\rho \left( d \right)=1/2$, since there is no signal;

1602: \begin{equation}

1603: \rho \left( {x_n } \right)=\exp \left( {-x_n^2 /2\overline {x_n^2 } }

1604: \right)/\left( {2\pi \overline {x_n^2 } } \right)^{1/2}

1605: \label{Eqa8}

1606: \end{equation}

1607: and

1608: \begin{equation}

1609: \rho (d;x_n )=\frac{\rho (x_n )}{2}\left[ {1\pm \mbox{erf}\left(

1610: {\frac{\alpha _{Nn} x_n }{\sigma \left( \xi \right)\sqrt 2 }} \right)}

1611: \right].

1612: \label{Eqa9}

1613: \end{equation}

1614: The upper/lower sign is assumed for $d=0$ or $1$ in (\ref{Eqa9}); $\sigma (\xi )$

1615: is the standard deviation of Gaussian variable $\xi $ defined in (\ref{Eqa4}). The

1616: expression for MI (\ref{Eqa6}) results in

1617: \begin{equation}

1618: \begin{array}{l}

1619:  MI_n =M(s_n ) \\

1620:  M(s_n )=\frac{1}{\sqrt \pi }\int\limits_{-\infty }^\infty {dze^{-z^2}\left[

1621: {1+\mbox{erf}\left( {zs_n } \right)} \right]} \log _2 \left[

1622: {1+\mbox{erf}\left( {zs_n } \right)} \right] \\

1623:  s_n =\sigma (\alpha _{Nn} x_n )/\sigma (\xi ). \\

1624:  \end{array}

1625: \label{Eqa11}

1626: \end{equation}

1627: MI is therefore a function of signal-to-noise ratio $s_n$. Inversely,

1628: \begin{equation}

1629: s_n^2 =\frac{\alpha _{Nn}^2 \overline {x_n^2 } }{\overline {\xi ^2} }=\left[

1630: {M^{-1}(MI_n )} \right]^2

1631: \label{Eqa12}

1632: \end{equation}

1633: On the other hand, (\ref{Eqa4}) leads to

1634: \begin{equation}

1635: \overline {x_N^2 } =\alpha _{Nn}^2 \overline {x_n^2 } +\overline {\xi ^2}

1636: \label{Eqa13}

1637: \end{equation}

1638: Solving (\ref{Eqa12}) and (\ref{Eqa13}) with respect to $\alpha _{Nn}^2 \overline {x_n^2 }$ we have

1639: \begin{equation}

1640: \frac{\alpha _{Nn}^2 \overline {x_n^2 } }{\overline {x_N^2 } }=\frac{\left[

1641: {M^{-1}(MI_n )} \right]^2}{1+\left[ {M^{-1}(MI_n )} \right]^2}\equiv F(MI_n

1642: )

1643: \label{Eqa14}

1644: \end{equation}

1645: Function $M^{-1}$ here is inverse to $M$ defined in (\ref{Eqa11}). Function $F(MI)$

1646: numerically calculated from (\ref{Eqa11}) and (\ref{Eqa14}) is shown in Figure 5. Lastly,

1647: we recall that variances $\alpha _{Nn}^2 \overline {x_n^2 } $ are related to

1648: the strength of noise $\overline {\eta _i^2 } $ through (\ref{Eqa2}). We have

1649: \begin{equation}

1650: \displaystyle

1651: \alpha _{Nn}^2 \overline {x_n^2 } =\sum\limits_{i=1}^n {\alpha _{Ni}^2

1652: \overline {\eta _i^2 } }

1653: \label{Eqa15}

1654: \end{equation}

1655: Eqs. (\ref{Eqa14}) and (\ref{Eqa15}) are used below to prove a variety of statements about

1656: function $F(MI)$ used in the main text.

1657:

1658: \subsection{In the uniform noise example $F(MI)$ is a linear function of position in the chain.}

1659:

1660: In this case $C_1 =\ldots =C_{N-1} =1$,

1661: and, consequently, $\alpha _{N1} =\ldots =\alpha _{NN} =1$. Noise variance

1662: is the same on every node, i.e. $\overline {\eta _i^2 } \equiv \eta^2 $.

1663: As follows from (\ref{Eqa15}) $\overline {x_n^2 } =\eta ^2n$, which results in

1664: \begin{equation}

1665: F(MI_n )=n/N

1666: \label{Eqa16}

1667: \end{equation}

1668: It follows that contributions to DM defined by (\ref{eq9}) are the same for all

1669: units.

1670:

1671: \subsection{In the `loud' neuron example the contributions of units upstream

1672: from the strong link are larger by a factor of $K^2$ than

1673: contribution from the downstream units.}

1674:

1675:

1676: In this case $\alpha _{1...k} =K$,

1677: while $\alpha _{k+1...N} =1$, assuming that the link from unit $k$ to $k+1$

1678: is strengthened. In the example in the text $k=5$ [cf. (\ref{eq10})]. Eq. (\ref{Eqa15})

1679: leads us to the values for variances of responses

1680: \begin{equation}

1681: \displaystyle

1682: \alpha _{Nn}^2 \overline {x_n^2 } =\left\{ {{\begin{array}{*{20}c}\displaystyle

1683:  {\displaystyle \eta ^2K^2n,} \hfill & {n\le k} \hfill \\

1684:  {\eta ^2K^2k+\eta ^2\left(\displaystyle {n-k} \right),} \hfill & {n>k} \hfill \\

1685: \end{array} }} \right.

1686: \label{Eqa17}

1687: \end{equation}

1688: Applying (\ref{Eqa14}) we obtain the expression for $F(MI)$

1689: \begin{equation}

1690: F(MI_n )=\left\{ {{\begin{array}{*{20}c}\displaystyle

1691:  {\displaystyle \frac{K^2n}{N-k+K^2k},} \hfill & {n\le k} \hfill \\

1692:  {\displaystyle \frac{n-k+K^2k}{N-k+K^2k},} \hfill & {n>k} \hfill \\

1693: \end{array} }} \right.,

1694: \label{Eqa18}

1695: \end{equation}

1696: which is a piece-wise linear function of $n$. Eq.~(\ref{eq9}) determines

1697: contributions to DM as

1698: \begin{equation}

1699: DM_n =\left\{ {{\begin{array}{*{20}c}

1700:  {\displaystyle \frac{K^2}{N-k+K^2k},} \hfill & {n\le k} \hfill \\

1701:  {\displaystyle \frac{1}{N-k+K^2k},} \hfill & {n>k} \hfill \\

1702: \end{array} }} \right.

1703: \label{Eqa19}

1704: \end{equation}

1705: This confirms that the upstream units ($n\le k)$ are $K^2$ times more potent

1706: than the downstream ones ($n>k)$.

1707:

1708: \subsection{Two definitions of contribution to DM using derivative of

1709: $F(MI)$ (\ref{eq9}) and the impact of noise (\ref{eq14}) are equivalent. }

1710:

1711: Let

1712: us start by determining decision makers from definition (\ref{eq14}). According to

1713: (\ref{Eqa3})

1714: \begin{equation}

1715: \overline {x_N^2 } =\sum\limits_{i=1}^N {\alpha _{Ni}^2 \overline {\eta _i^2

1716: } } .

1717: \label{Eqa20}

1718: \end{equation}

1719: Definition (\ref{eq14}) gives

1720: \begin{equation}

1721: DM_i \propto \overline {\eta _i^2 } \frac{\partial \overline {x_N^2 }

1722: }{\partial \overline {\eta _i^2 } }=\alpha _{Ni}^2 \overline {\eta _i^2 } .

1723: \label{Eqa21}

1724: \end{equation}

1725: After normalization we obtain

1726: \begin{equation}

1727: DM_i =\frac{\alpha _{Ni}^2 \overline {\eta _i^2 } }{\overline {x_N^2 } }.

1728: \label{Eqa22}

1729: \end{equation}

1730: Let us derive the same result from (\ref{eq9}). As follows from (\ref{Eqa14})

1731: \begin{equation}

1732: \begin{array}{l}

1733: \displaystyle F(MI_n )-F(MI_{n-1} )=\frac{1}{\overline {x_N^2 } }\left( {\alpha _{Nn}^2

1734: \overline {x_n^2 } -\alpha _{N\mbox{ }n-1}^2 \overline {x_{n-1}^2 } }

1735: \right) \\

1736:  \displaystyle =\frac{\alpha _{Nn}^2 }{\overline {x_N^2 } }\left( {\overline {x_n^2 }

1737: -C_{n-1}^2 \overline {x_{n-1}^2 } } \right)=\frac{\alpha _{Nn}^2 \overline

1738: {\eta _n^2 } }{\overline {x_N^2 } } \\

1739:  \end{array}

1740: \label{Eqa23}

1741: \end{equation}

1742: This proves the equivalence of (\ref{eq9}) and (\ref{eq14}), since the result is

1743: identical to (\ref{Eqa22}).

1744:

1745: \section{Connection between discrete- and continuous-time models.}

1746: \label{appendixb}

1747:

1748: In this section we show that the discrete-time model can be derived from

1749: continuous-time model. Starting from equation (\ref{eq37}) for the unit responses

1750: in the continuous case we obtain the relation for solutions at two different

1751: time points separated by the time-interval $\tau$, analogous to (\ref{eq19}) in

1752: the discrete-time description. Then we show that in the limiting case $\tau \to 0$ two descriptions are equivalent.

1753:

1754: From (\ref{eq37}) we obtain

1755: \begin{equation}

1756: \vec {x}(t+\tau )=e^{-\hat {A}\tau }\vec {x}(t)+\int\limits_t^{t+\tau }

1757: {e^{-\hat {A}(t+\tau -{t}')}\vec {\eta }({t}')d{t}'}.

1758: \label{Eqb1}

1759: \end{equation}

1760: This equation can be rewritten as $\vec {x}(t+\tau )=\hat {C}\vec {x}(t)+{\vec {\eta }}'(t)$,

1761: where

1762: \begin{equation}

1763: \hat {C}=e^{-\hat {A}\tau }\approx \hat {I}-\hat {A}\tau .

1764: \label{Eqb2}

1765: \end{equation}

1766: Thus it has the same form as (\ref{eq19}). Using (\ref{eq32}) we obtain that the new noise cross-correlation matrix

1767: \begin{equation}

1768: {\hat {{\cal N}}}'=\int\limits_0^\tau {e^{-\hat {A}{t}'}\hat {{\cal

1769: N}}e^{-\hat {A}^T{t}'}d{t}'} .

1770: \label{Eqb3}

1771: \end{equation}

1772: The solution of the continuous-time problem satisfies the equations of the

1773: discrete-time model for an arbitrarily large time interval $\tau $, but

1774: the new noise cross-correlation matrix ${\hat {{\cal N}}}'$ is non-diagonal

1775: in this case. In the limiting case $\tau \to 0$ it becomes

1776: diagonal. Indeed (\ref{Eqb3}) implies that in this limit

1777: \begin{equation}

1778: {\hat {{\cal N}}}'=\hat {{\cal N}}\tau \quad ,

1779: \label{Eqb4}

1780: \end{equation}

1781: which is diagonal by the definition of the continuous-time model.

1782: Here we kept only terms linear in $\tau $. Thus, in this limit the matrix ${\hat {{\cal N}}}'$ is diagonal as needed in our

1783: formulation of discrete-time model. One can also derive (\ref{eq41}) from (\ref{eq28})

1784: using (\ref{Eqb2}) and (\ref{Eqb4}) and taking the limit $\tau \to 0$.

1785:

1786:

1787:

1788: \begin{thebibliography}{99}

1789: \small

1790:

1791:

1792: \bibitem{1}

1793: Abramowitz M, Stegun IA (1972) Handbook of mathematical functions with

1794: formulas, graphs, and mathematical tables, 10th printing, 1972, with

1795: corrections. Edition. Washington: U.S. Govt. Print. Off.

1796:

1797: \bibitem{2}

1798: Eaton RC, Emberley DS (1991) How stimulus direction determines the

1799: trajectory of the Mauthner-initiated escape response in a teleost fish. J

1800: Exp Biol 161:469-487.

1801:

1802: \bibitem{3}

1803: Efetov K (1997) Supersymmetry in disorder and chaos. Cambridge [England] ;

1804: New York: Cambridge University Press.

1805:

1806: \bibitem{4}

1807: Glimcher PW (2003) The neurobiology of visual-saccadic decision making. Annu

1808: Rev Neurosci 26:133-179.

1809:

1810: \bibitem{5}

1811: Gold JI, Shadlen MN (2002) Banburismus and the brain: decoding the

1812: relationship between sensory stimuli, decisions, and reward. Neuron

1813: 36:299-308.

1814:

1815: \bibitem{6}

1816: Horwitz GD, Newsome WT (1998) Neurophysiology: sensing and categorizing.

1817: Curr Biol 8:R376-378.

1818:

1819: \bibitem{7}

1820: Mazurek ME, Roitman JD, Ditterich J, Shadlen MN (2003) A role for neural

1821: integrators in perceptual decision making. Cereb Cortex 13:1257-1269.

1822:

1823: \bibitem{8}

1824: Poirazi P, Mel BW (2001) Impact of active dendrites and structural

1825: plasticity on the memory capacity of neural tissue. Neuron 29:779-796.

1826:

1827: \bibitem{9}

1828: Robinson DA (1989) Integrating with neurons. Annu Rev Neurosci 12:33-45.

1829:

1830: \bibitem{10}

1831: Roitman JD, Shadlen MN (2002) Response of neurons in the lateral

1832: intraparietal area during a combined visual discrimination reaction time

1833: task. J Neurosci 22:9475-9489.

1834:

1835: \bibitem{11}

1836: Romo R, Salinas E (2003) Flutter discrimination: neural codes, perception,

1837: memory and decision making. Nat Rev Neurosci 4:203-218.

1838:

1839: \bibitem{12}

1840: Salinas E, Romo R (1998) Conversion of sensory signals into motor commands

1841: in primary motor cortex. J Neurosci 18:499-511.

1842:

1843: \bibitem{13}

1844: Salzman CD, Murasugi CM, Britten KH, Newsome WT (1992) Microstimulation in

1845: visual area MT: effects on direction discrimination performance. J Neurosci

1846: 12:2331-2355.

1847:

1848: \bibitem{14}

1849: Shadlen MN, Newsome WT (2001) Neural basis of a perceptual decision in the

1850: parietal cortex (area LIP) of the rhesus monkey. J Neurophysiol

1851: 86:1916-1936.

1852:

1853: \end{thebibliography}

1854: \end{document}

1855: