0610:q-bio0610034/smp5.tex

1: \documentclass[12pt]{iopart}

2: \usepackage[latin1]{inputenc}

3: \usepackage{graphicx,psfrag}

4: %\usepackage[hypertex]{hyperref}

5:

6: \newcommand{\average}[1]{\left<{#1}\right>}

7:

8: \begin{document}

9: %\preprint{\bf Revision 1}

10: \title[Population size effects in evolutionary dynamics]{Population

11: size effects in evolutionary dynamics on neutral

12: networks and toy landscapes}

13: \author{Sumedha$^1$,

14: Olivier C. Martin$^1,^2$, and Luca Peliti$^3$}

15: \address{$^1$ Universit� Paris-Sud, UMR8626, LPTMS, Orsay, F-91405; CNRS, Orsay,

16: F-91405, France}

17: \address{$^2$ UMR de G�n�tique V�g�tale du Moulon,

18: INRA-UPS-CNRS-INA PG, F--91190 Gif-sur-Yvette, France}

19: \address{$^3$ Dipartimento di Scienze Fisiche, Sezione INFN and Unit�

20: CNISM, Universit� di Napoli ``Federico II'', Complesso

21: Universitario di Monte S. Angelo, I--80126 Napoli, Italy}

22: \eads{\mailto{sumedha.sumedha@gmail.com},

23: \mailto{olivier.martin@u-psud.fr},

24: \mailto{Luca.Peliti@na.infn.it}}

25: \begin{abstract}

26: We study the dynamics of a population subject to selective

27: pressures, evolving either on RNA neutral networks or in toy fitness

28: landscapes. We discuss the spread and the neutrality of the

29: population in the steady state. Different limits arise depending on

30: whether selection or random drift are dominant. In the presence of

31: strong drift we show that observables depend mainly on $M \mu$, $M$

32: being the population size and $\mu$ the mutation rate, while

33: corrections to this scaling go as $1/M$: such corrections can be

34: quite large in the presence of selection if there are barriers in

35: the fitness landscape. Also we find that the convergence to

36: the large $M \mu$ limit is linear in $1/M \mu$. Finally

37: we introduce a

38: protocol that minimizes drift; then

39: observables scale like $1/M$ rather than $1/(M\mu)$, allowing one to

40: determine the large $M$ limit faster when $\mu$ is small; furthermore

41: the genotypic diversity increases from $O(\ln M)$ to

42: $O(M)$.

43: \end{abstract}

44:

45: \pacs{87.23.-n (Ecology and evolution),

46: 87.15.Aa (Theory and modeling; computer simulation),

47: 89.75.Hc (Networks and genealogical trees)}

48:

49: \submitto{JSTAT}

50:

51: Keywords: evolutionary dynamics, neutral networks, drift, selection

52:

53: \maketitle

54:

55: \section{Introduction}

56:

57: In evolutionary biology, populations are subject to a number of

58: forces that shape their genetic composition~\cite{HerronFreeman03}.

59: Amongst these, mutations, selection and drift play a central role.

60: Drift becomes dominant for small populations, while for large

61: populations one reaches a steady state where mutations balance

62: effects of selection. The landscape

63: paradigm~\cite{Wright32,Gavrilets04} provides a relation between

64: genotype/phenotype and fitness, allowing for quantitative studies of

65: evolving populations, while at the same time giving a qualitative

66: picture. This has been particularly developed in the context of

67: quasispecies theory~\cite{Eigen71,EigenSchuster77,EigenMcCaskill89}

68: and of evolution on networks~\cite{Stadler02,Schuster02}; in a

69: different framework these problems have been analyzed within

70: evolutionary game

71: theory~\cite{LiebermanHauert05,TaylorRudenbert04,RocaGuesta06,AntalRedner06}

72: although, in that case, usually there is no drift in the usual sense and

73: the number of genotypes is low.

74:

75: Let us consider a population of $M$ individuals evolving in a static

76: fitness landscape. We can define its steady-state distribution from

77: the average number of individuals with a given genotype, averaged

78: over a long stretch of time. One can also consider how

79: the population is spread out in genotype

80: space. If all genotypes have the same fitness (flat

81: landscape), the steady state distribution is independent of $M$

82: while the spread of the population depends mainly on the product $M

83: \mu$ where $\mu$ is the mutation rate~\cite{DerridaPeliti91}. In the

84: limit of an infinite population ($\mu$ fixed),

85: the generation-to-generation

86: fluctuations vanish and the instantaneous population distribution

87: coincides with the steady-state one. This is the quasispecies limit,

88: where the steady-state distribution is given by the leading

89: eigenvector of the evolution

90: operator~\cite{NimwegenCrutchfield99}. When the population

91: is finite, no general analytic solution for the steady-state

92: distribution is known. If the population size is much greater than

93: the number of genotypes, then the so-called diffusion

94: approximation~\cite{Ewens04} can be used. However, in most

95: realistic cases, the number of possible genotypes is much greater

96: than the population size. We are interested in understanding how

97: mutation, selection and drift affect different properties of the

98: steady state in such systems.

99:

100: To unravel the different effects, we consider different evolutionary

101: dynamics, in which some of these processes may be present or not.

102: Most of our study is conducted in the framework of RNA neutral

103: networks~\cite{FontanaStadler93,Schuster02}, the archetypes of

104: genotype to phenotype mappings, but we also consider toy fitness

105: landscapes. After specifying our systems and population dynamics in

106: sections~\ref{sec:NeutralNetworks} to

107: \ref{sec:EvolutionaryDynamics}, we examine the dependence on the

108: mutation rate $\mu$ and on $M$ of the different processes driving

109: the dynamics. We begin with the case of neutral evolution in

110: section~\ref{sec:DynamicsWithoutSelection}, turning on and off the

111: drift. In section~\ref{sec:SelectionAndDrift} we allow for selection

112: in the usual way that leads to significant drift. As a general rule,

113: drift in these situations leads to $M \mu$ scaling, as shown

114: previously in the absence of selection~\cite{DerridaPeliti91}. Then

115: in section~\ref{sec:SelectionLowDrift} we introduce a particular

116: dynamics with selection but \emph{low drift}. There the $M \mu$

117: scaling is replaced by a smooth large $M$ limit even when $\mu \to

118: 0$; furthermore the genotypic diversity becomes proportional to $M$

119: rather than being nearly constant. The corrections to these scalings

120: generically go as $O(1/M)$, with large effects when there are

121: barriers in the landscape, as we exhibit in

122: section~\ref{sec:Bottlenecks}. Section~\ref{sec:Conclusions} is

123: devoted to some final considerations.

124:

125: \section{RNA neutral networks as fitness landscapes}

126: \label{sec:NeutralNetworks}

127:

128: In studies of genotype to phenotype mappings, one often focuses on

129: biological molecules because the corresponding mapping is relatively

130: well defined. The genotype is simply the sequence of the

131: bio-molecule, while the phenotype is its shape, as specified for

132: instance by the minimum free-energy structure it folds into. Using

133: either protein or secondary RNA structures, it has been

134: found~\cite{Schuster02,FontanaStadler93,LipmanWilbur91,GrGiegerich96}

135: that neutral genotypes (genotypes that have a given phenotype) which

136: are connected via single mutational steps form extended networks

137: that permeate large regions of genotype space. These are known as

138: ``neutral networks''. Via the neutral network, a population can move

139: in genotype space without crossing unfavorable low-fitness regions,

140: in contrast to what happens in many rugged fitness

141: landscapes~\cite{Wright32,KauffmanLevin87,

142: FlyvbjergHenrik92,NimwegenCrutchfield00}. However, because of the

143: huge dimensionality of our genotype space, large neutral (or nearly

144: neutral) networks can be argued to be inevitable~\cite{Gavrilets04}.

145:

146: Here we shall work with an RNA neutral network, i.e., all RNA

147: sequences which fold into a given target RNA secondary structure.

148: The genotype of an RNA molecule is given by its base sequence: there

149: are four bases, A,C,G, and U, and thus $4^L$ genotypes for molecules

150: of $L$ bases. The molecule's phenotype is given by its

151: \emph{secondary} structure, i.e., by which bases are paired with which

152: as occur in its folded form. To every genotype one associates just

153: one phenotype (the secondary structure of minimum free energy) while

154: in general there will be many genotypes compatible with a given

155: phenotype. This many-to-one genotype to phenotype mapping has been

156: widely

157: studied~\cite{Schuster02,NimwegenCrutchfield99,FontanaStadler93,

158: GrGiegerich96,ReidysSchuster95,FontanaSchuster98,SchusterStadler02}.

159: Standard computational tools are available on the web to fold given

160: sequences; see for instance the \emph{fold} subroutine from the

161: Vienna package~\cite{Vienna95} which we used for all of this work's

162: computations. Two sequences are nearest neighbors (connected on the

163: neutral network) if and only if they differ by a single nucleotide

164: substitution. In general, RNA neutral networks are heterogenous

165: graphs, so that for instance the local connectivity varies quite a

166: lot from site to site.

167:

168: The secondary structure (phenotype) chosen in this study is the one

169: used by van~Nimwegen et

170: al.~\cite{NimwegenCrutchfield99,NimwegenCrutchfield00}: it has $18$

171: nucleotides with six base pairs and is depicted in figure~\ref{rna}.

172: By single-nucleotide substitutions, purine-pyrimidine base pairs

173: (G--C, G--U, A--U) can mutate into each other, but not into

174: pyrimidine-purine (C--G, U--G, U--A) base pairs. Hence we considered

175: only the purine-pyrimidine base pairs. Given the base pairing rules

176: for this system, the number of a priori ``compatible'' sequences for

177: such a structure is $4^{6} \times 3^6 =2985884$. At $30^\circ$C,

178: 37963 of these fold into the target structure; this number depends a

179: bit on the choice of temperature since the \textsl{fold} algorithm

180: computes free energies. We find these genotypes to be organized into

181: \emph{three} neutral networks (connected components), of sizes

182: $489$, $5784$ and $31484$ respectively. This will allow us to

183: investigate the effect of neutral network size on our observables.

184: %

185: \begin{figure}

186: %    \psfrag{5'}{$5'$} \psfrag{3'}{$3'$}

187: \begin{center}

188:     \includegraphics[width=7cm]{RNA.eps}

189:     \end{center}

190: \caption{The target RNA secondary structure (cf.\ van Nimwegen et

191: al.~\cite{NimwegenCrutchfield99}).} \label{rna}

192: \end{figure}

193: %

194:

195: To define the fitness landscape, we consider that individuals with

196: the ``correct'' phenotype (residing on the neutral network) are

197: viable, i.e., have maximal fitness, while all other phenotypes are

198: non-viable, i.e., have minimal fitness. We take these extreme values

199: to be 1 and 0, corresponding to the \emph{strong} selection limit;

200: then any mutation that takes one off the neutral network is lethal.

201:

202: \section{Population spread, Hamming distances and neutrality}

203: \label{sec:neutrality}

204:

205: We now ask how drift and selection affect observables associated

206: with the steady state population. A first observable quantifies how

207: much the population is ``spread out'' in genotype space, namely how

208: individuals at a given generation differ from one another in

209: genotype space. One defines the ``Hamming distance'' $h$ between two

210: genotypes as the number of positions where the two associated

211: sequences have different nucleotides. Following Derrida and

212: Peliti~\cite{DerridaPeliti91}, we shall study the distribution

213: $P(h)$ of distances when two genotypes are taken at random in the

214: population, averaged over generations. If $P(h)$ is broad, then the

215: population is spread out in genotype space.

216:

217: A second observable is ``neutrality''~\cite{HuynenStadler96}. Let

218: $g_0$ be a genotype (a sequence of $L$ bases) belonging to the

219: neutral network; examine its $3 L$ possible single-nucleotide

220: substitutions and let $d$ be the number of these mutants that belong

221: to the neutral network. The ``neutrality'' of $g_0$ is then $d$, the

222: coordination (degree) of $g_0$ on the neutral network. A related

223: notion is the \emph{mutational robustness} $R_{\mu}$ of $g_0$. It is

224: defined as the survival probability of its mutant offspring. In the

225: context of neutral networks with fitness values 0 and 1, we see that

226: in fact

227: \begin{equation}

228: \label{eq:Rmu}

229: R_{\mu} = \frac{d}{3L},

230: \end{equation}

231: where $d$ is the neutrality of $g_0$. These definitions can be

232: straightforwardly extended to the neutrality or robustness of any

233: collection of genotypes. Thus one defines the ``network neutrality''

234: of a neutral network as the mean of $d$ when considering all of its

235: nodes. Similarly, when one has a \emph{population} of genotypes, the

236: ``population neutrality'' is simply the average of $d$ over that

237: population, each individual being counted once. The population neutrality

238: depends on both the neutral network properties and on the

239: evolutionary dynamics~\cite{NimwegenCrutchfield99}. Furthermore, we

240: immediately see that the neutrality of a population is $3 L$ times

241: the mean robustness of its individuals.

242:

243:

244: \section{The model}

245: \label{sec:EvolutionaryDynamics}

246:

247: Many processes affect the genetic makeup of natural populations. In

248: this work, we focus on the effects of mutation, selection and drift.

249: We wish to turn on and off selection or even drift while considering

250: the effects of the population size $M$ or of the mutation rate

251: $\mu$. Thus the details of the evolutionary process we consider are

252: tailored to emphasize one or another of these aspects at a time.

253:

254: We consider a population evolving with nonoverlapping

255: generations; the population size is kept at a fixed value in the

256: standard way. Other choices could have been made, but as

257: in most studies, the detailed procedures used in the evolutionary

258: dynamics is not expected to be very important.

259:

260: Given the $M$ individuals at the current generation, we must produce

261: $M$ viable offspring to form the next generation. Each offspring is

262: produced from a parent and given a chance to mutate: with

263: probability $1-\mu$, no mutation is applied, and with probability

264: $\mu$ one base at random is changed. Then \emph{selection} is

265: applied: the child is kept if and only if it is viable. More

266: generally, on an arbitrary fitness landscape, we let it survive

267: stochastically according to its fitness. Of course if there is no

268: selection, the offspring is always kept. The process of producing

269: offspring is repeated until the new generation has size $M$.

270: \emph{Drift} comes in via the way the parents are chosen to produce

271: candidate offspring. In the standard method, the parents are chosen

272: randomly \emph{with replacement}: clearly this allows for drift as

273: by bad luck some parents will not produce any offspring. In the

274: presence of selection, drift cannot be turned off completely but it

275: can be significantly lowered.

276:

277: Indeed, let us consider the following process. First, each individual

278: of the population produces one offspring which mutates with

279: probability $\mu$: if a mutation is lethal, the corresponding

280: offspring is killed. In this step there is no replacement and the

281: resulting offspring population size will generally be smaller than

282: $M$. Second, one chooses individuals randomly from this offspring

283: population and replicates them. This is done until the population

284: size reaches $M$ again. Note that when $\mu$ is small, the new

285: generation will be nearly identical to the previous one, even for

286: small $M$, so there is very little drift. Because of selection, a

287: small amount of drift does occur, but its intensity is proportional

288: to $\mu$.

289:

290: In all our runs, we initialize arbitrarily the population and let it

291: evolve for a large number of generations until initial conditions

292: are forgotten: this is the steady state limit. All the data

293: presented in this work are time averages taken from this regime. We

294: are now ready to see how $M$, $\mu$, drift and selection affect the

295: spread, neutrality and the genotypic diversity of the steady state

296: population.

297:

298: \section{Dynamics without selection}

299: \label{sec:DynamicsWithoutSelection}

300:

301: In this section, we consider a population evolving without being

302: subject to selection. We investigate the effects of allowing or not

303: drift, first on regular networks and then on

304: heterogeneous ones.

305:

306: \subsection{Homogeneous networks}

307:

308: Consider the space of sequences of length $L$; if we take these to

309: all be viable, then we get a homogeneous network in which all

310: 1-mutant neighbors of each genotype belong to the network. Derrida

311: and Peliti~\cite{DerridaPeliti91} studied the evolution of a

312: population on such a flat fitness landscape. Using the fact that

313: there is no selection, it is easy to show that the steady state

314: distribution is \emph{uniform}. Thus the population neutrality is

315: trivial, being given by the degree of the network, i.e., $3L$, for

316: all population sizes. In contrast to this simple result, the

317: distribution of Hamming distances between genotypes in the

318: population is generally non-trivial. Depending on the nature of the

319: dynamics, we have the following behaviors:

320: \begin{itemize}

321: \item[(a)] Drift off --- The offspring are produced from parents

322: \emph{without replacement}: since each individual has exactly one

323: offspring, each lineage acts like an independent random walk. Thus

324: the Hamming distances between the genotypes in the population are

325: completely random: the mean of $h$ lies at $3L/4$ (at each position

326: along the sequence, one has a $3/4$ chance of having different bases

327: when comparing two random sequences) and its variance is equal to

328: $3L/16$.

329: \item[(b)] Drift on --- Here the offspring are produced from parents

330: \emph{with replacement}: the number of offspring of an individual is

331: variable, leading to tree genealogies. This situation incorporating

332: drift was studied by Derrida and Peliti~\cite{DerridaPeliti91} and

333: leads to a non-trivial $P(h)$ which depends on $M$ and $\mu$. At any

334: given generation, the individuals have mutual distances that reflect

335: the fact that they descend from a common ancestor, giving rise to a

336: clustering of the population that fluctuates from one generation to

337: the next. For our purposes here, we focus on the

338: result~\cite{DerridaPeliti91} that the relevant parameter when $M$

339: is large is $M \mu$: in particular, $P(h)$ depends only on the

340: product $M \mu$ at large $M$, a property that we call $M

341: \mu$~scaling.

342: \end{itemize}

343:

344:

345:

346: \subsection{RNA neutral networks}

347:

348: We now consider a population evolving on an RNA neutral network,

349: defined as the subspace of sequences which fold (at $30^\circ$~C)

350: into the target secondary structure shown in figure~\ref{rna}. RNA

351: neutral networks are generally heterogenous. Evolutionary dynamics

352: without selection can be implemented by simply ``forbidding''

353: attempts to apply lethal mutations. There are two natural ways to do

354: this, referred to as blind and myopic ant dynamics~\cite{Hughes96}.

355: In myopic ant dynamics, also called adaptive random

356: walks~\cite{KauffmanLevin87,FlyvbjergHenrik92,Orr03}, an offspring

357: that mutates is forced to choose a single point mutation that is

358: non-lethal (all non-lethal choices are equiprobable). In blind ant

359: dynamics (also called gradient random walks), a point mutation is

360: chosen at random (lethal or not): if it is non-lethal, it is

361: accepted, while if it is lethal, it is refused and the offspring is

362: taken to be \emph{non-mutant}. Both the blind and myopic dynamics

363: can be implemented with or without drift, according to the method

364: sketched in section~\ref{sec:EvolutionaryDynamics}. Although these

365: dynamical processes may appear to be somewhat artificial, they do

366: provide solvable cases. Furthermore, van~Nimwegen et

367: al.~\cite{NimwegenCrutchfield99} have shown that in the limit of

368: small $M \mu$, the standard evolutionary dynamics converges to the

369: blind ant dynamics.

370:

371: Consider first the case without drift, in which the sampling of the

372: parents takes place \emph{without replacement}. Then each lineage

373: performs an independent random walk on the whole neutral network. As

374: shown by van~Nimwegen et~al.~\cite{NimwegenCrutchfield99}, the

375: steady-state distribution for blind ants is uniform on the neutral

376: network, while for myopic ants the probability of being at a node of

377: the neutral network that has degree $d$ is a constant times $d$.

378: (Note that if $d$ is the same for all nodes as in regular networks,

379: we obtain the uniform distribution as expected.) Given the

380: steady-state distribution, the histogram $P(h)$ of Hamming distances

381: is determined from the fact that the lineages are independent. At

382: large $L$, one expects it to become peaked, neutral networks being

383: widely spread out in genotype space. Furthermore, the steady-state

384: distribution and the $P(h)$ are $M$ and $\mu$ independent.

385:

386: %

387: \begin{figure}

388: %\psfrag{P(h)}{$P(h)$}\psfrag{h}{$h$}

389: \begin{center}

390:     \includegraphics[width=7cm]{hammingmyp2.eps}

391: \end{center}

392: \caption{Distribution of Hamming distances $h$ for different values

393: of $\mu$ for the myopic ant dynamics in the presence of drift. The

394: population size is $M=20$.} \label{hammingmy2}

395: \end{figure}

396: %

397:

398: Let us now allow for drift. The sampling of the parents takes place

399: \emph{with replacement}. Interestingly, the steady-state

400: distribution of the population is not affected by the drift: this is

401: due to the fact that the heterogeneity of the neutral network does

402: not affect the chances of appearance of an offspring. However, the

403: lineages are no longer independent since the population typically

404: shares a recent common ancestor. As a consequence, the

405: Hamming-distance distribution $P(h)$ is not determined from the

406: steady-state distribution: it is non-trivial and depends

407: on $M\mu$ at large $M$. Since this result holds in a far

408: more general context which includes selection, we postpone its proof

409: to the next section. \emph{Corrections} to the $M \mu$ scaling are

410: $O(1/M)$, with typically a rather small prefactor. In the $M \mu \to

411: 0$ limit, just as in the general population

412: dynamics~\cite{NimwegenCrutchfield99}, one recovers the blind ant

413: dynamics.

414:

415: For illustration, we show in Figures \ref{hammingmy2} and

416: \ref{hammingmy1} the distribution of Hamming distances $h$ for a

417: population with myopic ant moves in the presence of drift on a

418: neutral network. We see that in spite of the heterogeneity of the

419: network, the $M \mu$ scaling holds just as for a homogeneous network.

420:

421: %

422: \begin{figure}

423: %\psfrag{P(h)}{$P(h)$}\psfrag{h}{$h$}

424: \begin{center}

425:     \includegraphics[width=7cm]{hammingmyp1.eps}

426: \end{center}

427: \caption{Distribution of Hamming distances $h$ for different values

428: of $M \mu$ for the myopic ant dynamics in the presence of drift. For

429: each $M \mu$ we studied $M=20$ and $M=40$. The data with same $M

430: \mu$ superimpose perfectly, exhibiting the $M \mu$ scaling.}

431: \label{hammingmy1}

432: \end{figure}

433: %

434:

435:

436: \section{Dynamics with selection and drift}

437: \label{sec:SelectionAndDrift}

438:

439: \subsection{The infinite-population (quasi-species) limit}

440: \label{subsec:InfinitePop}

441:

442: In the infinite-population limit, $M \to \infty$ (with $\mu$ fixed),

443: drift is absent and only the effects of mutation and

444: selection show up; this is called the quasi-species

445: regime~\cite{Eigen71,EigenSchuster77,EigenMcCaskill89}. As shown by

446: van~Nimwegen et~al.~\cite{NimwegenCrutchfield99}, the steady-state

447: distribution is given in this limit by the dominant eigenvector

448: $\Psi_0$ of a linear operator defined by the adjacency matrix of the

449: network. This eigenvector does not depend on $\mu$, but its

450: eigenvalue $\lambda_0$ does. If each individual in the population

451: produced only one offspring, and the unviable ones were eliminated

452: without replacement, then the population size would decay by a

453: factor $\lambda_0$ at each generation. Relating this decay to the

454: mutational robustness $R_{\mu}$ of the steady-state population we

455: immediately obtain

456: \begin{equation}

457: \label{eq:RmuInfPop}

458: \lambda_0 = (1-\mu) + \mu R_{\mu},

459: \end{equation}

460: which yields the population's average neutrality $\langle d

461: \rangle_{\infty}$ via equation~(\ref{eq:Rmu}): both $R_{\mu}$ and

462: $\langle d \rangle_{\infty}$ are $\mu$ independent.

463:

464: These results can be compared to the population neutrality in the

465: case of blind/myopic ant moves, in which there is no explicit

466: fitness-based removal of individuals. In the case of blind ants, the

467: probability of residing on any node of the network is uniform. The

468: average neutrality seen by such a walker is just equal to the

469: average network neutrality. For a myopic ant, the probability of

470: choosing any node on the network is proportional to the degree of

471: the node. Neutrality is slightly higher in this case and is given by

472: the ratio of second and first moments of the node degrees. A

473: standard variational principle~\cite{SoshnikovSudakov03} shows that

474: the population neutrality is always at least as large as the network

475: neutrality, defined as the average degree of the neutral network.

476: %This property follows from a variational principle as

477: %follows. The linear operator ${\cal O}=(1-\mu){\cal I} + \mu {\bf A}/3L$,

478: %which evolves the genotype

479: %density from one generation to the next, is symmetric and thus has a real

480: %eigenvalue spectrum. The time evolution over many steps projects

481: %the initial density onto the eigenvector $\Psi_0$ of largest eigenvalue

482: %$\lambda_0$. Then the standard variational principle for the spectrum

483: %of ${\cal O}$ states that

484: %\begin{equation}

485: %\label{eq:variational} \lambda_\phi=\frac{\sum_{ij} \phi_i

486: %\,\mathcal{O}_{ij}\, \phi_j}{\sum_i \phi_i^2},

487: %\end{equation}

488: %is maximized when $\phi = \Psi_0$; this maximum

489: %is $\lambda_0=1-\mu + \mu \alpha_0 / 3L$ where

490: %$\alpha_0$ is the largest eigenvalue of the adjacency

491: %matrix. A direct calculation shows

492: %$\alpha_0 = \average{d}_{\Psi_0}$ which is

493: %the same as the population neutrality. Now if we take $\phi$

494: %to be the uniform distribution $\phi_0$, the variational expression

495: %gives

496: %\begin{equation}

497: %\lambda_{\phi_0} = (1-\mu) + \frac{\mu}{3L} \average{d}_{\phi_0}

498: %\end{equation}

499: %where $\average{d}_{\phi_0}$ is the network neutrality,

500: %proving the desired inequality.

501: Of course on a homogenous graph, population neutrality, blind ant

502: neutrality and myopic ant neutrality are all equal to the network

503: neutrality. We refer the reader to the work of van~Nimwegen

504: et~al.~\cite{NimwegenCrutchfield99} for a thorough discussion.

505:

506: %As for the

507: %population spread, there are no fluctuations from generation to

508: %generation and $P(h)$ is simply determined by the

509: %eigenvector $\Psi_0$ (and thus the steady state genotype density).

510:

511:

512: \subsection{Case of an infinite population at fixed $M \mu$}

513: \label{subsec:InfinitePopMmu}

514:

515: In biological situations, the mutation rate is very small, making

516: the large-$M$ limit of little use: indeed to get to the infinite $M$

517: limit just discussed, the quantity $M \mu$ must be large.

518: When $M \mu$ is finite, drift occurs, and it is of interest

519: to consider the large $M$ limit at fixed $M \mu$. In

520: the \emph{absence} of selection, the $M \mu$ scaling law has been derived

521: by Derrida and Peliti~\cite{DerridaPeliti91}. In the presence of

522: selection, it has been exhibited from numerical simulations by

523: van~Nimwegen et~al.~\cite{NimwegenCrutchfield99}. It turns out

524: that this scaling follows

525: from the dynamical equations, whether or not there is selection:

526: these equations are invariant under a simultaneous rescaling of

527: time, $M$ and $\mu$, as long as $M \mu$ is fixed, as we now show.

528:

529: Let $N_i(g)$ denote the number of individuals in the population

530: residing at the neutral network node $i$ at generation $g$. To go to

531: the next generation, let us choose $M$ individuals at random with

532: replacement, and let them mutate with probability $\mu$. One then

533: has

534: \begin{equation}

535: \label{eq:stochasticN}

536: N_i(g+1) = \sum_{\langle ij \rangle} P_j(M

537: p_j) + G_i(M q_i, M q_i),

538: \end{equation}

539: where $\sum_{\langle ij \rangle}$ denotes the sum over the nodes $j$

540: which are nearest neighbors of node $i$. Also, $p_j = \mu N_i / (3 L

541: M)$ is the probability to choose an individual of genotype $j$ and

542: to have it mutate to node $i$; $P_j(M p_j)$ is a Poisson random

543: variable of mean $M p_j$; $q_i = (1-\mu) N_i /M$ is the probability

544: to choose an individual of genotype $i$ and to leave it without

545: mutation; $G_i(M q_i, M q_i)$ is the sum of $M$ 0--1 random

546: variables of mean $q_i$, and thus at large $M$ is a Gaussian whose

547: mean and variance are both given by $M q_i$.

548:

549: Since we are interested in the large $M$ limit with $M \mu$ fixed,

550: let us define $x_i =N_i/M$ to be the fraction of individuals

551: residing on node $i$. If we average equation~(\ref{eq:stochasticN}),

552: we recover

553: the deterministic (quasi-species) evolution equations for the

554: $x_i$'s. But fluctuations do \emph{not} go away at large $M$ if $M

555: \mu$ is fixed, instead the intensity of drift goes to a limit.

556: Extracting the

557: mean from the Gaussian of equation~(\ref{eq:stochasticN}), the

558: stochastic evolution equations for the $x_i$ take the form

559: \begin{equation}

560: \label{stochasticX} x_i(g+1) = (1-\mu) x_i(g) + \sum_{\langle ij

561: \rangle} P_j(M p_j) / M + G_i(0, M q_i) / M.

562: \end{equation}

563: Summing this expression over $M$ steps we obtain in

564: the limit of large $M$ (and thus $\mu \to 0$):

565: \begin{equation}

566: \label{stochasticXX}

567: \Delta x_i \equiv x_i(g+M) - x_i(g) = - M \mu x_i +

568: \sum_{\langle ij \rangle} (M \mu) x_j/3L + G_i(0,x_i),

569: \end{equation}

570: where we have used the fact that the sum of the $M$ Poisson

571: variables contributes via its mean but its variance (equal to $p_j$)

572: becomes negligible. Clearly, the steady-state behavior of these

573: stochastic equations depends only on the scaling parameter $M \mu$.

574: As expected, by averaging these equations, we recover the

575: quasi-species evolution dynamics. Strictly speaking, we have shown

576: that there is a limit when evolving $M$ individuals, neglecting the

577: decrease in the population size. But restoring the population to a

578: fixed size $M$ involves no mutations and so falls into the standard

579: case of drift for a single genotype; that stochastic process also

580: reaches a limit at large $M$ when $M \mu$ is fixed, so we can

581: conclude that the full process (which maintains the population size

582: at $M$) also depends only on the scaling variable $M \mu$ as $M \to

583: \infty$. Note that within this scaling framework, we recover the

584: $\mu$ fixed, $M\to \infty$ case of equation~(\ref{eq:RmuInfPop}) by

585: taking $M \mu \to \infty$: we shall see that the corrections to this

586: limit are linear in $1/M \mu$. One can also consider the limit $M

587: \mu \to 0$: there, the population structure typically collapses to

588: just one genotype at a time, and as shown by van Nimwegen et

589: al.~\cite{NimwegenCrutchfield99}, the effective dynamics reduces to a

590: random walk on the neutral network, so the population neutrality is

591: given by the network neutrality.

592:

593:

594: \subsection{Hamming distances in a finite population}

595: \label{subsec:HammingSelection}

596:

597: We studied the distribution of Hamming distances between individuals

598: in the steady state population evolving on our three RNA neutral

599: networks. Here we report our results only for the largest network

600: (of size 31484), as qualitatively similar results were obtained with

601: the two other sizes.

602:

603: %

604: \begin{figure}

605: %\psfrag{P(h)}{$P(h)$}\psfrag{h}{$h$}

606: \begin{center}

607: \includegraphics[width=7cm]{hammingmmu0p2rds.eps}

608: \end{center}

609: \caption{Distribution of Hamming distances at $M \mu=0.2$ when

610: $M=10,20,40$ and 100. The data superpose perfectly.}

611: \label{rdsmmu0p2}

612: \end{figure}

613: %

614: \begin{figure}

615: \begin{center}

616: \includegraphics[width=7cm]{hammingmmu2rds.eps}

617: \end{center}

618: \caption{Distribution of Hamming distances at $M \mu=2$ when

619: $M=10,20,40$ and 100. The data collapse is excellent when $M \ge 20$.}

620: \label{rdsmmu2}

621: \end{figure}

622: %

623: \begin{figure}

624: \begin{center}

625: \includegraphics[width=7cm]{hammingmmu5rds.eps}

626: \end{center}

627: \caption{Distribution of Hamming distances at $M \mu=5$ when

628: $M=20,40$ and 100. Here the data collapse requires $M \ge 40$.}

629: \label{hammingmmu5}

630: \end{figure}

631: %

632: \begin{figure}

633: \begin{center}

634: \includegraphics[width=7cm]{hammingmmu10rds.eps}

635: \end{center}

636: \caption{Distribution of Hamming distances at $M \mu=10$ when

637: $M=20,40,100,200$. The $M \mu$ scaling is good only for $M \ge

638: 100$.} \label{hammingmmu10}

639: \end{figure}

640:

641: If we fix $M \mu$, we obtain $M$-independent results when $M$ is

642: large, in agreement with the scaling law derived in the previous

643: section. However, the value of $M$ at

644: which this scaling arises depends on $M \mu$. As shown in figures

645: \ref{rdsmmu0p2}, \ref{rdsmmu2}, \ref{hammingmmu5} and

646: \ref{hammingmmu10}, the scaling sets in for larger and larger values

647: of $M$ as the value of $M \mu$ grows. For instance, when $M \mu=10$,

648: one needs $M \ge 100$ to really see the $M \mu$ scaling convincingly

649: (cf.~figure~\ref{hammingmmu10}). Moreover corrections to this

650: scaling go as $O(1/M)$, i.e., they are of the same form as we found

651: in section~\ref{sec:DynamicsWithoutSelection} in the absence of

652: selection. This is a generic property that will be further studied

653: in section~\ref{sec:Bottlenecks}.

654:

655: Within the $M \mu$ scaling, we see the population spread increases

656: monotonically with $M \mu$. In particular, as $M \mu \to 0$, the

657: spread goes to $0$, while as $M \mu \to \infty$, the population

658: spreads across the whole neutral network.

659:

660:

661: \subsection{Neutrality in a finite population}

662:

663: We first examine the population neutrality $\langle d

664: \rangle_\mathrm{M}$. We are interested in seeing how large $M$ should

665: be for the $M \mu$ scaling to set in, considering in particular

666: the dependence on the neutral network size.

667:

668: \subsubsection{Small neutral network}

669:

670: Figures~\ref{small1}(a) and (b) show the average neutrality

671: $\average{d}_\mathrm{M}$ as a function of $M$ and $M \mu$

672: respectively, for $\mu=0.01$, $0.1$ and $0.25$.

673: \begin{figure}[htb]

674: \begin{center}

675: \begin{tabular}{cc}

676: \includegraphics[width=7cm]{smallrdsm.eps}

677: &  \includegraphics[width=7cm]{smallmmu.eps}\\

678: %\mbox{\bf ($M$ dependence)} & \mbox{\bf ($M \mu$ dependence)}

679: \end{tabular}

680: \end{center}

681: \caption{Population neutrality $\average{d}_\mathrm{M}$ as a

682: function of $1/M$ and $1/M \mu$ for $\mu=0.01,0.1$ and $0.25$

683: in our small size network. One can

684: see the $M \mu$ scaling with small corrections due to

685: finite $M$; note also the linear behavior at the origin.} \label{small1}

686: \end{figure}

687:

688: %

689: \begin{figure}

690: \begin{center}

691:     \includegraphics[width=7cm]{smallrdsmmu.eps}

692: \end{center}

693: \caption{$1/M$ law for the corrections to $M \mu$ scaling.} \label{small3}

694: \end{figure}

695: %

696:

697: The average population neutrality depends both on $M$ and $M \mu$.

698: If $\mu$ is fixed and we take $M \to \infty$, we recover the

699: quasi-species limit which is

700: $\mu$-independent. Similarly, for fixed $M \mu$, we

701: obtain the $M \mu$ scaling regime by taking large $M$, just as in

702: van~Nimwegen et~al.~\cite{NimwegenCrutchfield99}. Furthermore, we

703: see that the corrections to the large-$M \mu$ limit of population

704: neutrality are linear in  $1/(M \mu)$ (cf.~the linear behavior at

705: the origin in Figure~\ref{small1}(b)). Finally, the approach to the

706: large $M$ limit at fixed $M \mu$ has measurable $1/M$ corrections.

707: (We also find that the value of $M \mu$ affects the time taken to

708: approach the steady state.) At fixed $M \mu$, the dependence on $M$

709: of the population neutrality or of the distribution of $d$ is rather

710: mild, though more marked than in the myopic or blind ant dynamics.

711: For several values of $M \mu$, we show these $1/M$ corrections in

712: figure~\ref{small3}. From all these data, we conclude that the

713: population neutrality has the form

714: \begin{equation}

715: \average{d}_\mathrm{M} = f(M \mu) \left(1+\frac{A(M

716: \mu)}{M} + \cdots\right),

717: \label{eq:CorrectionsToScaling}

718: \end{equation}

719: where $f(M \mu)=\average{d}_{\infty}$ is the $M \rightarrow \infty$

720: limit of $\average{d}_\mathrm{M}$ at given $M \mu$ and

721: \begin{equation}

722: f(M \mu) = f(\infty) \left(1 + \frac{B}{M \mu} + \cdots \right),

723: \label{eq:CorrectionsToInfinity}

724: \end{equation}

725: describes how the large $M \mu$ limit converges to the quasi-species

726: case. Finally, as shown by van~Nimwegen

727: et~al.~\cite{NimwegenCrutchfield99}, $f(M \mu)$

728: tends to the network neutrality as $M \mu \to 0$.

729:

730:

731: \subsubsection{Medium and large networks}

732:

733: Similar results are found for our medium and large networks.

734: Figure~\ref{med} shows the population neutrality

735: as a function of $1/M \mu$,

736: exhibiting good $M \mu$ scaling.

737: %

738: \begin{figure}[htb]

739: \begin{center}

740: \begin{tabular}{cc}

741: \includegraphics[width=7cm]{medmmu.eps} &

742:     \includegraphics[width=7cm]{largemmu.eps} \\

743: %\mbox{\bf (medium network)} & \mbox{\bf (large network)}

744: \end{tabular}

745: \end{center}

746: \caption{Average neutrality as a function of $1/M \mu$ for different

747: values of $\mu$ on our medium and large size networks.} \label{med}

748: \end{figure}

749: %

750: However the corrections to this scaling are larger than those

751: found in the small network; in fact, to see the asymptotic $1/M$ law

752: for the corrections to the $M \mu$ scaling, one has to go to quite

753: large values of $M$. Finally, as before, the

754: quasi-species limit is reached with corrections $O(1/(M \mu))$.

755:

756: Other results that seems to apply generally is that for fixed $\mu$,

757: the population neutrality (and thus the population robustness)

758: increases with $M$, but it decreases at fixed $M \mu$.

759: Mathematically, this means that $A > 0$ in

760: equation~(\ref{eq:CorrectionsToScaling}) while $B < 0$ in

761: equation~(\ref{eq:CorrectionsToInfinity}).

762:

763:

764: \section{Dynamics with selection but low drift}

765: \label{sec:SelectionLowDrift}

766:

767: \subsection{Framework}

768: \label{subsec:framework}

769:

770: Random drift reduces the population spread and thus delays the

771: approach to the large $M$ limit at fixed $\mu$. Lowering

772: the drift

773: would thus allow one to reach the large $M$ limit more easily.

774: Furthermore, one would have a higher mutational robustness of the

775: steady-state population for a given population size; this higher

776: survival probability suggests that biological mechanisms for

777: reducing drift~\cite{WhitlockIngvarsson00} could be selected for in

778: natural populations. In this section we study dynamics in which

779: selective pressures are high but the drift is particularly low.

780:

781: Our dynamics on a neutral network is defined as follows

782: so that drift effects are minimized:

783: %

784: \begin{itemize}

785: \item For a given population of size $M$ and mutation probability

786: per individual $\mu$, we have $M \mu$ of the individuals in the

787: population hop to any of their $3L$ neighbors; the others are

788: unchanged. If a mutation brings an individual off the neutral

789: network, we kill it, otherwise we keep it.

790: \item We find the number of individuals that were killed and replace them

791: by randomly cloning individuals from the remaining population.

792: \end{itemize}

793: Note that in these dynamics we perform a random sampling \emph{only

794: for a part of the population}; in fact, in the absence of selection,

795: there is no drift at all.

796:

797: To understand the essential difference from the usual dynamics, we

798: use a branching process representation of the evolution and the

799: cloning. An individual is represented by a point; at a given time

800: there are $M$ points for $M$ individuals (note that as always $M$

801: represents the number of individuals, not the number of different

802: genotypes). From one generation (time) to the next,

803: $M \mu$ points are replicated leading to branchings

804: while the other points just proceed without branching. The time

805: evolution generates branching processes in which each individual is

806: represented by a point and is connected to its parent at the

807: previous time by an edge. Following this branching process, we

808: obtain the descendants of a given individual; one can also

809: consider two individuals, follow their edges backwards till one reaches

810: their most recent common ancestor. Because of the drift, at large

811: enough times the whole population belongs to just one connected

812: component.

813:

814: Following Derrida and Peliti~\cite{DerridaPeliti91}, let us

815: investigate the genealogy of individuals going back in generations.

816: For a subset of $k$ individuals in the population, let $w_k(t)$ be

817: the probability that their ancestors $t$ generations ago were all

818: different. To calculate this quantity, we first determine the

819: probability $x_k$ that $k$ individuals have $k$ distinct immediate

820: ancestors (parents). Define $q=\mu(1-\average{R_\mu}_{M})$ where

821: $\average{R_\mu}_{M}$ is the mean robustness of the population. Then

822: assuming $q$ is small and neglecting node to node fluctuations in

823: neutrality, we have

824: \begin{equation}

825: x_k \approx 1 - \frac{k (k-1) q}{(1-q)M} \ .

826: \end{equation}

827: Taking the generation-to-generation processes to be independent, one

828: has $w_k(t) = x_k^t$, which can be approximated to leading order in

829: $q$ by

830: %

831: \begin{equation}

832: \label{DP2:eq}

833:     w_k(t) \approx \exp\left(-\frac{k(k-1) q}{M} ~ t\right).

834: \end{equation}

835: %

836: Hence, unlike the random drift case, the time scale is proportional

837: to $M/q$. We thus define a rescaled dimensionless time variable

838: %

839: \begin{equation}

840:     \tau = q t / M.

841: \end{equation}

842: %

843: The probability that the two individuals shared a common ancestor at

844: most $t$ generations back is given by $1-w_2(t) = 1-\exp(-2\tau)$.

845: Thus the probability density $p(\tau)$ that the most recent common

846: ancestor of the two individuals arose between $\tau M/q$ and

847: $(\tau+d \tau) M/q$ generations ago is given by

848: %

849: \begin{equation}

850:     p(\tau)=\frac{\rmd (1-w_2(\tau))}{\rmd \tau} = 2 \exp(-2\tau).

851: \end{equation}

852: Therefore the characteristic time for the most recent common

853: ancestor scales as $M/\mu$, while it scales as $M$ in the usual

854: dynamics with drift of section~\ref{sec:SelectionAndDrift}.

855:

856: The distribution of times to the most recent

857: common ancestor can be used to

858: obtain the Hamming-distance distribution of the steady-state

859: population. Let $\phi_{\nu}(t)$ be the probability that two random

860: walkers in genotype space find themselves at a Hamming distance

861: $\nu$ given that they coincided $t$ generations before. We have

862: \begin{equation}

863: \label{DP3:eq}

864: \phi_{\nu}(t)= \frac{\Gamma[L+1]}{2^{\nu} \Gamma[\nu+1]

865: \Gamma[L-\nu+1]} (1-\exp^{-2 \mu t})^{\nu},

866: \end{equation}

867: where $L$ is the genome length. Then $P_{\nu}$, the probability that

868: the Hamming distance between two individuals in the steady state

869: population is equal to $\nu$, is given by

870: %

871: \begin{equation}

872:     P_{\nu} = \int_{0}^{\infty} \rmd \tau\, p(\tau) \phi_{\nu}(t).

873: \end{equation}

874: %

875: Using the expression of $\phi_{\nu}(t)$ from equation~(\ref{DP3:eq}),

876: one sees that $P_{\nu}$ depends on $M$ but not much

877: on $\mu$; in particular, one has

878: a well defined $\mu \to 0$ limit at fixed $M$. This is to be

879: contrasted with the random drift case where $P_{\nu}$ depends mainly

880: on $M \mu$.

881:

882: \subsection{Consequence for the genotypic diversity}

883: \label{subsec:diversity}

884:

885: A high level of genotypic diversity in a population is usually

886: advantageous for survival. Here we study how low drift can greatly

887: enhance this diversity by examining two measures of the number of

888: different genotypes, namely the actual number $g_M$ (which is

889: frequency independent), and the inverse participation ratio $G_M$ of

890: the genotypic abundances. Explicitly, for a population of size $M$,

891: if the number of individuals with genotype $i$ is $m_i$, we define

892: %

893: \begin{equation}

894: G_M = \frac{(\sum m_i)^2}{\sum m_i^2},

895: \end{equation}

896: where the sum runs over all the different genotypes present in the

897: population.

898: %

899: \begin{figure}

900: \begin{center}

901:     \includegraphics[width=7cm]{drift.eps}

902: \end{center}

903: \caption{Plot of genotypic diversity ($g_M$ and $G_M$) for dynamics with

904:   selection and drift as

905: function of $\ln M$ at $M \mu =0.4$.}

906: \label{fig:driftd}

907: \end{figure}

908: %

909:

910: %

911: \begin{figure}

912: \begin{center}

913:     \includegraphics[width=7cm]{lessdrift.eps}

914: \end{center}

915: \caption{Plot of genotypic diversity ($g_M$ and $G_M$)

916: as a function of population size

917: $M$ for dynamics with selection but low drift at $M \mu =0.4$.}

918: \label{fig:lessdriftd}

919: \end{figure}

920: %

921:

922: We show in figure~\ref{fig:driftd} the two different measures of

923: genotypic diversity for dynamics with selection and drift. The

924: bottom curve corresponds to $G_M$, the top one to $g_M$. We find

925: that the ``absolute'' genotypic diversity $g_M$, (taking into

926: account genotypes of arbitrarily low frequencies), grows

927: logarithmically (and thus rather slowly) with $M$ at fixed $M \mu$.

928: On the other hand, the rare genotypes contribute less to the inverse

929: participation ration $G_M$, and we find that this measure of

930: genotypic diversity saturates in the large $M$ limit at fixed $M

931: \mu$.

932:

933: In Fig.~\ref{fig:lessdriftd} we display the same quantities for our

934: low-drift dynamics. We now see that both measures of diversity grow

935: linearly with $M$ at fixed $M \mu$; thus each genotype in the

936: population arises just a few times as $M \to \infty$ if $M \mu$ is

937: fixed. Clearly the reduction of drift in this dynamics allows for a

938: high genotypic diversity.

939:

940:

941: \subsection{Hamming distances}

942: We can study Hamming distances as was done in

943: section~\ref{subsec:HammingSelection} for our low-drift dynamics. We

944: find that the distribution of Hamming distances depends strongly on

945: $M$ but not much on $\mu$ and approaches a $\mu$-independent limit

946: as $M$ grows large (figures~\ref{hammings1} and~\ref{hammings2}).

947: This is consistent with the above calculation for the scaling laws.

948: As expected, for fixed $M$, there is a non-trivial limit

949: distribution as $\mu \to 0$.

950: %

951: \begin{figure}

952: \begin{center}

953:     \includegraphics[width=7cm]{avHist.eps}

954: \end{center}

955: \caption{Distribution of Hamming distances

956: for evolutionary dynamics with low drift. The distributions

957: depend strongly on $M$ and only very weakly on $\mu$.} \label{hammings1}

958: \end{figure}

959: %

960: \begin{figure}

961: \begin{center}

962:     \includegraphics[width=7cm]{noMmu.eps}

963: \end{center}

964: \caption{Distribution of Hamming distances for two different values of

965: population size (20 and 40) when $M \mu$=0.4, showing the absence

966: of $M \mu$ scaling when the dynamics has low drift.}

967: \label{hammings2}

968: \end{figure}

969:

970: \subsection{Population neutrality}

971:

972: In the infinite population limit, the value of population neutrality

973: is independent of drift effects, and it is also independent of

974: $\mu$. Consider first our small neutral network with $489$ nodes and

975: network neutrality $10.4499$; we determined the largest eigenvalue

976: of the adjacency matrix (via

977: \textsl{Mathematica}~\cite{Mathematica05}), obtaining $\langle d

978: \rangle_{\infty} = 11.5107$. In a finite population we find that

979: $\average{d}_M$ approaches $\average{d}_{\infty}$, with $1/M$

980: corrections that do not depend much on the mutation rate. In

981: particular, we find

982: \begin{equation}

983:     \average{d}_M = \average{d}_{\infty} \left(1+\frac{A(\mu)}{M} \right),

984: \label{eq:nc}

985: \end{equation}

986: where $A=-0.328\pm 0.002$ for $\mu =0.1$ and $A=-0.317 \pm 0.005$

987: for $\mu=0.25$. The corresponding fits are shown in

988: figure~\ref{small}.

989: %

990: \begin{figure}[htb]

991: \begin{center}

992: \begin{tabular}{cc}

993: \includegraphics[width=7cm]{fitm01.eps} &

994:     \includegraphics[width=7cm]{fitm025.eps} \\

995: \mbox{\bf ($\mu$ =0.1)} & \mbox{\bf ($\mu$ =0.25)}

996: \end{tabular}

997: \end{center}

998: \caption{Population neutrality vs $1/M$ on the small size network.

999: The dotted line is a linear fit as in Eq.~(\ref{eq:nc}).}

1000: \label{small}

1001: \end{figure}

1002:

1003: %\subsubsection{Medium-size neutral network}

1004: On our medium-size neutral network (with $5784$ nodes), we find that

1005: the network average neutrality is equal to $10.6888$, while

1006: $\average{d}_{\infty} = 12.592 \pm 0.0002$. We again find a $1/M$

1007: convergence as in Eq.~(\ref{eq:nc}) with $A=-0.752 \pm 0.035$ for

1008: $\mu=0.05$ and $A= -0.662 \pm 0.02$ for $\mu=0.25$

1009: (figure~\ref{medium}). Just as for the small neutral network, $A$

1010: does not depend much on $\mu$.

1011: %

1012: \begin{figure}[htb]

1013: \begin{center}

1014: \begin{tabular}{cc}

1015: \includegraphics[width=7cm]{mfitm005.eps} &

1016:     \includegraphics[width=7cm]{mfitm025.eps} \\ [0.4cm]

1017: \mbox{\bf ($\mu$ =0.05)} & \mbox{\bf ($\mu$ =0.25)}

1018: \end{tabular}

1019: \end{center}

1020: \caption{Population neutrality vs $1/M$ on the medium size

1021: network. The dotted line is a linear fit as in Eq.~(\ref{eq:nc}).}

1022: \label{medium}

1023: \end{figure}

1024: %

1025: \begin{figure}[htb]

1026: \begin{center}

1027: \begin{tabular}{cc}

1028: \includegraphics[width=7cm]{lfitm005.eps} &

1029:     \includegraphics[width=7cm]{lfitm025.eps} \\

1030: \mbox{\bf ($\mu$ =0.05)} & \mbox{\bf ($\mu$ =0.25)}

1031: \end{tabular}

1032: \end{center}

1033:  \caption{Population neutrality vs $1/M$ on the large size network.

1034: The dotted line is a linear fit as in Eq.~(\ref{eq:nc}).}

1035: \label{large}

1036: \end{figure}

1037: %

1038:

1039: %\subsubsection{Large neutral network}

1040: Finally, on the large neutral network (with $31484$ nodes), we find

1041: the network neutrality to be $12.116$, whereas $\average{d}_{\infty}

1042: = 15.434$. Figure~\ref{large} confirms the $1/M$ convergence with

1043: $A=-0.927 \pm 0.045$ for $\mu =0.05$ and $A=-0.944 \pm 0.044$ for

1044: $\mu =0.25$.

1045:

1046: The overall pattern is thus that the data are well represented by

1047: equation~(\ref{eq:nc}),with an $A(\mu)$ that grows with increasing

1048: network size and depends slowly on $\mu$. We have also checked

1049: directly that $\average{d}_M$ is rather insensitive to the value of

1050: $\mu$ (cf.~figure~\ref{Mdep}). Furthermore, in all cases, there is

1051: no $M \mu$ scaling (cf.~figure~\ref{Mmudep}), in contrast with what

1052: happens for the case of standard drift (see

1053: section~\ref{sec:SelectionAndDrift}).

1054: %

1055: \begin{figure}

1056: \begin{center}

1057: \includegraphics[width=7cm]{Mplot.eps}

1058: \end{center}

1059: \caption{Population neutrality $\average{d}_M$ vs $M$ for

1060: $\mu =0.01,0.05,0.1$ and $0.25$ on the large neutral network,

1061: showing the insensitivity to $\mu$.}

1062: \label{Mdep}

1063: \end{figure}

1064: %

1065:

1066: %

1067: \begin{figure}

1068: \begin{center}

1069: \includegraphics[width=7cm]{Mmuplot.eps}

1070: \end{center}

1071: \caption{Population neutrality $\average{d}_M$ vs $M \mu$ for $\mu

1072: =0.01,0.05,0.1$ and $0.25$ on the large neutral network,

1073: showing the absence of $M \mu$ scaling. }

1074: \label{Mmudep}

1075: \end{figure}

1076: %

1077:

1078: \subsection{Distribution of neutrality}

1079:

1080: As a last point, let us consider the whole distribution $P_M(d)$

1081: rather than just the average population neutrality $\average{d}_M$.

1082: In figure~\ref{dis} we display several cases of interest for our

1083: largest neutral network. The left-most curve is for genotypes chosen

1084: randomly and uniformly from the neutral network. The next curve on

1085: the right is for a population of size $M=50$ undergoing selection

1086: with low drift; there are in fact two data sets displayed, one for

1087: $\mu=0.1$ and one for $\mu=0.25$. The last curve is for the same

1088: algorithm at $M=1000$ for three values of $\mu$, namely $\mu=0.1$,

1089: $0.25$ and $0.5$.

1090: %

1091: \begin{figure}

1092: \begin{center}

1093: \includegraphics[width=7cm]{dist.eps}

1094: \end{center}

1095: \caption{The continuous line shows the distribution of

1096: the number of neighbors for

1097: random nodes on the largest network (with 31484 nodes). The curve closest

1098: to it shows the steady state distribution for two data sets,

1099: with $\mu=0.1$ and $0.25$ when the population size is 50:

1100: they superpose extremely well.

1101: The third curve corresponds to $M=1000$ and

1102: $\mu=0.1,0.25$ and $0.5$, again with excellent superposition.

1103: Superposition (independence on $\mu$) becomes

1104: exact when $M=\infty$.} \label{dis}

1105: \end{figure}

1106: %

1107:

1108: Several comments are in order. First, as $M$ increases, the overall

1109: trend is for $P_M(d)$ to shift to larger $d$; this is in agreement

1110: with the general property that mutational robustness grows with

1111: increasing genotypic diversity. Second, there is hardly any

1112: dependence of these data on $\mu$, a feature particular

1113: to dynamics having reduced drift.

1114:

1115:

1116: \section{Reaching the large $M$ limit and barriers in the fitness landscape}

1117: \label{sec:Bottlenecks}

1118:

1119: \subsection{Motivation}

1120: For large populations in the presence of drift, the relevant

1121: parameter appears to be $M \mu$, as we have seen in

1122: sections~\ref{sec:DynamicsWithoutSelection} and

1123: \ref{sec:SelectionAndDrift}. When one wishes to evaluate the

1124: large-$M$ limit at given $\mu$, it is better to minimize drift as in

1125: section~\ref{sec:SelectionLowDrift}, especially when $\mu$ is small,

1126: since $\mu$ plays little role in this modified dynamics. We also saw

1127: that the corrections to the $M=\infty$ limit go like $O(1/M)$ on

1128: neutral networks. Simple arguments~\cite{CerfMartin95} suggest that

1129: this is a generic property of growth with

1130: diffusion phenomena, and we shall now confirm this on toy

1131: landscapes: the $O(1/M)$ corrections are not an artefact of the

1132: fitness being 0 and 1 as we have assumed so far. Furthermore, we

1133: wish to get some insight into the \emph{size} of this correction.

1134: For our three neutral networks, we found that the factor $A$ of the

1135: $A/M$ correction grows with increasing neutral network size;

1136: however, barriers (entropic or fitness) in the landscape are likely

1137: to affect $A$, as we will now illustrate using a few toy landscapes.

1138:

1139: \subsection{Evolution in a toy fitness landscape}

1140:

1141: To build up our intuition, we will consider a space where genotypes

1142: are parametrized by a real number, and mutations correspond to small

1143: changes of this number. Evolution in low-dimensional fitness

1144: landscapes has been considered by several authors in the recent

1145: years and has led to a number of insights (see, e.g.,

1146: \cite{tsimring96,tsimring97,peng03,rouzine03}). In this case, it is

1147: convenient to consider a continuous-time limit because this allows

1148: for a Fokker-Planck formulation. We can start with all individuals

1149: at the same position or place them randomly in the landscape. After

1150: some time, the population reaches a well defined steady state. For

1151: the numerical simulation of such evolutionary dynamics, we

1152: discretize time using a time step $\Delta t$; this then

1153: gives the following update rules:

1154: \begin{itemize}

1155:     \item At each

1156: step (small) mutations arise; this means that the genotype $x$ is

1157: changed by $\Delta x$ where $\Delta x$ is a Gaussian random

1158: variable of standard deviation $\sqrt{2 D \Delta t}$~\cite{Vankampen81}.

1159:     \item Given the new positions of the individuals, we allow for

1160: replication according to fitness given by a function

1161: $-V(x)$ in the Fokker-Planck language.

1162: $V(x)$ is low (or even negative) if

1163: the genotype has a high fitness; it is large and positive for an

1164: unfit genotype. Then for an interval $\Delta t$ of time, an

1165: individual of genotype $x$ will be killed with probability

1166: $1-\exp(-V(x)\Delta t)$ if $V(x) > 0$; if instead $V(x) < 0$, a clonal

1167: birth will be produced with probability $\exp(-V(x)\Delta t) - 1$.

1168: Following standard practice, if one wants

1169: to keep the population size fixed on average near some

1170: target value $M$, one simply shifts additively $V(x)$ so that the

1171: expected population size is precisely $M$.

1172: \end{itemize}

1173:

1174: %\subsection{Infinite population limit}

1175: In the $M \to \infty$ limit, the details associated with keeping the

1176: population at its target value no longer matter and the overall

1177: process can be formulated as a rate equation. Up to the rescaling of

1178: the population to keep its size fixed, the density of genotypes

1179: $\rho(x)$ follows the deterministic equation

1180: \begin{equation}

1181:     \label{eq:schroedinger}

1182:     \frac{\partial \rho(x,t)}{\partial t} = D

1183:     \frac{{\partial}^2 \rho(x,t)} {\partial x^2} -V(x) \rho(x,t) .

1184: \end{equation}

1185: This is a linear evolution law that is the

1186: continuum analog of the quasi-species dynamics.

1187: At large times the \emph{shape} of

1188: $\rho(x)$ converges to the eigenfunction of the linear operator on

1189: the right hand side whose eigenvalue is largest. One can recognize

1190: \eref{eq:schroedinger} as being an (imaginary time) Schr�dinger

1191: equation. The problem of the steady-state distribution is thus

1192: mathematically a simple one that can be solved analytically for

1193: particular choices of the function $V(x)$. Now we can address the

1194: question of \emph{how} the large $M$ limit is reached.

1195:

1196: \subsection{Harmonic well}

1197: We first consider  the case where $V(x)= x^2$, which corresponds in

1198: the Schr�dinger equation framework to diffusion in a harmonic well,

1199: a case with no barriers. In the $M\to \infty$ limit, the probability

1200: distribution of the population in this landscape is

1201: $P(x)=\exp(-x^2/2)/\sqrt{2 \pi}$.

1202:

1203: For a finite population of size $M$, we evaluate numerically the

1204: steady state distribution $P_M(x)$. We see a clear convergence of

1205: this distribution to its large-$M$ limit, with $1/M$ corrections:

1206: %

1207: \begin{equation}

1208: P_M(x) = P(x)\left(1+\frac{K(x)}{M}\right) +

1209: \Or\left(M^{-2}\right).

1210: \end{equation}

1211: %

1212: The $1/M$ nature of the convergence clearly appears in

1213: figure~\ref{fig:HarmonicWell}, where we see that the amplitude of

1214: these corrections is small. This kind of convergence has been

1215: justified before~\cite{CerfMartin95} in the context of population

1216: algorithms for solving linear evolution operators. Furthermore, it

1217: is possible to show that the correction function $K(x)$ goes to a

1218: constant at large $x$. When we obtain such a data collapse, we know

1219: $M$ is large enough for one to extract $P(x)$.

1220: %

1221: \begin{figure}[htb]

1222: \begin{center}

1223: \begin{tabular}{cc}

1224: \includegraphics[width=7cm]{beforescalingH.eps} &

1225:     \includegraphics[width=7cm]{afterscalingH.eps} \\

1226: \mbox{\bf (a)} & \mbox{\bf (b)}

1227: \end{tabular}

1228: \end{center}

1229: \caption{(a) Relative deviation $P_M(x)/P(x)-1$ vs $x$ for

1230: $M=6,8,10,12,15$ and 20. (b) Data collapse plot: $M(P_M(x)/P(x)-1)$ for

1231: $M=6,8,10,12,15$ and 20.}

1232: \label{fig:HarmonicWell}

1233: \end{figure}

1234:

1235: \subsection{Symmetric double well}

1236: We now consider the case of a landscape with two degenerate optima

1237: separated by a barrier (passage of low fitness). For that we take

1238: $V(x)$ to be an even polynomial of degree 4 in $x$: $V(x)=V(-x)$.

1239: Since $P(x)$ is not analytically known, we examine instead the

1240: quantity $M_1 M_2 (P_{M_1}(x) - P_{M_2}(x))/(M_1 - M_2)$ for

1241: different population sizes $M_1$ and $M_2$. If the convergence goes

1242: as $O(1/M)$, then, provided $M_1$ and $M_2$ are sufficiently large,

1243: the data should collapse onto a limit function. This is indeed what

1244: we find; the case $V(x) = x^2 + 0.1 x^4$ is used for illustration in

1245: figure~\ref{fig:DoubleWell}.

1246: %

1247: \begin{figure}

1248: %\psfrag{x}{$x$}

1249: \begin{center}

1250: \includegraphics[width=7cm]{doublewellscaled.eps}

1251: \end{center}

1252: \caption{Data collapse plot for $M_1 M_2 (P_{M_1}(x) - P_{M_2}(x))/(M_1

1253: - M_2)$ vs $x$, for several population sizes $M_1$ and $M_2$, in

1254: the case of a symmetric double well. We have taken $(M_1,M_2)=

1255: (6,10),(10,20),(6,20),(20,50)$}

1256: \label{fig:DoubleWell}

1257: \end{figure}

1258:

1259: \subsection{Case of an asymmetric double well}

1260: To go from one fitness peak to another, one has to cross a barrier.

1261: The previous double well has a symmetric steady state, and even with

1262: a small population, this symmetry is realized. However when

1263: the well is asymmetric, finite population effects will be quite

1264: larger. Indeed, in the $M\to\infty$ limit, even a small non

1265: symmetric part ($V(x)$ not even in $x$) will lead to a distribution

1266: practically concentrated in one well. (This is called the ``flea and

1267: elephant phenomenon'' well known in

1268: quantum mechanics: when the barrier between the two wells is

1269: high, even a tiny difference in $V$ between the two sides leads to a

1270: big effect, just as when a little itching on an elephant's shoulder

1271: leads it to put all of its weight onto one side.) When the

1272: population is finite, this effect is not so evident, and the two

1273: wells remain nearly symmetrically populated. One has to go to large

1274: population sizes $M$ in order to come closer to the $M\to\infty$

1275: limit.

1276:

1277: %

1278: \begin{figure}[htb]

1279: %\psfrag{PM}[c]{$M(P_M(x)/P(x)-1)$}\psfrag{x}{$x$}

1280: \begin{center}

1281: \includegraphics[width=8cm]{asym.eps}

1282: \end{center}

1283: \caption{Data collapse plot for

1284: $M_1 M_2 (P_{M_1}(x) - P_{M_2}(x))/(M_1 - M_2)$ vs $x$, for

1285: several population sizes $M_1$ and $M_2$, in

1286: the case of an asymmetric well. $M=200,250,300,400,500$. }\label{asym:fig}

1287: \end{figure}

1288: %

1289:

1290: To investigate this effect quantitatively, we consider the

1291: asymmetric potential $V(x)=-x^2+b x^4+cx$ with $b=0.05$ and

1292: $c=0.002$, corresponding to an asymmetry of roughly $0.1\%$. We

1293: found the $1/M$ convergence law,

1294: but had to go to $M \approx 100$ to observe

1295: the data collapse. This is shown in figure~\ref{asym:fig}; note the

1296: large scale of the $y$ axis compared to the symmetric double well

1297: case: the $1/M$ corrections are much larger here. When the barrier

1298: height is increased, one needs even larger $M$ values to see the

1299: $M=\infty$ limit: the proper balance of population on each side of

1300: the barrier sets in very slowly in $M$. In landscapes with more than

1301: one dimension, there can also be entropic barriers, that is passages

1302: that are narrow but not of particularly low fitness; such cases are

1303: relevant for general neutral networks.

1304:

1305: \section{Summary and conclusions}

1306: \label{sec:Conclusions}

1307:

1308: In general, an evolving population undergoes mutation, selection and

1309: random drift. In this work we quantified the effect of these

1310: processes to untangle the different effects, using neutral networks

1311: and toy fitness landscapes for illustration. In the case of infinite

1312: populations, there have been many studies. If $M \mu\to\infty$, ($M$

1313: being the population size and $\mu$ the mutation rate), drift is

1314: absent and one recovers the quasi-species

1315: limit~\cite{Eigen71,EigenSchuster77,EigenMcCaskill89}, with results

1316: that developed by van Nimwegen et al.~\cite{NimwegenCrutchfield99}

1317: in the context of neutral networks. For finite $M \mu$ ($M \to

1318: \infty$), drift effects are important; there, in the absence of

1319: selection, Derrida and Peliti~\cite{DerridaPeliti91} derived a

1320: number of important results. In this work we have considered the

1321: case $M \mu$ finite with selection, for both $M$ finite and

1322: infinite. We derived the $M \mu$ scaling even

1323: in the presence of selection.

1324: The (finite $M$) \emph{corrections} to

1325: this $M \mu$ scaling are $O(1/M)$, be there selection or not.

1326: When $M=\infty$, we find that the quasi-species limit is reached

1327: via $1/M \mu$ corrections. In all cases,

1328: $M \mu$ plays the role of an effective population size. These

1329: laws are summarized in equations~(\ref{eq:CorrectionsToScaling})

1330: and (\ref{eq:CorrectionsToInfinity}). We also

1331: found that the amplitude of the correction terms showed a slow

1332: increase with neutral network size. In practice, the $M \mu$ scaling

1333: sets in at relatively small values of $M$. Furthermore, at fixed $M

1334: \mu$ we showed that the genotypic diversity $g_M$ of the population

1335: increases only logarithmically as a function of $M$.

1336:

1337: Finally, we considered a dynamics with low drift in

1338: section~\ref{sec:SelectionLowDrift}. Drift is effectively reduced by

1339: a factor $\mu$ and thus genotypic diversity

1340: always  grows \emph{linearly}

1341: with population size even if $M \mu$ is fixed. One thereby avoids

1342: the $M \mu$ scaling law, a useful property

1343: if one wishes to evaluate the

1344: large-$M$ limit at small $\mu$. Nevertheless, reaching this limit

1345: can be seriously hindered by fitness or entropic barriers in the

1346: fitness landscape as we saw in section~\ref{sec:Bottlenecks}.

1347:

1348: \ack This work was supported by the EEC's FP6 IST Programme under

1349: contract IST-001935, EVERGROW. We thank F. Hospital and A. Wagner

1350: for their comments. LP is grateful to the LPTMS for its hospitality

1351: when this work was started.

1352:

1353:

1354: \section*{References}

1355: %\begin{thebibliography}{99}

1356:

1357: \addcontentsline{toc}{chapter}{\protect\bibname}

1358: \providecommand{\newblock}{}

1359: \begin{thebibliography}{10}

1360: \expandafter\ifx\csname url\endcsname\relax

1361:   \def\url#1{{\tt #1}}\fi

1362: \expandafter\ifx\csname urlprefix\endcsname\relax\def\urlprefix{URL }\fi

1363: \providecommand{\eprint}[2][]{\url{#2}}

1364: % Bibliography created with iopart-num v2.0

1365: % /biblio/bibtex/contrib/iopart-num

1366:

1367: \bibitem{HerronFreeman03}

1368: Herron J~C and Freeman S~R 2003 {\em Evolutionary Analysis\/} (Prentice Hall)

1369:

1370: \bibitem{Wright32}

1371: Wright S 1932 {\em Proc. of the Sixth Int. Congress on Genetics\/} vol~1 ed

1372:   Jones D~F (Austin, TX) pp 356--366

1373:

1374: \bibitem{Gavrilets04}

1375: Gavrilets S 2004 {\em Fitness Landscapes and the Origin of Species\/}

1376:   (Princeton: Princeton University Press)

1377:

1378: \bibitem{Eigen71}

1379: Eigen M 1971 {\em Naturwissenschaften\/} {\bf 58} 465

1380:

1381: \bibitem{EigenSchuster77}

1382: Eigen M and Schuster P 1977 {\em Naturwissenschaften\/} {\bf 64} 541

1383:

1384: \bibitem{EigenMcCaskill89}

1385: Eigen M, McCaskill J~S and Schuster P 1989 {\em Adv. Chem. Phys.\/} {\bf 75}

1386:   149--263

1387:

1388: \bibitem{Stadler02}

1389: Stadler P~F 2002 {\em Biological Evolution and Statistical Physics\/} ed

1390:   {L\"assig} M and Valleriani A (Berlin: Springer) pp 183--204

1391:

1392: \bibitem{Schuster02}

1393: Schuster P 2002 {\em Biological Evolution and Statistical Physics\/} ed

1394:   {L\"assig} M and Valleriani A (Berlin: Springer) pp 56--83

1395:

1396: \bibitem{LiebermanHauert05}

1397: Lieberman E, Hauert C and Nowak M 2005 {\em Nature\/} {\bf 433} 312--316

1398:

1399: \bibitem{TaylorRudenbert04}

1400: Taylor C, Fudenberg D, Sasaki A and Nowak M 2004 {\em Bull. Math. Biol.\/} {\bf

1401:   66} 1621--1644

1402:

1403: \bibitem{RocaGuesta06}

1404: Roca C~P, Cuesta J~A and Sanchez A 2006 {\em Phys. Rev. Lett.\/} {\bf 97}

1405:   158701

1406:

1407: \bibitem{AntalRedner06}

1408: Antal T, Redner S and Sood V 2006 {\em Phys. Rev. Lett.\/} {\bf 96} 188104

1409:

1410: \bibitem{DerridaPeliti91}

1411: Derrida B and Peliti L 1991 {\em Bull. Math. Biol.\/} {\bf 53} 355--382

1412:

1413: \bibitem{NimwegenCrutchfield99}

1414: {van Nimwegen} E, Crutchfield J~P and Huynen M 1999 {\em Proc. Natl. Acad. Sci.

1415:   USA\/} {\bf 96} 9716--9720

1416:

1417: \bibitem{Ewens04}

1418: Ewens W 2004 {\em Mathematical Population Genetics - I. Theoretical

1419:   Introduction\/} (New York: Springer Verlag)

1420:

1421: \bibitem{FontanaStadler93}

1422: Fontana W, Stadler P~F, Bornberg-Bauer E~G, Griesmacher T, Hofacker I~L, Tacker

1423:   M, Tarazona P, Weinberger E~D and Schuster P 1993 {\em Phys. Rev. E\/} {\bf

1424:   47} 2083--2099

1425:

1426: \bibitem{LipmanWilbur91}

1427: Lipman D~J and Wilbur W~J 1991 {\em Proceedings: Biological Sciences\/} {\bf

1428:   245} 7--11

1429:

1430: \bibitem{GrGiegerich96}

1431: {Gr\"uner} W, Giegerich U, Strothmann D, Reidys C, Weber J, Hofacker I, Stadler

1432:   P and Schuster P 1996 {\em Monath. Chem.\/} {\bf 127} 375--389

1433:

1434: \bibitem{KauffmanLevin87}

1435: Kauffman S and Levin S 1987 {\em J. Theor. Biol.\/} {\bf 128} 11--45

1436:

1437: \bibitem{FlyvbjergHenrik92}

1438: Flyvbjerg H and Lautrup B 1992 {\em Phys. Rev. A\/} {\bf 46} 6714--6723

1439:

1440: \bibitem{NimwegenCrutchfield00}

1441: van Nimwegen E and Crutchfield J~P 2000 {\em Bull. Math. Biol.\/} {\bf 62}

1442:   799--848

1443:

1444: \bibitem{ReidysSchuster95}

1445: Reidys C, Schuster P and Stadler P 1997 {\em Bull. Math. Biol.\/} {\bf 59}

1446:   339--397

1447:

1448: \bibitem{FontanaSchuster98}

1449: Fontana W and Schuster P 1998 {\em J. Theor. Biol.\/} {\bf 194} 491--515

1450:

1451: \bibitem{SchusterStadler02}

1452: Schuster P and Stadler P 2002 {\em Complexity\/} {\bf 8} 34--42

1453:

1454: \bibitem{Vienna95}

1455: {The {V}ienna package} 1995 Http://www.tbi.univie.ac.at/$\sim$ivo/RNA/

1456:

1457: \bibitem{HuynenStadler96}

1458: Huynen M~A, Stadler P~F and Fontana W 1996 {\em Proc. Natl. Acad. Sci. USA\/}

1459:   {\bf 93} 397--401

1460:

1461: \bibitem{Hughes96}

1462: Hughes B~D 1996 {\em Random Walks and Random Environments\/} (Oxford:

1463:   Clarendon)

1464:

1465: \bibitem{Orr03}

1466: Orr H~A 2003 {\em J. Theor. Biol.\/} {\bf 220} 241--247

1467:

1468: \bibitem{SoshnikovSudakov03}

1469: Soshnikov A and Sudakov B 2003 {\em Comm. Math. Phys.\/} {\bf 239} 53--63

1470:

1471: \bibitem{WhitlockIngvarsson00}

1472: Whitlock M, Ingvarsson P and Hatfield T 2000 {\em Heredity\/} {\bf 84} 452

1473:

1474: \bibitem{Mathematica05}

1475: {Mathematica} 2005 {W}olfram {R}esearch, http://www.wolfram.com

1476:

1477: \bibitem{CerfMartin95}

1478: Cerf N~J and Martin O~C 1995 {\em Int. J. of Modern Physics C\/} {\bf 6}

1479:   693--723

1480:

1481: \bibitem{tsimring96}

1482: Tsimring L~S, Levine H and Kessler D~A 1996 {\em Phys. Rev. Lett.\/} {\bf 76}

1483:   4440--4443

1484:

1485: \bibitem{tsimring97}

1486: Kessler D~A, Levine H~Ridgway D and Tsimring L 1997 {\em J. Stat. Mech.\/} {\bf

1487:   87} 519--544

1488:

1489: \bibitem{peng03}

1490: Peng W~Q, Gerland U, Hwa T and Levine H 2003 {\em Phys. Rev. Lett.\/} {\bf 90}

1491:   088103

1492:

1493: \bibitem{rouzine03}

1494: Rouzine I~M, Wakeley J and Coffin J~M 2003 {\em Proc. Natl. Acad. Sci. USA\/}

1495:   {\bf 100} 587--592

1496:

1497: \bibitem{Vankampen81}

1498: {van Kampen} N~G 1981 {\em Stochastic Processes in Physics and Chemistry\/}

1499:   (Amsterdam: North-Holland)

1500:

1501: \end{thebibliography}

1502:

1503: \end{document}

1504:

1505:

1506: %\bibliographystyle{iopart-num}

1507: %\bibliography{biology}

1508: %\end{document}

1509:

1510:

1511: