0611:q-bio0611044/cg.tex

1: \documentclass[pre,twocolumn,floatfix]{revtex4}

2: \usepackage{color}

3: \usepackage{graphicx}

4: \usepackage{amsfonts}

5: \usepackage[intlimits]{amsmath}

6: \usepackage{amssymb}

7: \usepackage{dcolumn}

8: \usepackage{multirow}

9:

10: \usepackage{tensind}

11: \tensorformat{none}

12: \tensordelimiter{?}

13:

14:

15: \newcommand{\ts}[1]{\ensuremath{_\mathit{#1}}}

16: \newcommand{\tsup}[1]{\ensuremath{^\mathit{#1}}}

17: \DeclareMathOperator{\diag}{diag}

18: \newcommand{\Tp}{\mathsf{T}}

19: \newcommand{\g}{\boldsymbol{g}}

20: \newcommand{\norm}[1]{\ensuremath{\|#1\|}}

21: \newcommand{\evat}[1]{\bigr\rvert_{#1}}

22: \newcommand{\Evat}[2]{\left. #1\right\rvert_{#2}}

23: \newcommand{\dbyd}[1]{\frac{d}{d#1}}

24: \newcommand{\pdbyd}[1]{\frac{\partial}{\partial #1}}

25: \newcommand{\dd}{\text{d}}

26: \newcommand{\E}[1]{\langle #1\rangle}

27: \newcommand{\Eb}[1]{\bigl\langle #1\bigr\rangle}

28: \newcommand{\EB}[1]{\Bigl\langle #1\Bigr\rangle}

29: \newcommand{\four}[1]{\ensuremath{\widetilde #1}\,}

30: \newcommand{\unc}[1]{\ensuremath{\widehat #1 }\,}

31: \newcommand{\hav}[1]{\ensuremath{\bar #1 }\,}

32:

33: \DeclareMathOperator{\Adtemp}{{Ad}}

34: \DeclareMathOperator{\adtemp}{{ad}}

35: \DeclareMathOperator{\ADtemp}{{AD}}

36: \DeclareMathOperator{\aDtemp}{{aD}}

37:

38: \DeclareMathOperator{\tr}{{tr}}

39:

40: \newcommand{\Ad}{\ensuremath{\Adtemp}}

41: \newcommand{\ad}{\ensuremath{\adtemp}}

42: \newcommand{\AD}{\ensuremath{\ADtemp}}

43: \newcommand{\aD}{\ensuremath{\aDtemp}}

44:

45: \newcommand{\ax}{\shortparallel}

46: \newcommand{\compl}{\overline}

47: \newcommand{\strandchange}{E}

48:

49:

50: % \newcommand{\new}[1]{{\color{red}#1}}

51: \newcommand{\new}[1]{{#1}}

52:

53: %opening

54:

55:

56:

57: \begin{document}

58:

59: \title{DNA: From rigid base--pairs to semiflexible polymers}

60:

61: \date{\today}

62:

63: \author{Nils B. Becker}

64: \affiliation{Max-Planck-Institut f\"ur Physik komplexer Systeme,\\

65:  N\"othnitzer Str.~38, 01187 Dresden, Germany}

66:

67: \author{Ralf Everaers}

68: \affiliation{Max-Planck-Institut f\"ur Physik komplexer Systeme,\\ N\"othnitzer

69:  Str.~38, 01187 Dresden, Germany}

70: \affiliation{Laboratoire de Physique, ENS Lyon,\\

71: 46, all\'ee d'Italie,

72: 69364 Lyon cedex 07, France

73: }

74:

75: \begin{abstract}

76: The sequence--dependent elasticity of \new{double-helical} DNA on a nm length scale can be captured by the rigid base--pair model, whose strains are the relative position and orientation of adjacent base--pairs.  Corresponding elastic potentials have been obtained from all--atom MD simulation and from high--resolution structural data. On the scale of a hundred nm, DNA is successfully described by a continuous worm--like chain model with homogeneous elastic properties characterized by a set of four elastic constants, which have been directly measured in single--molecule experiments. We present here a theory that links these experiments on different scales, by systematically coarse--graining the rigid base--pair model \new{for random sequence DNA} to an effective worm--like chain description. The average helical geometry of the molecule is exactly taken into account in our approach.

77: % We calculate the mean values of the resulting structural and elastic worm--like chain parameters as well as their scale--dependent variability for random sequence chains.

78: We find that the available microscopic parameter sets predict qualitatively similar mesoscopic parameters. The thermal bending and twisting persistence lengths computed from MD data are 42 and 48 nm, respectively. The static persistence lengths are generally much higher, in agreement with cyclization experiments. All microscopic parameter sets predict negative twist--stretch coupling.  The variability and anisotropy of bending stiffness in short random chains lead to non--Gaussian bend angle distributions, but become unimportant after two helical turns.

79: \end{abstract}

80:

81: \maketitle

82:

83: \section{Introduction}

84: The sequence--dependent elastic properties of DNA play a vital role in basic biological processes such as chromatin organization \new{\cite{widom01,segal06}} and gene regulation,\new{ via indirect readout \cite{koudelka87,hines98,hegde02,prevost93} or via DNA looping \cite{schleif72,schleif92,rippe01}}. \new{The structure and elasticity of double helical DNA on the nm-scale are often described using rigid base--pair chain (RBC) models, in which the relative orientation and translation of adjacent base--pairs (bp) specify the conformation of the molecule  \cite{calladine84,coleman03}. }Parameter sets for rigid base--pair step elastic potentials were obtained from molecular dynamics simulation \cite{lankas03} and from an analysis of high resolution crystal structure data \cite{olson98}. We have found qualitative but not quantitative agreement between these different potentials in a recent study on indirect readout in protein--DNA binding \cite{becker06}.

85:

86: On a mesoscopic length scale, \new{it is possible to directly measure  force--extension relations for DNA in single--molecule experiments \cite{charvin04}. For small external forces,  DNA behaves as a worm--like chain (WLC)\ \cite{bustamante94}}, i.e.~an inextensible semiflexible polymer with a single parameter, the \new{bending} persistence length, and no explicit sequence dependence.

87: An extension of the classical WLC model, reflecting the chiral symmetry of the DNA double helix, includes coupled twisting and stretching degrees of freedom \cite{strick96,marko97,kamien97,moroz97}. These become important in a force regime where the DNA molecule is already pulled straight \new{but not yet overstretched \cite{cluzel96}. Interestingly, recent measurements indicate that DNA overtwists when stretched in the linear response regime \cite{lionnet06,gore06}.}

88:

89: In this article we establish a relation between these different levels of detail. Specifically, we coarse--grain a RBC to the WLC scale, while taking the average helical geometry of the chain exactly into account. As a result, we obtain the average helical parameters and the full set of stiffnesses for bend, twist, stretch, as well as twist--stretch coupling.

90:

91: It has been pointed out \cite{trifonov88} that the total apparent persistence length of a WLC is composed of a static part which originates from the sequence--dependent equilibrium bends of the molecule, and a dynamic part induced by thermal fluctuations. Their relative contributions have been measured \cite{bednar95,vologodskaia02}.  In analogy to this approach, we consider the variability of static conformations of a random RBC and from these derive the static and thermal persistence lengths.

92:

93: We compare the mesoscopic predictions for DNA stiffness resulting from different microscopic parametrizations in some detail, relating them to recent measurements in single--molecule experiments.

94:

95:

96: \section{Rigid base pair model of DNA}

97:

98: % {\color{blue}Be careful to talk about moments, never about an assumed Gaussian distribution.}

99:

100:

101: In canonical double--stranded DNA, Watson--Crick base pairs are stacked into a helical column. We can fix a Cartesian coordinate frame to the center of each base pair in a standard way \cite{dickerson89,olson01}, effectively averaging out internal distortions within the base pair. By convention, the $z$-axis of this right-handed orthonormal frame is normal to the base pair plane and points towards the 3' direction of the preferred strand, while the $x$-axis points towards the major groove.

102:

103: The configuration in space of the chain is specified by the sequence of these frames, i.e.~by a $3\times 3$ rotation matrix $R$ together with three Cartesian coordinates of the origin $p$, for each base pair. Only for homogeneous, idealized and non-fluctuating B-DNA do all frames lie on a straight line, with their body $z$-axes pointing into a single direction. Generically, the frames are displaced and rotated away from this idealized arrangement, due to both thermal fluctuations and sequence--dependent \new{variations in the} equilibrium conformations.

104:

105: We represent the rotation and translation of the $k+1$-th base pair frame relative to the $k$-th frame by a $4\times 4$ matrix, written in block form as

106: \begin{equation}

107: g_{k\,k+1} = \begin{bmatrix}

108:  R_{k\,k+1} & p_{k\,k+1} \\

109: 0\;0\;0 & 1

110: \end{bmatrix}.

111: \end{equation}

112: Throughout the article, matrices in square brackets will have exactly this block structure.

113: In idealized B-DNA along the $z$-axis, $p_{k\,k+1}\propto d_3=(0,0,1)$, and $R_{k\,k+1}$ is a rotation about $d_3$.

114:

115: This \new{so-called homogeneous} representation \new{(see e.g.\ \cite{murray})} has the advantage that the translation and rotation relating frames $k$ and $l>k$ can be obtained by matrix multiplication along the chain,

116: \begin{equation}

117: g_{k\,l}=g_{k\,k+1}g_{k+1\,k+2}\cdots g_{l-1\,l}.

118: \end{equation}

119: For convenience we fix the lab frame on the first base pair, so $g_{1k}$ represents the frame $k$ relative to the lab. Observe that $g_{k\,k+1}=g_{1k}^{-1}g_{1\,k+1}$ and $g_{kk}=e$, the identity matrix.
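As a minimal worked illustration of this composition rule, multiplying two adjacent steps in block form gives
\begin{equation}
g_{k\,k+2}=g_{k\,k+1}g_{k+1\,k+2}=\begin{bmatrix}
 R_{k\,k+1}R_{k+1\,k+2} & R_{k\,k+1}\,p_{k+1\,k+2}+p_{k\,k+1} \\
0\;0\;0 & 1
\end{bmatrix},
\end{equation}
i.e.\ the second displacement is first expressed in frame $k$ and then added to the first. Similarly, since $R_{k\,k+1}$ is orthogonal, the inverse of a step is
\begin{equation}
g_{k\,k+1}^{-1}=\begin{bmatrix}
 R_{k\,k+1}^\Tp & -R_{k\,k+1}^\Tp\, p_{k\,k+1} \\
0\;0\;0 & 1
\end{bmatrix},
\end{equation}
which is the form one would use to evaluate products such as $g_{1k}^{-1}g_{1\,k+1}$.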

120: % The $g$-matrices constitute a matrix representation of the special Euclidean group $\mathit{SE}(3)$ of frame transformations in $\mathbb R^3$.

121: % {\color{blue}Do we need that?}

122: % More mathematical details can be found in, e.g.\ \cite{murray, borri00}.

123: % , sometimes called the homogeneous representation

124:

125: \section{Fluctuations}

126:

127: At finite temperature, a base pair step $g=g_{k\,k+1}$ in a RBC fluctuates around a mean or equilibrium value $g_0$. To parametrize these fluctuations, we first introduce coordinates suitable to describe small deviations from $g_0$. We will then characterize thermal fluctuations and the sequence randomness in terms of their second moments. In our model, \new{we neglect possible couplings between neighboring base-pair steps \cite{arauzo-bravo05,dixit05}.  As will be explained below, the requirement of a meaningful base sequence nonetheless introduces some nearest--neighbor correlations in expectation values for random DNA}.

128:

129: \subsection{Exponential coordinates}

130:

131: Any continuous group can be locally parametrized by its infinitesimal generators via the exponential map. In the $g$--matrix representation, this is the ordinary matrix exponential $\exp$, and the group generators $\{X_i\}$ are $4\times 4$ matrices.  Explicitly, in block form,

132: \begin{subequations}

133: % \begin{align}

134: % X_i &=\begin{bmatrix}

135: %  \epsilon_{i} & 0_{3\times 1} \\

136: %  0_{1\times 3} & 0

137: % \end{bmatrix}\text{, where } (\epsilon_i)_{jk}=\epsilon_{jik}\text{, and }\\

138: % X_{i+3}&=\begin{bmatrix}

139: %  0_{3\times 3} & e_i \\

140: % 0_{1\times 3} & 0

141: % \end{bmatrix}\text{, with }(e_i)_j=\delta_{ij}.

142: % \end{align}

143: \begin{align}\label{eqn:epsmatrix}

144: X_i &=\begin{bmatrix}

145:  \epsilon_{i} & 0 \\

146:  0 & 0

147: \end{bmatrix}\text{, with } (\epsilon_i)_{jk}=\epsilon_{jik}\text{ and }\\

148: X_{i+3}&=\begin{bmatrix}

149:  0 & d_i \\

150: 0 & 0

151: \end{bmatrix}\text{, with }(d_i)_j=\delta_{ij}.

152: \end{align}

153: \end{subequations}

154: Here, $\epsilon_{ijk}$ is the totally antisymmetric (Levi-Civita) tensor, $\delta_{ij}$ is the Kronecker delta, and $1\leq i,j,k \leq 3$.

155: A rotation around the $d_i$ axis is generated by $X_i$ while a translation along $d_i$ is generated by $X_{i+3}$. The generators satisfy the usual commutation relations of angular and linear momentum.
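For illustration, these relations can be written out with the sums over $k$ made explicit (a sketch that follows from \eqref{eqn:epsmatrix} together with $\epsilon_i d_j=d_i\times d_j$):
\begin{equation}
\begin{gathered}
{}[X_i,X_j]=\sum_{k=1}^3\epsilon_{ijk}X_k,\qquad
[X_i,X_{j+3}]=\sum_{k=1}^3\epsilon_{ijk}X_{k+3},\\
[X_{i+3},X_{j+3}]=0.
\end{gathered}
\end{equation}
For instance, block multiplication gives $X_1X_5=X_6$ and $X_5X_1=0$, so that $[X_1,X_5]=X_6$: a translation along $d_2$ followed by a rotation about $d_1$ differs from the reverse order by a translation along $d_3$.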

156: Any group element $g$ can be written as

157: \begin{equation}

158:  g=\begin{bmatrix}

159: R(\xi) & p(\xi)\\ 0 & 1

160: \end{bmatrix}=\exp[{\xi}^iX_i]

161: \end{equation}

162: which defines the ${\xi}^i$ as exponential coordinates of $g$

163: \footnote{We conventionally always sum over all upper--lower index pairs.}.

164: The coordinate vector can be split up into two three--dimensional parts, $\xi=(\omega,v)$. Both have a geometrical meaning: $\omega$ points along the rotation axis of $R$ with $\norm{\omega}$ equal to the total rotation angle, and $v$ is the initial tangent $\dbyd{s}\evat{0}p(s \xi)$, see fig.\ \ref{fig:screw}.

165: All of $\mathit{SE}(3)$ except for a measure zero set is covered one-to-one by the coordinate range $\{\xi\in \mathbb R^6|\:\norm{\omega}<\pi\}$.
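For reference, the exponential can be evaluated in closed form. The following standard expressions (Rodrigues' formula and its translational counterpart) are quoted as an illustration. The shorthands $\theta=\norm{\omega}$ and $\Omega=\omega^i\epsilon_i$, and the symbol $\mathbf{1}$ for the $3\times 3$ identity, are introduced only for this purpose:
\begin{equation}
\begin{aligned}
R(\xi)&=\mathbf{1}+\frac{\sin\theta}{\theta}\,\Omega+\frac{1-\cos\theta}{\theta^2}\,\Omega^2,\\
p(\xi)&=\Bigl(\mathbf{1}+\frac{1-\cos\theta}{\theta^2}\,\Omega+\frac{\theta-\sin\theta}{\theta^3}\,\Omega^2\Bigr)v.
\end{aligned}
\end{equation}
In the limit $\theta\to 0$ these reduce to $R\to\mathbf{1}$ and $p\to v$. Replacing $\xi$ by $s\xi$ and differentiating at $s=0$ reproduces the initial tangent $v$.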

166: \begin{figure}

167: % \psfrag{om}{$\omega$}

168: % \psfrag{v}{$v$}

169: % \psfrag{g}{$g$}

170: % \psfrag{gp}{$g'$}

171: % \psfrag{e}{$e$}

172: % \psfrag{ep}{$e'$}

173: % \psfrag{gax}{$g\ts{ax}$}

174:  \centering

175:  \includegraphics[width=.99\columnwidth]{screw2}

176:  \caption{Frame geometry. A base pair step, connecting the base--pair fixed material frames $e$ and $g$ (left hand side). The frame origin trace of the corresponding screw motion is shown in blue. It has initial tangent $v$. By right multiplication with $g\ts{ax}$, the same step can be described using the frames $e'$ and $g'$ (red, right hand side). They lie on the helical axis and point into its direction $\omega$.}\label{fig:screw}

177: \end{figure}

178:

179: We denote the equilibrium conformation of a step by $g_0=\exp[\xi_0^iX_i]$. Instead of considering small additive fluctuations in $\xi_0$, we also use exponential coordinates for $g_0^{-1}g$. Then, a fluctuating step is written as $g = g_0 \exp[\xi^iX_i] = \exp[\xi_0^iX_i]\exp[\xi^iX_i]$.

180: % {\color{blue}displayed?}

181: % \begin{equation}

182: % g = g_0

183: % \begin{bmatrix}

184: % R(\xi) & p(\xi)\\ 0 & 1

185: % \end{bmatrix}

186: % = g_0 \exp[\xi^iX_i].

187: % \end{equation}

188: % The $\xi^i$ are exponential coordinates of $g$, \emph{based at} $g_0$.

189: %

190: Writing $\xi_0=\mathrm{(Ti_0,Ro_0,Tw_0,Sh_0,Sl_0,Ri_0)}$, one can check that the Tilt, Roll, Twist, Shift, Slide and Rise equilibrium values so defined have the correct symmetries under change of preferred strand as required by the Cambridge convention \cite{dickerson89}. This left--invariant formulation has the advantage that deformations with respect to different equilibrium positions are directly comparable and no distortions due to curvilinear coordinates occur. \new{It is essential for our formalism which relates fluctuations given with respect to different frames (see below).}

191: Note however that this definition of base--pair step parameters differs from those used in available software such as \cite{lavery89,lu03}. We explain in appendix \ref{sec:coordconv} how to convert between our exponential coordinates and the coordinate set used in \cite{lu97,lu03}. A related approach makes use of exponential coordinates for the rotation part of the frame transformation only \cite{babcock94}.

192:

193: \subsection{On-axis transformation}\label{sec:onaxis}

194:

195: The screw motion $s\mapsto \exp[s\xi^i X_i]$ joins the identity frame $e$ with $g$ as $s$ increases from 0 to 1, see fig.\ \ref{fig:screw}. Its screw axis is determined by a vector from the origin of $e$ to a point on the axis, given by $p\ts{ax}=\norm\omega^{-2}{\omega\times v}$, and by its direction, $\omega$. It is the `local helical axis' \cite{lavery89} associated with the base pair step $g$. When concatenating many \emph{identical} steps $g$ one generates a RBC with frame origins lying on a regular helix with this axis.
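That $p\ts{ax}$ lies on the screw axis can be checked in one line. Under the screw motion, a point at position $q$ relative to the origin of $e$ has initial velocity $\omega\times q+v$, and the axis consists of those points whose velocity is parallel to $\omega$. Inserting $q=p\ts{ax}$ and expanding the double cross product,
\begin{equation}
\omega\times p\ts{ax}+v=\frac{\omega\times(\omega\times v)}{\norm{\omega}^2}+v=\frac{\omega\cdot v}{\norm{\omega}^2}\,\omega ,
\end{equation}
which is indeed parallel to $\omega$. The same expression shows that the displacement along the axis accumulated over one step is $(\omega\cdot v)/\norm{\omega}$, while the rotation angle about the axis is $\norm{\omega}$.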

196:

197: In addition to $p\ts{ax}$ we can define a matrix $R\ts{ax}$ which rotates $e$ such that $\omega$ becomes its third direction vector. One choice is to take $p\ts{ax}$ as the second new direction.

198: In combination, we then get

199: % {\color{blue}to appendix?}

200: \begin{equation}\label{eqn:gax}

201:  g\ts{ax}=\begin{bmatrix}

202:            R\ts{ax} & p\ts{ax} \\

203:            0 &1

204:           \end{bmatrix}

205:  =\begin{bmatrix}

206:            \frac{(\omega\times v)\times\omega}{\norm{(\omega\times v)\times\omega}} & \frac{\omega\times v}{\norm{\omega\times v}} & \frac{\omega}{\norm{\omega}} & \frac{\omega\times v}{\norm\omega^2} \\

207:            0 &0&0&1

208:           \end{bmatrix},

209: \end{equation}

210: which takes $e$ to a frame $e'=eg\ts{ax}=g\ts{ax}$ sitting on the helix axis with its third direction pointing along it. One can check that $g'=gg\ts{ax}$ also has these properties. The primed, on-axis frames are `local helical axis systems' in the terminology of \cite{lavery89}. In the following, we reserve the name $g\ts{ax}$ for that frame transformation \eqref{eqn:gax} which takes the \emph{equilibrium} step $g_0$ onto its own axis.

211:

212:

213: \subsection{Thermal fluctuations and sequence randomness}\label{sec:seqrandomness}

214:

215: Any base pair step fluctuates in a thermal environment. In general the thermal mean value as well as the covariance matrix are sequence--dependent. In order to study the large scale behavior of a random sequence chain, we include this variability as another, independent source of randomness in addition to the thermal fluctuations \cite{trifonov88}.

216: I.e.~we consider a random sequence step $g=g_0\exp[\xi^iX_i]$ which fluctuates around a global, sequence--independent equilibrium conformation $g_0$, with a covariance matrix $C^{ij}=\E{\xi^i\xi^j}$ resulting from both sequence and thermal fluctuations.

217: The corresponding deformation probability distribution is

218: % \begin{equation}

219: % p(\xi)dV_\xi \propto e^{-\frac{1}{2}\xi^i\beta {S}_{ij}\xi^j}\:A(\xi)d\xi^1 d\xi^2\cdots d\xi^6

220: % \end{equation}

221: \begin{equation}

222: p(\xi)dV_\xi = p(\xi)A(\xi)d\xi^1 \cdots d\xi^6

223: \end{equation}

224: Here, $p$ is the probability density function (pdf) and

225: % $\beta=1/k_BT$ is the inverse temperature, $S$ is the positive--definite symmetric, effective stiffness matrix of the step, and

226: $dV_\xi=A(\xi)d^6\xi$ is the invariant volume element on the group, where $A$ is the Jacobian factor corresponding to our choice of curvilinear coordinates \cite{gonzalez01}. We can approximate $A$ as a constant, see appendix \ref{sec:volel}.

227: We now calculate $g_0$ and $C$ from the thermal and sequence statistics.

228:

229: We first determine $g_0$ such that the expectation over thermal and sequence randomness, $\E{\xi}=0$. This is always possible for not too wide step distributions \cite{kendall90}, and can be implemented by a gradient search with no numerical problems.

230:

231: Within a regime of linear response, the deformation energy of a step with fixed sequence $\sigma$ is a quadratic function of the deviation from the thermal equilibrium value $\E{\xi|\sigma}$

232: \footnote{The conditional expectation of some function $f$ is taken with respect to the conditional distribution,

233: 	$\E{f|\sigma}=\int f(\xi)p(\xi|\sigma)dV_\xi$},

234: irrespective of the detailed nature of backbone connections and stacking interactions. The associated thermal covariance matrix is

235: \begin{equation}

236: C_\sigma^{ij}=\Eb{\,(\xi-\E{\xi|\sigma})^i(\xi-\E{\xi|\sigma})^j\,\bigr|\,\sigma\,}.

237: \end{equation}

238: On the other hand, the covariance of the sequence--dependent thermal mean values is given by

239: \begin{equation}

240: C_0^{ij}=\Eb{\,\E{\xi|\sigma}^i\E{\xi|\sigma}^j\,},

241: \end{equation}

242: where the outermost expectation is effectively taken with respect to the step sequence distribution $p(\sigma)$.

243:

244: Since the two sources of randomness are independent, their covariances add up. One computes

245: \begin{equation}\label{eqn:Cstaticthermal}

246: \begin{split}

247: C^{ij}=& % =\E{\E{\delta\xi^i\delta\xi^j|\sigma}}

248:  \Eb{\,(\xi-\E{\xi|\sigma})^i(\xi-\E{\xi|\sigma})^j\,}+\Eb{\,\E{\xi|\sigma}^i\E{\xi|\sigma}^j\,}\\

249: =&\E{C^{ij}_\sigma}+C_0^{ij}.

250: \end{split}

251: \end{equation}

252: Given the covariance (or stiffness) matrices and equilibrium values of all sixteen dinucleotide steps, and a distribution of relative step frequencies $p(\sigma)$, by computing $g_0$ and $C$ we have characterized a thermally fluctuating random sequence step in terms of its center and second moment.
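The absence of mixed terms in \eqref{eqn:Cstaticthermal} can be made explicit. With the shorthand $\mu_\sigma=\E{\xi|\sigma}$ (used only in this illustration), we split $\xi=(\xi-\mu_\sigma)+\mu_\sigma$ and use $\E{\xi}=0$ to write
\begin{equation}
\begin{split}
C^{ij}=&\Eb{\,(\xi-\mu_\sigma)^i(\xi-\mu_\sigma)^j\,}+\Eb{\,\mu_\sigma^i\mu_\sigma^j\,}\\
&+\Eb{\,\E{(\xi-\mu_\sigma)^i|\sigma}\,\mu_\sigma^j\,}+\Eb{\,\mu_\sigma^i\,\E{(\xi-\mu_\sigma)^j|\sigma}\,}.
\end{split}
\end{equation}
The last two terms vanish because $\E{\xi-\mu_\sigma|\sigma}=0$ for every fixed sequence $\sigma$, which leaves the thermal and the static contributions as the only terms.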

253:

254: \section{Coarse--graining}\label{sec:coarse}

255:

256: Up to this point, step deformations and therefore also the covariance matrices were given with respect to a reference frame fixed to the equilibrium base--pair frame $g_0$, which in general is offset and tilted relative to its own local helical axis. To relate the RBC deformations to a coarse--grained WLC model, we are much more interested in the elastic properties of the \emph{centerline} of the chain. Such a centerline can be taken as the local helical axis for every base pair step, cf.~fig.\ \ref{fig:screw}. This has the disadvantage that for fluctuating steps, the centerline pieces do not form a continuous curve. On the other hand, one can fit a continuous centerline globally to a stretch of a RBC \cite{lavery89}. In such an approach, the centerline depends non-locally on the base pair step conformations, introducing artificial correlations on the length scale over which the fitting procedure extends.

257:

258: We circumvent these problems in three steps. First we transform all rigid base--pairs of the chain to new frames of reference.  These are chosen such that \emph{without fluctuations}, all new bp frames lie exactly on, and point in the direction of a single straight helical axis. We can then identify and average over the unwanted shear degrees of freedom. In a last step, this reduced model is averaged over the helical phase angle and mapped to the WLC models.

259:

260:

261: \subsection{On-axis RBC}

262:

263: We would like to transform small deviations from an equilibrium conformation $g_0$ into small deviations from a version of $g_0$ which is on-axis. Consider first a regular helix composed of identical $g_0$ steps. As explained in section \ref{sec:onaxis}, the on-axis step between the $k$-th and $k+1$-th on-axis frames is

264: \begin{equation}\label{eqn:steponax}

265: g_{0\ax}= (g_0^kg\ts{ax})^{-1}g_0^{k+1}g\ts{ax}=g\ts{ax}^{-1}g_0g\ts{ax},

266: \end{equation}

267: where $g\ts{ax}$ is determined entirely by $g_0$, see \eqref{eqn:gax}.

268: % Although $g\ts{ax}$ differs to the local helical axis transformation as used in, e.g.\ \cite{lu03} by some phase angle rotation around $\omega$, this difference will become immaterial in the next step of averaging over the helical phase.

269: Since $g_{0\ax}$ is a transformation between on-axis frames, its rotation and displacement vectors point along the $d_3$ axis, $\omega_{0\ax}=\norm{\omega_{0\ax}} d_3$ and $p_{0\ax}=\norm{p_{0\ax}}d_3$.

270:

271: For a \emph{fluctuating} RBC we calculate,

272: \begin{equation}\label{eqn:fluctonax}

273:  (g_{1k}g\ts{ax})^{-1}g_{1k+1}g\ts{ax} =g\ts{ax}^{-1}g_{k\,k+1}g\ts{ax}

274: =g_{0\ax}g\ts{ax}^{-1}\exp[\xi^iX_i]g\ts{ax},

275: \end{equation}

276: where $g_{k\,k+1}=g_0\exp[\xi^iX_i]$ is the off-axis fluctuating step. The three rightmost factors in \eqref{eqn:fluctonax} clearly represent the deviation from the on-axis equilibrium step $g_{0\ax}$.

277: We introduce some standard notation.

278: % The adjoint mapping is the linear map $X \mapsto g X g^{-1}$ on the space of group generators.

279: The $6\times 6$ adjoint matrix $\Ad g$ is defined for any $g\in \mathit{SE}(3)$ by $gX_ig^{-1}=?{(\Ad g)}^j_i?X_j$. Explicitly, if $g=(R,p)$, one finds

280: \begin{equation}\label{eqn:explicitAd}

281:  \Ad g= \begin{pmatrix}

282:          R & 0 \\

283:          p^i\epsilon_i R  & R

284:         \end{pmatrix},

285: \end{equation}

286: written in $3\times 3$ blocks.
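To illustrate how \eqref{eqn:explicitAd} acts, split $\xi=(\omega,v)$ and use $p^i\epsilon_i\,u=p\times u$ for any vector $u$. Then
\begin{equation}
\Ad g\,\xi=\bigl(R\,\omega,\; p\times R\,\omega+R\,v\bigr),
\end{equation}
so the rotational components are simply rotated, while the translational components acquire an additional lever-arm contribution from the offset $p$. Since $g^{-1}=(R^\Tp,-R^\Tp p)$, the adjoint matrix of the inverse transformation, which is what enters below, is
\begin{equation}
 \Ad g^{-1}= \begin{pmatrix}
         R^\Tp & 0 \\
         -R^\Tp p^i\epsilon_i  & R^\Tp
        \end{pmatrix}.
\end{equation}
One can verify by block multiplication that this is the matrix inverse of \eqref{eqn:explicitAd}.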

287: Pulling a similarity transformation inside the exponential series we can then rewrite \eqref{eqn:fluctonax} as

288: \begin{equation}\label{eqn:fluctonax2}

289: \begin{gathered}

290: g_{0\ax}g\ts{ax}^{-1}\exp[\xi^iX_i]g\ts{ax}= g_{0\ax}\exp[\xi_\ax^iX_i]

291: % \\ \text{where } \xi_\ax=\Ad g_{ax}^{-1}\xi.

292: \end{gathered}

293: \end{equation}

294: Here, the deviation from the on-axis equilibrium step, $\xi_\ax=\Ad g_{ax}^{-1}\xi$, has zero mean and covariance matrix

295: \begin{equation}\label{eqn:fluctonax3}

296:  C_\ax^{ij}=\E{ \xi_\ax^i\xi_\ax^j}=?{(\Ad g_{ax}^{-1})}^i_k? C^{kl}?{(\Ad g_{ax}^{-1})}^j_l?.

297: \end{equation}

298:

299: The RBC composed of steps \eqref{eqn:fluctonax2} is an equivalent description of the original chain, which we may call its on-axis version. Intuitively, to each fluctuating frame $g_{1k}$ of the original chain, we rigidly connected a frame $g_{1k}'$ in such a way that the primed, on-axis chain fluctuates about a straight, but still twisted, equilibrium conformation.

300: This is illustrated in fig.\ \ref{fig:combview}. The sequence--dependent equilibrium conformations produce an irregular helix. Thermal fluctuations increase irregularity. However, when averaging over thermal and sequence fluctuations, the on-axis configuration is exactly lined up on a straight helical axis. Note that we had no need to compute a fluctuating axis explicitly.

301:

302: \begin{figure}

303:  \centering

304: %  \includegraphics[width=.8\columnwidth, bb =0 0 620 518]{offaxisboxes/combview.pdf}

305: \includegraphics[width=.8\columnwidth, bb =0 0 620 518]{combview}

306: \caption{Equivalent descriptions of a particular random sequence RBC. ``seq'': Colored blocks represent base pairs in their equilibrium conformations. Wireframe blocks represent their on--axis counterparts. ``thermal+seq'': The same, but with added thermal fluctuations. The top views show  the reduced helix axis offsets of the on--axis frames. (MD parameter set, base pair size scaled down by 40 \% for clarity, sequence GCGTTGTGGGCT.)}

307:  \label{fig:combview}

308: \end{figure}

309:

310:

311: % \subsection{Combining steps}

312: %

313: % To average over the helical phase, we need to combine a short subchain of fluctuating steps into one compound step.

314: % This is not entirely straightforward since the angular fluctuations at intermediate steps in the subchain will contribute to the translational fluctuations at the end of the compound step amplified by the lever arm that is given by their distance to the end frame. These effects can be taken into account again by using the adjoint mapping.

315: %

316: % Let's start with two steps, with static displacements $g_{0}$ and independent fluctuating increments $\xi_{12}, \xi_{23}$, both with covariance $C$.

317: % We then want to find $g_{0,13}$, $\xi_{13}$ and $C_{13}$ so that

318: % \begin{equation}\label{eqn:twostep}

319: % g_{13}= g_{0}\exp[\xi_{12}^iX_i]g_{0}\exp[\xi_{23}^iX_i]=g_{0,13}\exp[\xi_{13}^iX_i],

320: % \end{equation}

321: % the distribution $p(\xi_{13})$ is centered at 0, and $C_{13}$ is its covariance. Again using the adjoint matrix, and expanding the exponentials for small fluctuations, we rewrite this as

322: % \begin{equation}\label{eqn:twostep2}

323: %  \begin{aligned}

324: %  	&g_{13}=g_{0}^2\exp[(\Ad g_0^{-1} \xi_{12})^iX_i]\exp[\xi_{23}^iX_i]=\\

325: %  	&=g_{0}^2\exp\bigl[(\Ad g_0^{-1}\xi_{12}+\xi_{23})^iX_i\,\bigr]

326: % %  	+\tfrac{1}{2} \Ad g_0^{-1}\xi_{12}^i\xi_{23}^j[X_i,X_j]\,\bigr]

327: % +O(\xi)^2.

328: %  \end{aligned}

329: % \end{equation}

330: % This order of expansion is sufficient in the limit where

331: % \begin{enumerate}

332: % \item  the equilibrium step is much larger than the fluctuations,   $\xi_0^iC^{-1}_{ij}\xi_0^j\gg 1$, and

333: % \item the fluctuation angles are small, $\sum_{i=1}^3C_{13}^{ii}\ll 1$.

334: % \end{enumerate}

335: % The first condition guarantees that $g_{0,13}=g_0^2$ is a good approximation. The second condition ensures the validity of expanding the exponential.

336: %

337: % In an analogous way, we can combine a subchain of $k$ identically distributed steps into a compound step. Permuting the equilibrium steps to the left and compensating by adjoint matrices, we get

338: % \begin{equation}

339: %  \begin{aligned}

340: %   g_{1k}&=g_{0,1k}\exp[\xi_{1k}^iX_i] +O(\xi)^2\text{, where} \\

341: %   g_{0,1k}&=g_0^{k-1}\text{ and  }\xi_{1k}=\sum_{l=2}^{k}\Ad g_0^{k-l}\xi_{l-1l}.

342: %  \end{aligned}

343: % \end{equation}

344: % The compound fluctuation $\xi_{1k}$ has zero mean, and since the steps are independent, its covariance is

345: % \begin{equation}

346: %    C_{1k}^{ij}=\sum_{l=2}^{k}?{(\Ad g_0^{k-l})}^i_m?C^{mn}?{(\Ad g_0^{k-l})}^j_n?

347: % \end{equation}

348: % The condition 2.~now reads $\sum_{i=1}^3C_{1k}^{ii}\ll 1$. Clearly, after a certain maximal number of steps, the compound step will be too soft to be described by small fluctuations around an equilibrium in a meaningful way. For DNA, condition 2.~is fulfilled reasonably well up to a full helical turn.

349:

350: % {\color{blue} todo: Marko does have the helical WLC, not averaged over the helical repeat. marko94 does the averaging (however without the stretching dof)}

351:

352: \subsection{Averaging over shear variables}

353:

354: The on-axis RBC has the nice property that the translational fluctuations $(\xi_\ax^4,\xi_\ax^5)=(v_\ax^1,v_\ax^2)$ are now exactly transversal to the equilibrium helix axis. They are pure shear modes and do not contribute to compression fluctuations along the chain. Let $\eta=(\omega_\ax,v_\ax^3)$ be the vector of the four remaining variables.  Noting that the volume element $A$ depends only on the angular part (see appendix \ref{sec:volel}), we write

355: \begin{equation}

356: \E{\eta^i\eta^j}=\int \underbrace{ \biggl. d^3\omega_\ax dv_\ax^3 A(\omega_\ax)}_{dV_\eta}\underbrace{\biggl. \int dv_\ax^1 dv_\ax^2 p(\xi_\ax) }_{p(\eta)}\,\eta^i\eta^j ,

357: \end{equation}

358: % \begin{equation}

359: % \E{\eta^i\eta^j}=\int\underbrace{\biggl. \int p(\xi_\ax) \, dv_\ax^1 dv_\ax^2 }_{p(\eta)}\,\eta^i\eta^j  \underbrace{ \biggl.  A(\xi_\ax)\,d^3\omega_\ax dv_\ax^3}_{A(\eta) d^4\eta}

360: % \end{equation}

361: from which one can see that the $4\times 4$ covariance matrix $\four C^{ij}=\E{\eta^i\eta^j}$ is the same as  $C_\ax$ with its $v_\ax^1,v_\ax^2$ rows and columns deleted. Thus,  $\eta$ has a centered distribution with covariance matrix $\four C$.

362: Here and in the following, $\four\cdot$ indicates deletion of the shear rows and columns in an on-axis, $6\times 6$ matrix. E.g.,~$\four \Ad{}$ is the $4\times 4$ adjoint matrix. Also, unless noted otherwise, we consider only on-axis quantities and suppress the $\cdot_\ax$ subscript in the following.

363:

364: \subsection{Correlations induced by sequence}

365:

366: \new{While we assume thermal fluctuations of neighboring steps to be independent random variables, there are nevertheless correlations in the {\em sequence identity} of neighboring base-pair steps.

367: Any realization of a random sequence of dinucleotide steps must be `continuous',  e.g.~$\sigma_{12}=\mathrm{AG}$ implies that  $\sigma_{23}$ can only start with a G.

368: These correlations need to be taken into account when calculating expectation values for random sequences of DNA.

369: For this purpose, we now consider the combined fluctuations of a short RBC consisting of $m$ base pair steps.}

370:

371: Assuming independent, identically distributed bases we obtain the joint pdf of sequence steps along the chain as the product of the base pdfs,  $p(\sigma_{12},\dots, \sigma_{m\,m+1})=\prod_{k=1}^{m+1}p(b_k)$. This implies that the covariance between thermal mean values is

372: \begin{equation}\label{eqn:nearestneighbor}

373:  \Eb{\, \E{\eta_{k\,k+1}^i|\sigma_{k\,k+1}} \E{\eta_{l\,l+1}^j|\sigma_{l\,l+1}} \,} =

374:  \left\{

375:  \begin{aligned}

376:   &\four C_{0}^{ij}\; & l=k\\

377:   &\four C_{1}^{ij} & l=k+1\\

378:   & \four C_{1}^{ji} & l=k-1\\

379:   & 0 & \text{otherwise}.

380:  \end{aligned}

381:  \right.

382: \end{equation}

383: Here we introduced a nearest--neighbor term $\four C_{1}$ which will be computed below. No nearest--neighbor correlation occurs in the thermal covariances by assumption.

384:

385: We are now in a position to combine the $m$ base pair steps of our chain into one compound step.

386: Here, $m$ must be small enough that the typical deviation angles of the compound step from equilibrium stay small. I.e., the short chain must be well approximated by a (helical) rigid rod.

387: Successively commuting the on-axis equilibrium steps $g_{0\ax}$ to the left and introducing the adjoint matrix as in \eqref{eqn:fluctonax2}, one arrives at

388: \begin{multline}\label{eqn:adcompound}

389:  g_{0\ax}\exp[\xi_{12\ax}^i X_i ]\cdots g_{0\ax}\exp[\xi_{m\,m+1\ax}^iX_i]=\\

390: % =g_{0}^m(\Ad{g_{0}^{-m+1}}\xi_{12}+\cdots +\xi_{m\,m+1} )^iX_i

391: =g_{0\ax}^m \,\biggl[e+ \Bigl(\sum_{k=1}^{m} \Ad{g_{0\ax}^{-m+k}}\xi_{k\,k+1\ax}\Bigr)^iX_i\biggr] +O(\xi)^2.

392: \end{multline}

393: The sum in parentheses is the deviation $\xi_{1m+1\ax}$ of the fluctuating compound step from its equilibrium value $g_{0\ax}^m$.

394:

395: The corresponding reduced $\four\Ad$ matrix has a simple form. Using \eqref{eqn:explicitAd} and noting that  $p_{0\ax}\propto\omega_{0\ax}\propto d_3$ we obtain

396: \begin{subequations}

397: \begin{gather}

398: % \begin{split}

399: \eta_{1\,m+1}= \sum_{k=1}^{m}\four \Ad{g_{0\ax}}^{-m+k}\eta_{k\,k+1} \text{ , where} \\ \label{eqn:fourAd}

400:  \four \Ad{g_{0\ax}}=\begin{pmatrix}

401:                                \cos \norm{\omega_0} & \sin \norm{\omega_0} &0 &0 \\

402:                                -\sin   \norm{\omega_0} & \cos \norm{\omega_0}  &0&0 \\

403:                               0 &0& 1 &0  \\

404:                               0&0&0&1

405:                               \end{pmatrix}

406: % \end{split}

407: %

408: %  \eta_{1\,m+1}= \sum_{k=1}^{m}\four \Ad{g_{0\ax}}^{-m+k}\eta_{k\,k+1}

409: %  \begin{pmatrix}

410: %                                \cos \norm{\omega_0} & \sin \norm{\omega_0} & & \\

411: %                                -\sin   \norm{\omega_0} & \cos \norm{\omega_0}  && \\

412: %                                && 1 &  \\

413: %                                &&&1

414: %                               \end{pmatrix}^{-m+k} \eta_{k\,k+1}

415: % \eta_{1\,m+1}= \sum_{k=1}^{m}\begin{pmatrix} R_{0\ax}

416: \end{gather}

417: \end{subequations}

418: One sees that the $\omega_\ax^{1,2}$ components are successively rotated around the $d_3$ axis, while the $\omega_\ax^3,v_\ax^3$ components are unaffected.

419: %

420: % \begin{equation}

421: % \omega_{1\,m+1\ax} = \sum_{k=1}^{m}R_{0\ax}^{-m+k} \omega_{k\,k+1\ax} ,\; v^3_{1\,m+1\ax} = \sum_{k=1}^{m} v^3_{k\,k+1\ax}

422: % \end{equation}

423:

424: What is the covariance matrix  $ \four {C}^{ij}_{1m+1} = \E{\eta_{1m+1}^i\eta_{1m+1}^j } $ of the compound deviation? Using  \eqref{eqn:nearestneighbor}, we are left with a sum of appropriately transformed single--step covariances $\four  C=\E{\four C_\sigma}+\four C_0$

425: and in addition a sum of nearest--neighbor cross--terms involving $\four C_{1}$:

426: \begin{equation}\label{eqn:compoundCfour}

427: \begin{split}

428:  \four C_{1\,m+1} =   \sum_{l=0}^{m-1} \four \Ad{g_{0}^{-l}}\four  C \four \Ad^\Tp{g_{0}^{-l}}

429:   + \\

430:   + \sum_{l=0}^{m-2}  \four \Ad{g_{0}^{-l}} \four C_\times  \four \Ad^\Tp{g_{0}^{-l}},\\

431:   \text{ where }  \four C_\times = \four C_{1} \four \Ad^{\Tp}{g_0^{-1}}+  \four \Ad {g_0^{-1}} \,\four C_{1}^\Tp.

432: \end{split}

433: \end{equation}

434: % \begin{equation}\label{eqn:compoundCfour}

435: % \begin{split}

436: %  \four C_{1\,m+1} =   \sum_{l=0}^{m-1} \four \Ad{g_{0}^{-l}}\four  C \four \Ad^\Tp{g_{0}^{-l}}

437: %    \\

438: %   + \sum_{l=0}^{m-2}  \four \Ad{g_{0}^{-l}} \four C_1^\Tp  \four \Ad^\Tp{g_{0}^{-(l+1)}} \\

439: %   + \sum_{l=0}^{m-2}  \four \Ad{g_{0}^{-(l+1)}} \four C_1  \four \Ad^\Tp{g_{0}^{-l}}.

440: % \end{split}

441: % \end{equation}

442: The cross--covariance $\four C_\times$ represents the fact that nearest--neighbor equilibrium steps are correlated and their frames of reference are rotated by an angle $\norm{\omega_0}$.

443:

444: % {\color{blue} CORRECTED MISTAKE: IN $C_\times$ there was an extra $\Ad g_0$ going ``over the end'' Changed for $\Ad^{-1}$}

445:

446: Note that two neighboring compound steps are still correlated by sequence continuity at their interface. From \eqref{eqn:compoundCfour} we have the recursion relation

447: \begin{equation}

448: \begin{split}

449: \four C_{1\,m+1} = \four \Ad g_{0}^{-1} \four C_{1\,m}  \four \Ad^\Tp{g_{0}^{-1}} +  \four C +  \four C_\times .

450: \end{split}

451: \end{equation}

452: The same relation is obeyed by a sequence of \emph{independent} steps with covariance matrix  $ \unc C = \four C +  \four C_\times$.

453: We conclude that, except for a boundary term $\four C_\times$ at the beginning of the chain, a RBC with independent steps and with covariances $\unc C$ exhibits the same effective covariance as the original, short--range correlated chain with $\four C$ and $\four C_\times$. The relative error in the effective compound covariance is of order $1/m$.
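
The recursion and the size of the boundary term can be made explicit by a short index shift. The following is only a sketch based on \eqref{eqn:compoundCfour}; the symbol $\unc C_{1\,m+1}$ for the compound covariance of the chain with independent steps is notation introduced here for illustration. Conjugating $\four C_{1\,m}$ with $\four \Ad{g_0^{-1}}$ shifts the summation index $l\to l+1$ in both sums,
\begin{equation*}
\begin{split}
 \four \Ad{g_{0}^{-1}}\, \four C_{1\,m}\, \four \Ad^\Tp{g_{0}^{-1}} = \sum_{l=1}^{m-1} \four \Ad{g_{0}^{-l}}\four C \four \Ad^\Tp{g_{0}^{-l}} \\
  + \sum_{l=1}^{m-2} \four \Ad{g_{0}^{-l}} \four C_\times \four \Ad^\Tp{g_{0}^{-l}},
\end{split}
\end{equation*}
so that adding the $l=0$ terms $\four C+\four C_\times$ reproduces $\four C_{1\,m+1}$ for $m\geq 2$. Iterating the same recursion for independent steps, starting from a single step with covariance $\unc C=\four C+\four C_\times$, gives
\begin{equation*}
\begin{split}
 \unc C_{1\,m+1} = \sum_{l=0}^{m-1} \four \Ad{g_{0}^{-l}}\bigl(\four C+\four C_\times\bigr) \four \Ad^\Tp{g_{0}^{-l}} \\
  = \four C_{1\,m+1} + \four \Ad{g_{0}^{-(m-1)}}\, \four C_\times\, \four \Ad^\Tp{g_{0}^{-(m-1)}}.
\end{split}
\end{equation*}
The two chains thus differ by a single rotated copy of $\four C_\times$, compared to $m$ terms of comparable magnitude in the full sum, which is the origin of the $1/m$ estimate.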

454:

455:

456:

457: % We compute only the relevant components of the compound matrix, using the explicit form \eqref{eqn:explicitAd} of $\Ad{}$ and noting that on-axis, $p_{0\ax}\propto \omega_{0\ax}\propto d_3$:

458: % \begin{equation}\label{eqn:compoundCfour}

459: %  \begin{aligned}

460: %  &\E{\xi_{1m+1\ax}^i\xi_{1m+1\ax}^j} =

461: %   \sum_{k=0}^{m-1} ?{(R_{0\ax}^{-k})}^i_p? C_\ax^{pq}?{(R_{0\ax}^{-k})}^j_q? + \\

462: %  &+\sum_{k=0}^{m-2}  ?{(R_{0\ax}^{-k})}^i_p? C_\times^{pq}?{(R_{0\ax}^{-k})}^j_q?,  i,j,p,q=1, 2\, , \\

463: % &\E{\xi_{1m+1\ax}^i\xi_{1m+1\ax}^j}= m  C^{ij} + (m-1) C_\times^{ij},\; i,j=3, 6\, , \\

464: % %     &\E{\omega_{1m+1}^3\omega_{1m+1}^3}= m  C^{33} + (m-1) C_\times^{33}  \\

465: % %     &\E{\omega_{1m+1}^3 v_{1m+1}^3}= m  C^{36} + (m-1) C_\times^{36}, \\

466: % %     &\E{v_{1m+1}^3 v_{1m+1}^3}= m  C^{66} + (m-1) C_\times^{66}

467: %  & \text{ where }   C_\times =  C_{1\ax} \Ad^\Tp{g_0}+  \Ad{g_0} \,C_{1\ax}^\Tp .

468: % \end{aligned}

469: % \end{equation}

470:

471:

472:

473:

474:

475: % \begin{equation}

476: % \begin{aligned}

477: % \begin{pmatrix}  \omega^1\\ \omega^2 \end{pmatrix}_{1\,m+1\ax}  &= \sum_{k=1}^{m}

478: % % \begin{pmatrix}

479: % %  \cos \norm{\omega_0} & \sin\norm{\omega_0} \\  -\sin\norm{\omega_0} & \cos \norm{\omega_0}

480: % % \end{pmatrix}^{-m+k}

481: % R_{2\times 2}^{-m+k}(\norm{\omega_0})

482: % \begin{pmatrix}  \omega^1 \\ \omega^2\end{pmatrix}_{k\,k+1\ax} ,

483: % \\

484: %  \begin{pmatrix}  \omega^3 \\ v^3\end{pmatrix}_{1\,m+1\ax}

485: %  &= \sum_{k=1}^{m} \begin{pmatrix} \omega^3\\ v^3\end{pmatrix}_{k\,k+1\ax}

486: % \end{aligned}

487: % \end{equation}

488: % Note that since also $\omega_{0\ax}\propto d_3$, the rotation $R_{0\ax}$ is entirely in the $d_1-d_2$ plane.

489: % What is the covariance matrix

490: % % $ \four {C}^{ij}_{1m+1} = \E{\eta_{1m+1}^i\eta_{1m+1}^j } $

491: % of the compound deviation? According to \eqref{eqn:nearestneighbor}  we obtain a sum of appropriately transformed single-step covariances

492: % % $\four C=\four C_0+\E{\four C_\sigma}$

493: % and in addition a sum of  nearest neighbor cross-terms.

494: % %  involving $\four C_{1}$.

495: % We compute the components,

496:

497:

498:

499:

500:

501: % What is the covariance matrix $C^{ij}_{1_m+1} = \E{\xi_{1m+1}^i\xi_{1m+1}^j }$ of the compound deviation? Since according to \eqref{eqn:nearestneighbor} there are at most nearest neighbor correlations, we will get a covariance that is a sum of appropriately transformed single-step covariances $C=C_0+\E{C_\sigma}$, and in addition a sum of  nearest neighbor cross-terms involving $C_{1}$. Carrying out the computation, one finds

502: % \begin{equation}

503: % \begin{split}

504: % %   \E{\xi_{1m+1}^i\xi_{1m+1}^j} =

505: % %   \sum_{k=0}^{m-1} ?{(\Ad g_{0}^{-k})}^i_p? C^{pq}?{(\Ad g_{0}^{-k})}^j_q? +\\

506: % %   +

507: % %   \sum_{k=0}^{m-2}  ?{(\Ad g_{0}^{-k})}^i_p? C_\times^{pq}?{(\Ad g_{0}^{-k})}^j_q?,\\

508: % %    \text{where }

509: % %   C_\times^{pq} = C_{1}^{pr} ?{\Ad{g_0}}^q_r?+   C_{1}^{qr} ?{\Ad{g_0}}^p_r?,

510: %   C_{1m+1}=

511: %   \sum_{k=0}^{m-1} \Ad{g_{0}^{-k}} C \Ad^\Tp{g_{0}^{-k}} +\\

512: %   +

513: %   \sum_{k=0}^{m-2}  \Ad{g_{0}^{-k}} C_\times  \Ad^\Tp{g_{0}^{-k}},\\

514: %   \text{where }  C_\times = C_{1} \Ad^\Tp{g_0}+  \Ad{g_0} \,C_{1}^\Tp

515: % \end{split}

516: % \end{equation}

517: % and $^\Tp$ denotes the transpose,

518: % see also \eqref{eqn:fluctonax3}. This lengthy expression simplifies considerably when we look only at the reduced set of variables $\eta^i$ in an on-axis chain.

519:

520: % {\color{blue} Get rid of the mess$_{kk+1}$  for the one-step parameters?}

521:

522: \subsection{Averaging over the helical phase}

523:

524: A shear--averaged, on-axis RBC still has a finite equilibrium twist and anisotropic bending stiffness. To relate it to a WLC with isotropic bending rigidity, we

525: % would need to consider compound steps comprising a half integer multiple of a helical turn. Their symmetry results in isotropic bending in linear order. However, this is generally not possible since $\pi / \norm{\omega_0}$ is not an integer (e.g.~5.25 for canonical B-DNA). We therefore

526: perform an average over a continuous helical phase angle rotation of the reference frame \cite{marko94}.

527: % , to get an exactly isotropic equivalent to the RBC.

528: An on-axis covariance matrix rotated by a helical phase angle $\phi$ around the average local helical axis (see \eqref{eqn:fluctonax3}) is

529: \begin{equation}

530:  \unc C(\phi) = \four \Ad g_\phi \unc C \four \Ad^\Tp g_\phi,

531: \end{equation}

532: where $g_\phi=\exp[\phi X_3]$ is a pure rotation by an angle $\phi$ around  $d_3$. Since $\four\Ad g_\phi$ has the form \eqref{eqn:fourAd}, the helical phase average is seen to be

533: \begin{equation}\label{eqn:helav}

534: \begin{split}

535:  \hav C = \frac{1}{2\pi}\int_0^{2\pi}\unc C (\phi)d\phi =

536:  \begin{pmatrix}

537:   \tfrac{\unc C^{11}+\unc C^{22}}{2} & 0 & 0 & 0 \\

538:   0 &\tfrac{\unc C^{11}+\unc C^{22}}{2} & 0& 0\\

539:  0 & 0 & \unc C^{33} & \unc C^{34 }  \\

540:  0 & 0 & \unc C^{34} & \unc C^{44}

541:  \end{pmatrix}.

542: %  \diag\bigl(\tfrac{\unc C^{11}+\unc C^{22}}{2},\tfrac{\unc C^{11}+\unc C^{22}}{2},\unc C^{33},\unc C^{44}\bigr).

543: \end{split}

544: \end{equation}

545: From $\hav C$ one can read off the bend and twist persistence lengths as $l_b=h_\ax/ \hav C^{11}$ and $l_t=h_\ax/\hav C^{33}$, respectively, where the on-axis helical rise is $h_\ax=\norm{p_{0\ax}}$.

546: The WLC stiffness matrix $\beta \hav S = \hav C^{-1}$ can be found by inversion and has the same block structure, see also appendix \ref{sec:volel}. Its nonzero components are the bend, twist, stretch and twist--stretch coupling stiffness coefficients.
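
The structure of \eqref{eqn:helav} can be checked entry by entry. As a sketch, write $c=\cos\phi$ and $s=\sin\phi$, and assume, in line with \eqref{eqn:fourAd}, that $\four\Ad g_\phi$ acts as a planar rotation on the two bending components and as the identity on the twist and stretch components. The shorthand $c$, $s$ and the determinant $D_{ts}$ below are notation introduced here. Up to the sign convention for the rotation, the bending entries of the rotated matrix are
\begin{equation*}
\begin{split}
 \unc C^{11}(\phi) &= c^2\, \unc C^{11} + 2cs\, \unc C^{12} + s^2\, \unc C^{22}, \\
 \unc C^{12}(\phi) &= (c^2-s^2)\, \unc C^{12} + cs\,\bigl( \unc C^{22} - \unc C^{11}\bigr).
\end{split}
\end{equation*}
Over a full period, $c^2$ and $s^2$ average to $1/2$, while $cs$ and $c^2-s^2$ average to zero. This yields the isotropic bending block of \eqref{eqn:helav}. The bend--twist and bend--stretch entries are linear in $c$ and $s$ and therefore vanish on average, and the twist--stretch block is not affected by the rotation. Inverting the two blocks separately gives
\begin{equation*}
\begin{split}
 \beta \hav S^{11} = \beta \hav S^{22} &= 1/\hav C^{11}, \qquad
 \beta \hav S^{33} = \hav C^{44}/D_{ts}, \\
 \beta \hav S^{44} &= \hav C^{33}/D_{ts}, \qquad
 \beta \hav S^{34} = -\hav C^{34}/D_{ts},
\end{split}
\end{equation*}
with $D_{ts}=\hav C^{33}\hav C^{44}-(\hav C^{34})^2$. In particular, $\beta \hav S^{33}$ coincides with $1/\hav C^{33}=l_t/h_\ax$ only if the twist--stretch covariance $\hav C^{34}$ vanishes.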

547:

548: % \begin{equation}

549: % \hav S = \beta^{-1}

550: % \begin{pmatrix}

551: %   1/\hav C^{11} & 0 & 0 & 0 \\

552: %   0 &1/\hav C^{11} & 0& 0\\

553: %  0 & 0 &\hav C^{44}/D_{ts} & -\hav C^{34}/D_{ts} \\

554: %  0 & 0 & -\hav C^{34}/D_{ts} & \hav C^{33}/D_{ts}

555: %  \end{pmatrix}.

556: % \end{equation}

557: % \begin{subequations}\label{eqn:EHWLCstiffnesses}

558: %  \begin{alignat}{2}

559: %  \hav S^{11} = \hav S^{22} & =1/(\beta\hav C^{11})= l_b/\beta, \\

560: % \hav S^{33} 					&=\hav C^{44}/(\beta D_{ts})\neq l_t/\beta,\\

561: %  \hav S^{44} 					&=\hav C^{33}/(\beta D_{ts}),\\

562: %  \hav S^{34} = \hav S^{43} & =-\hav C^{34}/(\beta D_{ts}),

563: % \end{alignat}

564: % \end{subequations}

565: % Here  $D_{ts}= \hav C^{33}\hav C^{44}-{\hav C^{34}}^2>0$.

566:

567: % We compute the helical phase average of the compound step covariance. Since the phase rotation and the equilibrium on-axis rotation  are about the same axis, the  $R_{0\ax}$ rotations in \eqref{eqn:compoundCfour} just add an irrelevant offset to the integration variable.

568: %

569: %

570: % Explicitly,  written in $3\times 3$ blocks,

571: % \begin{equation}\label{eqn:C3blocks}

572: %  C_\ax(\phi) = \begin{pmatrix}

573: %  R_\phi C_\ax^{\omega\omega}R_\phi^\Tp & R_\phi C_\ax^{\omega v}R_\phi^\Tp \\

574: % R_\phi C_\ax^{ v \omega}R_\phi^\Tp  & R_\phi C_\ax^{ vv}R_\phi^\Tp

575: %               \end{pmatrix},

576: % %               ,\quad R_\phi=\begin{pmatrix}

577: % %  \cos \phi & -\sin \phi & 0 \\

578: % %  \sin \phi & \cos \phi & 0\\

579: % %  0 & 0 & 1

580: % %                                          \end{pmatrix}.

581: % \end{equation}

582: % \eqref{eqn:fluctonax3}

583: % where $C_\ax^{ ab}$ are the $3\times 3$ blocks of $C_\ax$, with $a,b=\omega_\ax$ or $v_\ax$.

584: % Doing the average over full periods of trigonometric functions, each such block becomes

585: % \begin{equation}

586: %  \bar C_{\ax ab}=\int_0^{2\pi} R_\phi C_{\ax ab}R_\phi^\Tp\,d\phi=

587: %  \mathrm{diag}\bigl(\tfrac{C_{\ax ab}^{11}+C_{\ax ab}^{22}}{2},\tfrac{C_{\ax ab}^{11}+C_{v ab}^{22}}{2},C_{\ax ab}^{33}\bigr).

588: % \begin{multline}

589: %   \bar C_\ax^{ab}=\int_0^{2\pi} R_\phi C_\ax^{ab}R_\phi^\Tp\,d\phi= \\

590: %  =\diag\bigl(\tfrac{\E{a^1 b^1+a^2 b^2}}{2},\tfrac{\E{a^1 b^1+a^2 b^2}}{2},\E{a^3 b^3}\bigr).

591: % \end{multline}

592: % \end{equation}

593: % I.e, the averaged covariance $\bar C_\ax$ has isotropic bending and shear fluctuations, and coupling occurs only between corresponding shear and rotation directions. This is the result of the chiral symmetry we have imposed by averaging over the helical phase.

594: % An isotropic, on-axis RBC with covariance $\bar C_\ax^{ij}=\E{\bar\xi_\ax^i\bar\xi_\ax^j}$ and equilibrium step $\bar g_{0\ax}$ consisting of the pure translation $\bar p_{0\ax}=p_{0\ax}$ still has all six degrees of freedom of the original model.

595:

596: % \subsection{Averaging over shear variables OLD}

597: %

598: % The translational shear fluctuations are orthogonal to the equilibrium helix axis in both the isotropic and non-averaged, on-axis RBC.  They therefore do not contribute to compression fluctuations of the chain. To relate an on-axis RBC to a discrete worm-like chain model without shear fluctuations, we integrate over the two shear degrees of freedom, $\bar v^1, \bar v^2$. Let $\eta=(\omega_\ax,v_\ax^3)$ be the vector of the four remaining variables.  From

599: % \begin{equation}

600: % \E{\eta^i\eta^j}=\int\underbrace{ \int p(\xi_\ax) \, dv_\ax^1 dv_\ax^2 }_{p(\eta)}\,\eta^i\eta^j  A(\xi_\ax)\,d^3\omega_\ax dv_\ax^3

601: % \end{equation}

602: % one can see that the $4\times 4$ covariance matrix $C_{w}^{ij}=\E{\eta^i\eta^j}$ is the same as  $C_\ax$ with its $v_\ax^1,v_\ax^2$ rows and columns deleted. Thus,  $\eta$ has a centered Gaussian distribution with covariance matrix $C_w$.  It does not matter whether the helical phase or the shear average is carried out first, as long as the chain is on-axis. Both ways, we get a reduced, isotropic covariance matrix, given explicitely as

603: % \begin{equation}

604: % \bar C_w=

605: % % 	\begin{pmatrix}

606: % % 	\tfrac{C_\ax^{11}+C_\ax^{22}}{2} & & & \\

607: % % 	 & \tfrac{C_\ax^{11}+C_\ax^{22}}{2} & & \\

608: % % 	 & & C_\ax^{33} & C_\ax^{36} \\

609: % % 	 & & C_\ax^{36} & C_\ax^{66}

610: % % 	\end{pmatrix}

611: % \begin{pmatrix}

612: % 	\tfrac{1}{2}\E{{\omega_\ax^1}^2+{\omega_\ax^2}^2} & & & \\

613: % 	 & \tfrac{1}{2}\E{{\omega_\ax^1}^2+{\omega_\ax^2}^2}  & & \\

614: % 	 & & \E{{\omega_\ax^3}^2}& \E{\omega_\ax^3v_\ax^3} \\

615: % 	 & &  \E{\omega_\ax^3v_\ax^3} & \E{{v_\ax^3}^2}

616: % 	\end{pmatrix}

617: % % \begin{pmatrix}

618: % % 	\bar C_{\omega\omega}&

619: % % 	\begin{matrix}

620: % % 	 0 \\ 0 \\ \E{\omega_\ax^3v_\ax^3}

621: % % 	\end{matrix} \\

622: % % 	 \begin{matrix}

623: % % 	  0 & 0 & \E{\omega_\ax^3v_\ax^3}

624: % % 	 \end{matrix}

625: % %  & \E{{v_\ax^3}^2}

626: % % 	\end{pmatrix}

627: % \end{equation}

628: % Note that the structure with one diagonal block for bend fluctuations and one $2\times 2$ block for twist-stretch fluctuations arises from averaging over helical phase and over unwanted shear degrees of freedom. The bend and twist persistence lengths are $l_b=1/ C_w^{11}$ and $l_t=C_w^{33}$, respectively.

629: %

630: % The EHWLC stiffness matrix $\beta \bar S_w=\bar C_w^{-1}$ (see \eqref{eqn:pAcorrection}) has the same block structure.

631: % % (We use the approximation of neglecting the influence of the volume element.)

632: % Its nonzero components are, the bend, twist and stretch stiffness, and the twist-stretch coupling stiffness,

633: % \begin{subequations}\label{eqn:EHWLCstiffnesses}

634: %  \begin{alignat}{2}

635: % \beta \bar S_w^{11} = \beta \bar S_w^{22} & =1/\bar C_w^{11}= l_b, \\

636: % \beta \bar S_w^{33} 					&=C_w^{44}/D_{ts},\\

637: % \beta \bar S_w^{44} 					&=C_w^{33}/D_{ts},\\

638: % \beta \bar S_w^{34}=\beta \bar S_w^{43} & =-C_w^{34}/D_{ts},

639: % \end{alignat}

640: % \end{subequations}

641: % respectively. Here  $D_{ts}= C_w^{33}C_w^{44}-{C_w^{34}}^2>0$.

642: %

643: % % \begin{equation}

644: % %  \begin{aligned}

645: % %  \bar S_w=\begin{pmatrix}

646: % %                  \bar S\ts{b} & 0 \\

647: % %                  0 & \bar S\ts{ts}

648: % %                 \end{pmatrix}

649: % % \text{ , where } \\

650: % % \beta \bar S\ts{b} = \tfrac{2}{C_\ax^{11}+C_\ax^{22}}I_2 \text{ and } \beta \bar S\ts{ts}=

651: % %  \end{aligned}

652: % % \end{equation}

653:

654: \subsection{Coarse--graining relations}\label{sec:cogrel}

655:

656: % We now have arrived a discrete version of an EHWLC with frames stacked exactly on top of each other in the equilibrium configuration, starting from a completely arbitrary initial RBC. Averaging of the on-axis version of the initial model over the helical phase results in a model with a remaining chiral symmetry but no phase register along the helix. The symmetry allows six independent stiffness parameters in $\bar S_\|$. Such an averaged model is  a valid description of the original RBC only on a scale beyond a few helical turns. Further averaging over the shear reduces this to four independent stiffnesses occurring in $\bar S_w$, as required by symmetry \cite{marko97}.

657: We have derived all WLC elastic parameters starting from an arbitrarily oriented and offset RBC. We now discuss in some detail how these coarse--grained parameters are related to the microscopic RBC parameters.

658:

659: % {\color{blue}NOTE: Constraining the shear fluctuations kills part of the translational BM !! This results in slightly different persistence lengths: $R^2$ has the translation included, whereas the director correlations do not. These fall off with $l_b,l_t$. Slight difference of definition of persistence length.}

660:

661: \subsubsection{Equilibrium step}

662:

663: The transformation of the equilibrium step onto the helical axis \eqref{eqn:steponax} leaves the total rotation angle invariant. Therefore, the equilibrium twist of $g_{0\ax}$ is $\theta_\ax=\norm{\omega_{0\ax}}=\norm{\omega_0}\geq |\omega_0^3|$. I.e., the twist per base pair of the WLC equals the total angle of rotation, not the Tw angle of the off-axis step. The equilibrium rise on axis is $h_\ax=\norm{p_{0\ax}}=\omega_0^\Tp p_0/{\norm{\omega_0}}$, which is different from both off-axis quantities $\norm{p_0}$ and $p_0^3$. These differences are of order $O(\omega_0^1+\omega_0^2)^2$ so they

664: become important only when the equilibrium rotation axis $\omega_0$

665: has significant roll and tilt with respect to the material frame, i.e.~when the local helical parameters Inclination and Tip \cite{dickerson89} are not negligible.

666:

667: \subsubsection{Fluctuations}

668:

669: Unlike the equilibrium step, the covariance matrix is changed not only by the rotation $R\ts{ax}$ but also by the shift $p\ts{ax}$ onto the average local helix axis.  Intuitively, the on-axis frame $g'$ is rigidly connected to $g$, cf.~figs.\ \ref{fig:screw}, \ref{fig:combview}. Therefore, a rotational fluctuation of $g$ with rotation vector $\omega'$ will result in an additional \emph{translational} fluctuation of $g'$ equal to  $\omega'\times p\ts{ax}$.

670:

671: A familiar example of this geometrical effect is the stretching of an ordinary coil spring along its helix axis. In the wire material, this deformation corresponds mainly to torsion, i.e.~a rotational deformation of consecutive wire segments. On a larger scale, this deformation is levered into a translation of one coil end along the helix axis.  The transformation \eqref{eqn:fluctonax3} captures exactly this lever arm effect, which is proportional to the total axial displacement $\norm{p\ts{ax}}$ and so becomes relevant if the chain deviates from an idealized B-DNA form.

672:

673: We calculate explicitly the $3\times 3$ blocks $C_\ax^{(ab)}$ of $C_\ax$, \eqref{eqn:fluctonax2}, in terms of the corresponding blocks  $C^{(ab)}$ of $C$, using \eqref{eqn:fluctonax3} and \eqref{eqn:explicitAd}. Here $a, b\in\{\omega,v\}$ stand for the set of rotational or translational components, respectively. Further, we let $C^{(ab)\prime}=R\ts{ax}^\Tp C^{(ab)} R\ts{ax}$ and $P\ts{ax}'=?{{R\ts{ax}}}^i_j? p\ts{ax}^j \epsilon_i$, an antisymmetric matrix. Using this notation,

674: \begin{align}\label{eqn:explicitCax}

675: C_\ax=\begin{pmatrix}

676:        C^{(\omega\omega)\prime} & C^{(\omega v)\prime}+C^{(\omega\omega)\prime}P\ts{ax}'\\[3mm]

677:         C^{(v\omega) \prime}-P\ts{ax}'C^{(\omega\omega)\prime} \quad &

678:         \begin{matrix}

679:          C^{(vv)\prime}  - P\ts{ax}'C^{(\omega\omega)\prime}P\ts{ax}'  \\

680:         + C^{(v\omega)\prime}P\ts{ax}'-P\ts{ax}'C^{(\omega v)\prime}

681:         \end{matrix}

682:       \end{pmatrix}.

683: \end{align}

684: % Since $P\ts{ax}^\Tp=-P\ts{ax}$ and $C^{v\omega\Tp}=C^{\omega v}$, $C_\ax$ is indeed a symmetric matrix, and \eqref{eqn:fluctonax3} implies that it is also still positive definite.

685: In this expression, the rotational block $C_\ax^{(\omega\omega)}$ is merely a rotated version of the off-axis rotational block $C^{(\omega\omega)}$.

686: In contrast, the translational block $C^{(vv)}_\ax$ and the coupling block $C^{(\omega v)}_\ax$ have `leverage terms', since rotational fluctuations about directions perpendicular to the offset vector contribute through a cross product with $p\ts{ax}$.  For $C^{(vv)}_\ax$, these involve the off-axis coupling $C^{(v\omega)}$ in first order and rotational fluctuations $C^{(\omega\omega)}$ in second order in $\norm{p\ts{ax}}$. The coupling block $C^{(\omega v)}_\ax$  has contributions from $C^{(\omega\omega)}$ in first order. These leverage terms persist in the reduced WLC covariance matrix $\unc C$. They are the remainder of the microscopic description of fluctuations with respect to a material frame that is offset from the average helical axis.

687:

688: Consider, for example, a base pair step that exhibits $x$-displacement but no Inclination or Tip, i.e.~$p\ts{ax}\propto d_1$, $\omega_0\propto d_3$, $R\ts{ax}=I_3$. Then \eqref{eqn:explicitCax} implies that any coupled Roll--Rise ($C^{26}$) and Roll ($C^{22}$) fluctuations will add to the stretching fluctuations $C_\ax^{66}$ of the chain. In addition, the off-axis Roll--Twist fluctuation ($C^{23}$) contributes to the twist--stretch coupling fluctuation on axis, $C_\ax^{36}$.
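
This example can be written out using only the lever arm rule stated above, i.e.~an additional translation $\omega\times p\ts{ax}$ of the on-axis frame, with $R\ts{ax}=I_3$. As a sketch, let $p\ts{ax}=x\,d_1$, where the signed offset $x$ is a symbol introduced here; the signs of the terms linear in $x$ depend on the orientation convention chosen for $p\ts{ax}$. The on-axis rise fluctuation is then
\begin{equation*}
 v_\ax^3 = v^3 + (\omega\times p\ts{ax})^3 = v^3 - x\,\omega^2 ,
\end{equation*}
where $\omega^2$ denotes the Roll component. Taking second moments gives
\begin{equation*}
\begin{split}
 C_\ax^{66} &= \E{(v_\ax^3)^2} = C^{66} - 2x\, C^{26} + x^2\, C^{22}, \\
 C_\ax^{36} &= \E{\omega^3 v_\ax^3} = C^{36} - x\, C^{23}.
\end{split}
\end{equation*}
The Roll--Rise and Roll--Twist couplings thus enter in first order in the offset, and the Roll fluctuations enter in second order, in line with the leverage terms of \eqref{eqn:explicitCax}.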

689:

690: When Inclination or Tip are nonzero, then due to the additional rotation $R\ts{ax}$ also Shift and Slide fluctuations contribute to the resulting WLC parameters. It is therefore essential to transform to an on-axis frame before averaging over the shear degrees of freedom.

691:

692: \subsection{Numerical verification}\label{sec:numveri}

693:

694: We tested our coarse--graining relations by performing a simple--sampling Monte Carlo simulation. After generating a random sequence, for each dinucleotide, random conformations were drawn according to a Gaussian distribution with the corresponding microscopic parameters. The measured mean squared base--pair center end--to--end distances are shown in fig.\ \ref{fig:thermalandstaticr2}. The theoretical curves

695: $\langle R^2\rangle = 2 l l_b -2l_b^2(1-e^{-l/l_b})$ for an inextensible WLC using the computed \new{contour} and bending persistence lengths, $l$ and $l_b$,  fit the simulation data to within numerical error. The only exceptions occur below 3 nm, where the inextensible WLC model is not a good description for the full shearable helical RBC.
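
Two limits of this expression serve as a quick consistency check when comparing to the simulation data; they follow from expanding the exponential and involve no further assumptions:
\begin{equation*}
 \langle R^2\rangle \simeq
 \left\{
 \begin{aligned}
  & l^2\Bigl(1-\frac{l}{3\,l_b}\Bigr), & l\ll l_b, \\
  & 2\,l\,l_b\Bigl(1-\frac{l_b}{l}\Bigr), & l\gg l_b .
 \end{aligned}
 \right.
\end{equation*}
Short chains therefore behave as rigid rods of length $l$, while long chains show the random--walk scaling with Kuhn length $2l_b$.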

696:

697: \begin{figure}[htp]

698: 	\centering

699: 	\includegraphics[clip=true,width=\columnwidth]{thermalandstaticr2AI2}

700: \caption{Comparison of a simple--sampling Monte Carlo simulation of random--sequence DNA to our theory. Symbols designate the measured mean squared end--to--end distances for static disorder only (upper row) and for static plus thermal fluctuations (lower row). The theoretical curves assuming a WLC model for static disorder, uncorrelated static disorder, and for static plus thermal fluctuations are shown in blue, red and orange, respectively. The MD microscopic parameter set is used, as explained below.}

701: \label{fig:thermalandstaticr2}

702: \end{figure}

703:

704:

705: \section{WLC parameters of different RBC parameter sets}

706:

707: As a result of the coarse--graining procedure outlined above, we obtain a set of WLC parameters from a set of sequence--dependent RBC stiffness (or covariance) matrices and equilibrium offsets. There are several different parameter sets available in the literature, extracted from analysis of X-ray crystal structures of DNA \cite{olson98} and from molecular dynamics simulation \cite{lankas03,lankas06}.

708:

709: For the stiffnesses obtained from structural data, the missing thermal energy scale is substituted by an ``effective temperature''. We here use the effective temperatures determined in a previous study \cite{becker06} by equating the total, microscopic fluctuation strengths of the crystal and MD covariance matrices. The absolute magnitudes of all parameters derived from structural data (B for B-DNA crystal and P for Protein$\bullet$DNA cocrystals) therefore depend on our choice of effective temperature. Still, their relative magnitudes are properties of the microscopic structural data set independent of this choice. No such restrictions apply to the MD parameters (MD), since here the temperature is set by the simulation. We also include a hybrid parametrization (MP) which combines the equilibrium values from the P$\bullet$DNA dataset with the stiffness matrices from MD. This combined potential compared favorably to the others in binding affinity prediction \cite{becker06}. It can be seen as a version of the MD potential which is corrected for the well--known undertwist occurring in MD simulations. For MD and MP, our coarse--graining involves no free parameter.

710:

711: % I.e, MD persistence lengths are the true persistence lengths as predicted by the simulation, whereas B and P persistence lengths result from a global scaling so that microscopic fluctuations match those of MD.

712:

713:

714: In table \ref{tab:randEHWLC} we show the resulting WLC stiffness parameters and geometry. For the crystal parameter sets, the equilibrium rise and twist are close to the commonly accepted values of 0.34 nm/step and 10.5 bp/turn. The MD rise and twist are both low, a known effect for the force field used in that study \cite{beveridge04}.

715: % This combination of undertwist and low rise is counterintuitive: Thinking in terms of rigid base pairs and sugar--phosphate backbones of constant length, one would expect undertwist to be accompanied by a bigger Rise per bps.

716:

717: The MD bending persistence length is smaller than the commonly accepted values at physiological conditions, see e.g.\ \cite{gore06}. It is also somewhat below the range of $45$--$47$~nm found experimentally \cite{wang97,baumann97,wenner02} at the simulation conditions of $\simeq 100$~mM Na$^+$. (However, in \cite{salomo06} a lower experimental value is reported.) The low equilibrium Rise of the MD conformations accounts for half of this deviation.

718:

719:

720:

721: % salomo06: 20 (PBS) or 29wlc - 40winkler (pH 7) NaHPO4

722: % baumann97:45-50 at 100 mM

723: % wenner02: 47 at 100 mM and 45 for 1000.

724: % wang97: 47 at 10 mM

725:

726:

727:

728:

729: %TODO: check the quantitative influence of the axis offset. DONE.

730: % It is not very pronounced, and different for MD, P,  B.

731:

732: %TODO: Include MP, since gives best results.

733:

734: \begin{table}

735: \begin{tabular}{|c|cc|cc|cccc|}

736: \hline

737: & $\Bigr.\frac{2\pi}{\theta_\ax}$ & $h_\ax $ &$ l_b$ & $l_t$ & $\beta \hav S^{11}$&$\beta \hav S^{33}$ & $\beta \hav S^{44}$ & $\beta \hav S^{34}$\\

738: \hline

739: B    & 10.1 & 0.334  &  27.1 & 15.2 & 81.1 & 46.7 & 1300. & -39.9 \\

740: P    & 10.5 & 0.334  & 43.4 & 35.7 & 130. & 117. & 1280. & -116. \\

741: MD & 11.9 & 0.318 & 38.9 & 45.1 & 122. & 158. & 586. & -96.3\\

742: MP & 10.5 & 0.334 & 42.8 & 47.8 & 128. & 150. & 1020. & -81. \\

743: \hline

744: units & 1 & nm & nm & nm & $\Bigr.\text{rad}^{-2}$ &$ \text{rad}^{-2}$ & $\text{nm}^{-2}$ & $(\text{nm}\,\text{rad})^{-1}$  \\ \hline

745: \end{tabular}

746: \caption{WLC geometry, persistence lengths and stiffness parameters for the considered potentials. In our units, $\beta \hav S^{11}$ and $\beta \hav S^{33}$ are the bending and twisting persistence lengths, given in base pairs.}\label{tab:randEHWLC}

747: \end{table}

748:

749: The twisting persistence lengths of all parameter sets are similar to the bending persistence lengths, which is in stark contrast to measurements of twisting persistence in single--molecule studies which give a value close to $100$ nm, see \cite{charvin04} for a review. For the crystal parameter sets one might argue that this indicates that torsional deformations carry more elastic energy than bending deformations, thus `violating' an assumed equipartition of energy. However, for the MD parameter set, this is clearly not the case; the simulated DNA oligomers were indeed more twistable than experimental values for DNA suggest.

750:

751: % Check detailed interpretation of TS coupling. If interesting, into new paper !!

752:

753: \begin{table}

754: \begin{tabular}{|c|cccc|}

755: \hline

756: $\Bigr.$ & $\beta \hav S^{11}$&$\beta \hav S^{33}$ & $\beta \hav S^{44}$ & $\beta \hav S^{34}$\\

757: \hline

758: Gore \emph{et al.}\cite{gore06} &  $163 \pm 15$&  $327\pm 15$ & $781 \pm 150$ &$ -64 \pm 15 $ \\

759: Lionnet  \emph{et al.}\cite{lionnet06}& & $294$ & $ 710$ & $-47 \pm 20$ \\

760:  \hline

761: units & $\Bigr.\text{rad}^{-2}$ &$ \text{rad}^{-2}$ & $\text{nm}^{-2}$ & $(\text{nm}\,\text{rad})^{-1}$  \\ \hline

762: \end{tabular}

763: \caption{Experimental stiffness parameters as given in the literature, converted to our single--step units. The conversion factor for $B,C,G,S$ from \cite{gore06} is $\beta /h_\ax$. The conversion factors for $B,C,D$ in \cite{lionnet06} are, respectively, $\theta_\ax^2/h_\ax^3,{1}/{h_\ax},{\theta_\ax}/{h_\ax^2}$. Beware of a missing $1/2$ factor in their first formula.}\label{tab:litEHWLC}

764: \end{table}

765:

766: The twist--stretch coupling is negative in all cases. This is counter--intuitive since it implies that DNA overwinds in linear response to stretching. The same sign of the coupling is also found in the ``naive'' Twist--Rise coupling stiffness of the original stiffness matrices for (8, 9, 10) of the 10 unique base-pair steps in the (B, P, MD) parameters, respectively. Negative twist--stretch coupling has recently been observed in single--molecule experiments at low applied tension \cite{lionnet06,gore06}. We show the full elastic parameters collected in these articles in table \ref{tab:litEHWLC} for comparison. Generally, the agreement between the microscopic parameter sets and single--molecule data is better for the twist--stretch coupling than for the twisting rigidity.

767: The stretching modulus $\beta \hav S^{44}$ differs by about a factor of 2 between the crystal and MD parameters, with the experimental value in between. We remark that no rescaling by a different effective temperature can bring all crystal stiffness parameters into reasonable agreement with experiment since the various deviations occur in opposite directions.

768: % current knowledge on the elastic coefficients in the Bryant / Gore / Bustamante papers:

769: % 03 paper: measures the twist compliance not the stiffness. No change found between

770: %  tensions of positive vs. negative TS coupling.

771: %  Possible explanation: the coupling strength is much lower than the diagonal elements

772: % so naive formulas are not so wrong

773: %  06 paper: again, TS compliance is measured but the conversion to TS stiffness

774: %  indeed gives their value for g if the 03 values are taken as stiffnesses.

775: % The conversion works by solving a quadratic equation. This seems to be what they have done.

776: %  The second mode of measuring is analyzed correctly if the stretching constant is a true stiffness

777: %  It is consistent with the previous results.

778: \begin{table}

779: \begin{tabular}{|c| cccc | cccc |}

780: \hline

781:  & \multicolumn{4}{c|}{$l_b/\text{nm}$} &  \multicolumn{4}{c|}{$l_t/\text{nm}$} \\

782:  & full & thermal & static & static' & full & thermal & static & static' \\

783:  \hline

784:  B &  27.1 & 29.5 &327. &   211. &  15.2  & 15.4 & 1260& 88.3 \\

785:  P & 43.4 &  45.3 &  1040. & 575. &  35.7 & 36.3 & 2430 & 172. \\

786:  MD &  38.9 &  42. &  519. & 175. &  45.1 & 47.7 & 818. & 256. \\

787: MP & 42.8 & 44.6 & 1040. &   575. &47.8 & 48.8 & 2340.& 172. \\

788:  \hline

789: \end{tabular}

790: \caption{Thermal and static contributions to the apparent persistence length for different potentials. For comparison, the static' column shows the static persistence lengths when sequence continuity is disregarded.}\label{tab:stattherm}

791: \end{table}

792:

793: Instead of looking at random DNA, we can consider thermal and sequence fluctuations separately. Table \ref{tab:stattherm} shows the corresponding static and thermal persistence lengths \cite{trifonov88}. It follows from eqn.\ \eqref{eqn:Cstaticthermal} that their inverses add up to give the inverse apparent (or random DNA) persistence length. In disagreement with the cryo--EM study \cite{bednar95} we find that the static persistence lengths are much higher than the thermal ones, leading to a correction of only a few nm to the random DNA persistence lengths. This is in accordance with cyclization data \cite{vologodskaia02}. Also, the static $l_b$ for the P parameter set correctly reproduces the value found numerically in that study, using the same parameter set. When we disregard the requirement of sequence continuity by setting to zero all $\four C_{1}$ contributions in \eqref{eqn:nearestneighbor}, static variability is strongly overestimated (more than tenfold for twist).
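As a quick consistency check of this additivity, a sketch using only the rounded entries of table \ref{tab:stattherm}, consider the bending persistence length of the B parameter set. The subscripts `thermal' and `static' are used here merely as labels for the corresponding table columns:
\begin{equation*}
\frac{1}{l_{b,\text{thermal}}}+\frac{1}{l_{b,\text{static}}}
=\frac{1}{29.5\,\text{nm}}+\frac{1}{327\,\text{nm}}
\approx 0.0370\,\text{nm}^{-1}
\approx\frac{1}{27.1\,\text{nm}},
\end{equation*}
which reproduces the `full' entry to within rounding. The static term contributes less than a tenth of the total, which is why the correction to the thermal value is only about 2 nm.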

794:

795:

796: % \begin{table}

797: % \begin{tabular}{|c|cc|cc|}

798: % \hline

799: % \multirow{2}{*}{repeat}& \multicolumn{2}{c|}{$l_b/\text{nm}$} &\multicolumn{2}{c|}{$l_t/\text{nm}$} \\

800: % 	&	P & MD &  P & MD \\

801: % 	\hline

802: %  AA &  40.2 & 45.9 & 53.5 & 46.1 \\

803: %  AC &   49.5 & 40.2 & 29. & 42.3 \\

804: %  AG &   49.2 & 45.7 & 44.2 & 48.6 \\

805: %  AT &  40.1 & 33. & 34.9 & 61.8 \\

806: %  GG &   48.7 & 51.9 & 32. & 59.7 \\

807: %  CG &   40.2 & 38.4 & 34.6 & 38.2 \\

808: %  \hline

809: % \end{tabular}

810: % \caption{Comparison of persistence lengths of all six unique repetitive sequences of period two, for the P$\bullet$DNA and MD parametrizations. Their correlation is rather weak.}\label{tab:twostep}

811: % \end{table}

812:

813: \begin{table}

814: \begin{tabular}{|cc|cccccc|}

815: \hline

816:  &  & AA&AC&AG&AT&GG&CG \\

817: 	\hline

818: \multirow{4}{*}{$\frac{l_b}{\text{nm}}$}

819:  & B &  32.2 & 18.8 & 43.2 & 35. & 47.5 & 27.1 \\

820:  & P  &  40.2 & 49.5 & 49.2 & 40.1 & 48.7 & 40.2\\

821:  & MD  & 45.9 & 40.2 & 45.7 & 33. & 51.9 & 38.4 \\

822:  & MP  & 47. & 44.1 & 46.3 & 37. & 53.8 & 42.1  \\

823:  \hline

824: \multirow{4}{*}{$\frac{l_t}{\text{nm}}$}

825: & B  &9.4 & 7.35 & 25.6 & 18.7 & 19.1 & 26.  \\

826: &P  & 53.5 & 29. & 44.2 & 34.9 & 32. & 34.6  \\

827: &MD  & 46.1 & 42.3 & 48.6 & 61.8 & 59.7 & 38.2  \\

828: &MP  & 45.6 & 44.2 & 50. & 63. & 60.4 & 40.2 \\

829:  \hline

830: \end{tabular}

831: \caption{Comparison of persistence lengths of all six unique repetitive sequences of period two, for the B, P, MD and MP parametrizations.}\label{tab:twostep}

832: \end{table}

833: The range over which the stiffness of random B-DNA can vary depending on sequence can be estimated from the persistence lengths of all six unique repetitive sequences of period 2, given in table \ref{tab:twostep}, see also \eqref{eqn:compoundsigma}. Generally, $l_b$ has similar dependence on the sequence in all considered potentials, while the predictions for $l_t$ are less correlated. The large deviations in the B-DNA parameter set are likely due to insufficient statistics \cite{olson98}.

834: The TA(=AT) repeat stands out as the most bendable sequence which is at the same time torsionally stiff. Another common trend is that poly-G DNA is comparatively stiff with respect to bending.

835:

836: A more detailed view of the sequence variability of WLC stiffness is given in table \ref{tab:twostepfull} for the MP hybrid potential. The stretch modulus and the twist--stretch coupling depend on the sequence in a correlated way. The rightmost column shows the ratio of overtwist over elongation in response to an external stretching force, $r\ts{resp}=\hav C^{34}/\hav C^{44}$. When a repetitive sequence is cut by one bp and then stretched to the original length, the ``missing twist'' at the last bp ranges from 29 (AA) to 20 (AC) degrees undertwist.
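The link between $r\ts{resp}$ and the tabulated stiffnesses can be made explicit in a short sketch. It assumes that in the helically averaged matrices the twist--stretch block decouples from bending, so that only the $2\times 2$ block with indices $3,4$ has to be inverted; the symbol $D$ is introduced here for the determinant of that block:
\begin{equation*}
\begin{pmatrix}\hav C^{33} & \hav C^{34}\\ \hav C^{34} & \hav C^{44}\end{pmatrix}
=\frac{1}{\beta D}
\begin{pmatrix}\hav S^{44} & -\hav S^{34}\\ -\hav S^{34} & \hav S^{33}\end{pmatrix},
\quad
D=\hav S^{33}\hav S^{44}-(\hav S^{34})^2 .
\end{equation*}
Under this assumption $r\ts{resp}=-\hav S^{34}/\hav S^{33}$. For the AC repeat in table \ref{tab:twostepfull} this gives $105/142\approx 0.74\,\text{rad}/\text{nm}$, in agreement with the tabulated value, and a negative coupling $\hav S^{34}$ indeed corresponds to overtwisting upon elongation.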

837:

838:

839: \begin{table}

840: % \begin{tabular}{|c|cccc|cccc|}

841: % \hline

842: % \multirow{2}{*}{repeat}& \multicolumn{4}{c|}{P} &\multicolumn{4}{c|}{MD} \\

843: % $\Bigr.$ & $\beta \hav S^{11}$&$\beta \hav S^{33}$ & $\beta \hav S^{44}$ & $\beta \hav S^{34}$	&

844: %   $\beta \hav S^{11}$&$\beta \hav S^{33}$ & $\beta \hav S^{44}$ & $\beta \hav S^{34}$ \\

845: % 	\hline

846: %  AA & 123. & 166. & 1430. & -50. & 142. & 153. &   766. & -87.3 \\

847: %  AC & 149. & 98.8 & 1400. & -128. & 130. & 153. &   688. & -102. \\

848: %  AG & 148. & 151. & 1180. & -148. & 139. & 163. &  700. & -101. \\

849: %  AT & 120. & 145. & 1640. & -259. & 109. & 216. &   497. & -78.8 \\

850: %  GG & 144. & 98.8 & 1120. & -67.3 & 158. & 191. &   446. & -63.1 \\

851: %  CG & 119. & 109. & 1210. & -87.6 & 123. & 134. &   523. & -79.4\\

852: %  \hline

853: % \end{tabular}

854: % \caption{Comparison of stiffness parameters of all six unique repetitive sequences of period two, for the P$\bullet$DNA and MD parametrizations. The stiffness constants have units as in table \ref{tab:litEHWLC} and are given per bps.}\label{tab:twostepfull}

855: \begin{tabular}{|c|cccc|c|}

856: \hline

857: $\Bigr.$ & $\beta \hav S^{11}$&$\beta \hav S^{33}$ & $\beta \hav S^{44}$ & $\beta \hav S^{34}$ & $r\ts{resp}$ \\

858: \hline

859:  AA  & 144. & 141. & 976. & -38.3  &0.27 \\

860:  AC  & 132. & 142. & 1140. & -105. & 0.74 \\

861:  AG  & 139. & 159. & 1120. & -103. & 0.64 \\

862:  AT  & 111. & 195. & 975. & -80.1  &0.41 \\

863:  GG  & 159. & 186. & 1090. & -89.9 & 0.48\\

864:  CG  & 124. & 126. & 831. & -78.5  &0.62\\

865: \hline

866: units & $\Bigr.\text{rad}^{-2}$ &$ \text{rad}^{-2}$ & $\text{nm}^{-2}$ & $(\text{nm}\,\text{rad})^{-1}$ & $\text{rad}/\text{nm}$ \\

867:  \hline

868: \end{tabular}

869: \caption{Comparison of stiffness parameters of all six unique repetitive sequences of period two, for the MP hybrid parametrization.}\label{tab:twostepfull}

870: \end{table}

871:

872:

873:

874: \section{Variability of stiffness}

875:

876: \subsection{Bend angle distributions for short chains}

877:

878: The combined covariance matrix $\four C_{1\,m+1}$ gives the second moment of the distribution $p_{1\,m+1}$ of deformations, observed in a thermal ensemble of random sequence oligonucleotides of length $m$ steps. Here it is not necessary that the single step deformation distributions have a Gaussian shape. Indeed such an assumption depends on the choice of coordinates, and is not justified  by experiments.

879: % If the stiff rod approximation were valid for large $m$ (which it is not), $p_{1\,m+1}$ would approach a Gaussian. For compound steps of only a few bases, it will retain some more general shape.

880: Nevertheless, let us for the moment additionally assume that the single step deformation distributions are in fact Gaussians. In that case the deformation of a specific compound step again follows a Gaussian distribution $p(\eta_{1\,m+1}|\sigma_{1\,m+1})$, since it is the result of a convolution. However even in this case, after averaging over sequence randomness, the corresponding deformation distribution of a random compound step $p_{1\,m+1}(\eta_{1\,m+1})$, deviates from a Gaussian shape. This comes about by averaging together several Gaussians with different offsets and widths.
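The origin of the non-Gaussian shape can be seen in a one--dimensional sketch. We assume zero--mean Gaussians for a scalar deformation $x$ whose variance $s^2$ depends on the sequence, neglect the offsets, and write $\E{\cdot}_\sigma$ for the sequence average; the symbols $x$ and $s$ are illustrative only and not used elsewhere in this article. Then
\begin{equation*}
\E{x^2}=\E{s^2}_\sigma,\quad
\E{x^4}=3\E{s^4}_\sigma,\quad
\frac{\E{x^4}}{3\E{x^2}^2}-1
=\frac{\E{s^4}_\sigma-\E{s^2}_\sigma^2}{\E{s^2}_\sigma^2}\geq 0 .
\end{equation*}
The excess kurtosis vanishes only if all sequences share the same variance. Otherwise the mixture has heavier tails than the single Gaussian with matching second moment.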

881: To illustrate this point, we show in fig.\ \ref{fig:baveff} the effective potential $U\ts{eff}$ for the total bend angle $\vartheta=((\eta_{1,m+1}^1)^2+(\eta_{1,m+1}^2)^2)^{1/2}$ of random sequence compound steps of different lengths. It is extracted from histograms of a simulation as described in section \ref{sec:numveri}.

882: \begin{figure}

883:  \centering

884:  \includegraphics[clip=true,width=\columnwidth]{bendangles3}

885:  \caption{Effective potential for the total bend angle $\vartheta$ (green, symbols). The blue curves show the harmonic approximation to the effective potential that yields the same variance $\E{\vartheta^2}$. Compound step length, from left to right: 1,2,3,5,10 bp. MP parameter set.}

886:  \label{fig:baveff}

887: \end{figure}

888: For compound steps shorter than 5 bp, the effective potentials stay well below their respective harmonic approximations, which are tailored to reproduce the second moment of the bend angle distribution. These second moments agree to within 1\% with the bend angle variance of a WLC model with the same persistence length, again confirming our calculation. Thus for short random chains, large bending angles occur much more frequently than would be expected from a WLC model with matching persistence length.

889: This effect is the combined result of the varying bending stiffness coming from sequence as well as from anisotropic bending (see below). For the parametrizations we considered, this effect is very small for compound steps above 5 bp and is thus insufficient to explain the frequent large bending angles observed in a recent AFM study of DNA adsorbed on a surface \cite{wigginsnew}, on length scales of more than 15 bp.

890:

891: We note that the shape of the bend angle distribution depends on what exactly is considered the local bend angle. Instead of $\vartheta$ as defined above one could take the angle between vectors $(p_{i+1}-p_{i})$ connecting successive bp centers. For this choice, below 5 bp steps, the opposite behavior is seen: The second moment is increased while extreme bend angles are suppressed compared to the WLC prediction (data not shown), although both bend angle definitions agree on scales longer than a helical turn.

892:

893: \subsection{Decay of variability}

894:

895: % TODO Decide whether this semi analytical calculation of the spread in stiffness is worth putting in.

896:

897: To investigate the shape of $p_{1\,m+1}$ for small $m$ in some more detail, first consider a compound step with a fixed $m$ step sequence $\sigma_{1\,m+1}=(\sigma_{12},\dots,\sigma_{m\,m+1})$.

898: Essentially just by disallowing sequence randomness in \eqref{eqn:compoundCfour}, we compute the combined covariance matrix of this compound step to be

899:  \begin{equation}\label{eqn:compoundsigma}

900:  \begin{split}

901:  \four C_{\sigma_{1\,m+1}} = \sum_{l=0}^{m-1} \four \Ad{g_{0}^{-l}}

902:  \four C_{\sigma_{m-l\,m+1-l}}  \four \Ad^\Tp{g_{0}^{-l}},

903:  \end{split}

904: \end{equation}

905: valid for small deviations relative to the sequence--averaged equilibrium step $g_0^m$.
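As a concrete instance of \eqref{eqn:compoundsigma}, written out here only for illustration, the sum for $m=2$ has just two terms,
\begin{equation*}
\four C_{\sigma_{1\,3}} = \four C_{\sigma_{2\,3}}
+ \four \Ad{g_{0}^{-1}}\four C_{\sigma_{1\,2}}\four \Ad^\Tp{g_{0}^{-1}} .
\end{equation*}
The covariance of the last step enters unchanged ($l=0$), while that of the first step ($l=1$) is transformed by the adjoint of one inverse equilibrium step before being added.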

906:

907: We can describe the sequence variability of compound step covariances in terms of their first moments in the same way as was done for the equilibrium conformations in section \ref{sec:seqrandomness}. While the mean covariance matrix $M^{ij}=\E{\four C^{ij}_{\sigma_{1\,m+1}}}$ is just the sequence average of \eqref{eqn:compoundsigma}, the covariances of the entries of the thermal covariance matrix are given by

908: \begin{equation}\label{eqn:covarofcovar}

909: V_{1\,m+1}^{ijkl}=\E{(\four C_{\sigma_{1m+1}}^{ij}-M^{ij})(\four C_{\sigma_{1m+1}}^{kl}-M^{kl})}.

910: \end{equation}

911: This expression can be split into diagonal and nearest neighbor terms in analogy to \eqref{eqn:nearestneighbor}, again reflecting sequence continuity. In particular, it is impossible to combine two of the comparatively soft pyrimidine--purine \cite{olson98} steps in a row.

912: % In effect, extreme combinations of stiffnesses are suppressed and short compound steps have less spread in compound covariance than one would expect from independent steps. The parameters $M$ and $V$ can be computed when a complete set of covariance matrices is available. To compute the shape of $p_{1,m+1}(\eta_{1\,m+1})$ for Gaussian single steps, we can then assume a model distribution $p(C)$ for $C=\four C_{\sigma_{1\,m+1}}$ which has the first moments $M,V$ and average over it:

913: % \begin{equation}\label{eqn:analyticalpeta}

914: % p_{1,m+1}(\eta_{1\,m+1}) = \int dC p(C) N_{(0,C)}(\eta_{1\,m+1})

915: % \end{equation}

916: % where $N_{(0,C)}$ is a Gaussian with mean zero and covariance $C$. The integration over $C$ can only be done numerically. The result depends somewhat on the choice of model distribution $p(C)$. We have neglected here the variability of the mean values. This is seen in simulations to be a minor effect.

917: The resulting relative spread of thermal persistence lengths among random sequence compound steps is shown in fig.\ \ref{fig:widthdecay}. Explicitly, $\Delta l_b=(V_{1\,m+1}^{1111})^{1/2}$ and $\Delta l_t=(V_{1\,m+1}^{3333})^{1/2}$.

918: \begin{figure}

919:  \centering

920:  \includegraphics[width=\columnwidth]{widthdecay2}

921:  \caption{Relative spread $\Delta l/l$ of the bending ($l_b$, green) and twist ($l_t$, blue) persistence lengths vs.~compound step length.

922: %The full results (diamonds, line) decay more quickly than the version ignoring sequence continuity (triangles).

923: Ignoring sequence continuity leads to overestimation of the stiffness variability (triangles).}

924:  \label{fig:widthdecay}

925: \end{figure}

926: Note that already after one full turn, variability in stiffness is down to 5 \%, and that sequence continuity results in reduced variability compared to a model with independent step sequences.
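The reference case of independent steps can be made explicit in a scalar sketch. We assume, purely for illustration, that a compound quantity $X_m=\sum_{k=1}^{m}x_k$ is a sum of $m$ independent, identically distributed single--step contributions $x_k$ with mean $\mu$ and standard deviation $s$, ignoring the frame transformations in \eqref{eqn:compoundsigma}. The symbols $X_m$, $x_k$, $\mu$ and $s$ are not used elsewhere. Then
\begin{equation*}
\frac{\Delta X_m}{\E{X_m}}=\frac{\sqrt{m}\,s}{m\,\mu}=\frac{1}{\sqrt{m}}\,\frac{s}{\mu} .
\end{equation*}
In this idealized case the relative spread decays as $m^{-1/2}$. The correlations between adjacent steps imposed by sequence continuity are not captured by this estimate.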

927:

928: % {\color{blue} Theory: only quote the formula, very shortly!!}

929:

930: In summary, we remark that the detailed shape of the deformation distributions is not known, and there is no reason to believe it should be Gaussian for small step numbers. Even when starting with Gaussians for the single steps, we obtain clearly non-Gaussian shapes for the random sequence bend angle distributions up to a few steps.

931: For the long--wavelength behavior of the chain, the relevant quantities are just the first and second moments which we have calculated in section \ref{sec:coarse}.

932:

933: \subsection{Anisotropic bending}

934:

935: Another feature of short compound steps is their anisotropic bending stiffness. It is clear that on scales much longer than a full turn, the molecule behaves as a uniformly bending rod, at least for small deformations. Using the compound covariance $\four C$ (before helical averaging is performed, see  \eqref{eqn:compoundCfour}) we can quantify the decay of anisotropy for random sequence chains. The ratio of the principal bending stiffnesses as a function of  chain length is shown in fig.\ \ref{fig:isoaniso}.

936: \begin{figure}

937:  \centering

938:  \includegraphics[width=\columnwidth]{isoaniso2}

939:  \caption{Bending anisotropy. The ratio of larger over smaller bending stiffness decays in an oscillating fashion with compound step length. MP parameter set.}

940:  \label{fig:isoaniso}

941: \end{figure}

942: Since linear response is always symmetric, bending into major and minor groove has the same stiffness for small deformations. As a result, the bending anisotropy has minima every \emph{half} turn of the double helix. Since the 21 bp chain has exactly two full turns, the anisotropy is suppressed completely. However, a 5 bp compound step is already close to the half turn of 5.25 bp and behaves essentially isotropically.
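The half--turn periodicity can be illustrated with a minimal sketch. We assume identical steps, each rotating the frame by $\alpha=2\pi/10.5$ about the helix axis (10.5 bp per turn, as implied by the half turn of 5.25 bp), and keep only the $2\times 2$ bending block of the single--step covariance. This block is split into an isotropic part $c$ and a traceless anisotropic part encoded as a complex number $a$; the symbols $\alpha$, $c$ and $a$ are illustrative and not used elsewhere. Under a rotation by $\varphi$ the traceless part picks up a phase $e^{2i\varphi}$, so that after $m$ steps
\begin{equation*}
\frac{\text{anisotropic part}}{\text{isotropic part}}
=\frac{\lvert a\rvert}{m\,c}\,\Bigl\lvert\sum_{l=0}^{m-1}e^{2il\alpha}\Bigr\rvert
=\frac{\lvert a\rvert}{c}\,\frac{1}{m}\,\Bigl\lvert\frac{\sin m\alpha}{\sin\alpha}\Bigr\rvert .
\end{equation*}
This vanishes whenever $m\alpha$ is a multiple of $\pi$, i.e.\ every half turn. For $m=5$ the relative anisotropy is reduced to about $5\%$ of its single--step value $\lvert a\rvert/c$ within this sketch.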

943:

944: \section{Conclusions}

945:

946: In this article we have shown a way to quantitatively connect experiments on DNA elasticity on different length scales. Starting from atomistic data, DNA deformations are described in terms of a rigid base--pair chain model in a first step \cite{gonzalez01}. We then relate the stiffness, expressed in terms of rigid base--pair deformations, to the long--wavelength WLC parameters of a random chain. In this coarse--graining step it is essential to properly account for the helical base--pair geometry. For this purpose we introduce an on-axis version of the rigid base--pair chain, which \emph{on average} has ideal B-DNA shape. This makes it straightforward to integrate over the shear degrees of freedom and helical phase, to finally obtain all four linear elastic constants allowed by the large--scale symmetry of the molecule \cite{kamien97,marko97,moroz97}.

947:

948: Our results allow a direct comparison of the different microscopic effective potentials to single molecule and cyclization experiments. It involves no free parameter for MD simulation data, and a single parameter (the effective temperature) for structural data. We find good qualitative agreement, including the negative sign of twist--stretch coupling.

949: Quantitatively, the microscopic bending persistence lengths agree best with recent single--molecule data. The twist persistence is about 50 \% lower, and the magnitudes of compressional modulus and twist--stretch coupling are roughly 50 \% higher than the mesoscopic experimental values.

950:

951: Does the involved computation of macroscopic parameters actually make a noticeable difference?

952: The calculations can  be simplified in two ways: By disregarding the details of average helical geometry of the chain, and by treating the base sequence of adjacent steps as independent, i.e.~disregarding sequence continuity.

953:

954: If the average helix geometry is treated correctly but sequence continuity is disregarded, static variability is strongly overestimated (table \ref{tab:stattherm}). Overall this remains a minor effect since the thermal fluctuations dominate. In addition, such a simplification leads to an overestimation of stiffness variability for short random oligonucleotides, see fig.\ \ref{fig:widthdecay}.

955:

956: On the other hand, one can exclude static variability and treat the helix geometry as ideal B-DNA from the beginning. Starting from the sequence--averaged, \emph{off-axis} covariance matrix, one would perform an average over Shift, Slide and helical phase angle and invert to get a ``naive'' stiffness matrix $S\ts{na}$. The relative error made in such a naive computation, $e^{ij}=(S\ts{na}^{ij}-\hav S^{ij})/\hav S^{ij}$, is shown in table \ref{tab:naivechange}.

957: \begin{table}

958: \begin{tabular}{|c|cccc|}

959: \hline

960: $\Bigr.$& $e^{11}$&$e^{33}$ & $e^{44}$ & $e^{34}$\\

961: \hline

962: B    &  12. & -1. & 4. & -23. \\

963: P    &   8. & -7. & 8. & 24. \\

964: MD &     11. & -9. & 67. & 52. \\

965: MP &     6. & -4. & -3. & 44.      \\

966: \hline

967: \end{tabular}

968: \caption{Relative error in stiffness parameters made when using naive matrix elements instead of the coarse--grained parameters described above. Values are given in per cent.}\label{tab:naivechange}

969: \end{table}

970: While the bending and twisting stiffnesses are well approximated by the naive guess, the error in stretch modulus and twist--stretch coupling is considerable. For these terms, leverage due to the axis offset becomes important as explained in section \ref{sec:cogrel}. Especially the naive twist--stretch coupling is not negative enough.

971: % Oddly, the wrong estimates agree better with the recent experiments detailed in table \ref{tab:litEHWLC}.

972:

973: The procedure we describe involves no approximations regarding the geometry. This makes it directly applicable to alternative DNA structures, once microscopic covariance matrices are available. In fact, the more the average geometry deviates from idealized B-DNA, the greater is the need to treat the helical geometry correctly. Already for the MD parameter set, the error when using the naive geometry is quite important.

974:

975: \new{

976: The main model assumption is that thermal deformation fluctuations of neighboring steps are independent. Another limitation of any rigid base--pair model is that \emph{internal} deformation fluctuations of a base--pair, such as propeller twist or buckle, are not explicit and thus effectively treated as uncorrelated between base pairs.

977:

978: Our framework can be extended to improve on both of these points. Nearest--neighbor correlations in base--pair parameters may be included by extending the model to a full Markov chain. Internal deformations could then be added by extending the configuration space, leading to a bi-rod \cite{moakher05b} in the continuum limit. However for either of these interesting generalizations, a microscopic parametrization is an open challenge in itself.

979: % is a challenge in itself, and none are yet available.

980: The fact that dinucleotide step stiffness depends overall rather weakly on the flanking sequence \cite{arauzo-bravo05} and the encouraging agreement with mesoscopic data we found, suggest that the main features of coarse--grained DNA elasticity are captured already by our more basic model.

981: }

982:

983: In view of an experimental precision of the order of one percent for the mesoscopic

984: bending rigidity \cite{vologodskaia02}, we consider a quantitatively correct relation between mesoscopic and microscopic stiffness parameters essential. We hope that the method presented in this article proves useful in providing this link.

985:

986: % \new{

987: % biggest approximation: no coupling beyond steps,

988: % electrostatics

989: %

990: % good agreement with meso experiments indicates that this is not so bad

991: %

992: % in principle, formalism can be extended to include such coupling or to Maddock's

993: % bi-rods

994: %

995: % problem: parameterization of more detailed models

996: %

997: % }

998:

999: % An extension of the presented methods to a rigid--base coarse--graining scheme appears feasible. It is however complicated by the fact that the corresponding continuous model \cite{moakher05b} is not linear.

1000: %

1001: %

1002: % Bring together different scales of DNA experiments.

1003: %

1004: % B-DNA not so different but completely general: can be applied to other forms of DNA.

1005: %

1006: % Difference to naive:

1007: %

1008: % 1. geometric effects: helix axis

1009: %

1010: % 2. sequence continuity

1011: %

1012: % Quantitative corrections from both. Precise lp measurements available-> good calculation is needed.

1013: %

1014: % Table: all 4 MS parameters: naive vs. calculated.

1015: \begin{acknowledgments}

1016: RE acknowledges support from the chair of excellence program of

1017: the Agence Nationale de la Recherche (ANR). NBB thanks B. Lindner for helpful discussions.

1018: \end{acknowledgments}

1019:

1020:

1021: \appendix

1022:

1023: \section{Coordinate conversion}\label{sec:coordconv}

1024:

1025: How does one obtain the covariance matrix $C$ and equilibrium conformations $g_0$ for a given collection $\{g_k\}_{1\leq k\leq N}$ of bp frame conformations? We can first determine $g_0$ by requiring that $\{g_0^{-1}g_k\}$ has mean 0 in exponential coordinates. For not too wide distributions, such a center always exists and is unique \cite{kendall90}. Then, $C^{ij}=\E{\xi^i\xi^j}$ is the standard covariance matrix of $\{g_0^{-1}g_k\}$ in exponential coordinates.
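One standard way to find such a center, given here only as a sketch and not necessarily the procedure behind the results of this article, is a fixed--point iteration in which the superscript $(n)$ counts iterations:
\begin{equation*}
g_0^{(n+1)} = g_0^{(n)}\exp\Bigl(\frac{1}{N}\sum_{k=1}^{N}\log\bigl((g_0^{(n)})^{-1}g_k\bigr)\Bigr),
\end{equation*}
started e.g.\ from $g_0^{(0)}=g_1$. At a fixed point the mean of $\{g_0^{-1}g_k\}$ in exponential coordinates vanishes, which is exactly the defining condition stated above.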

1026:

1027: However, for the potential parametrizations considered here, only the equilibrium values $\zeta_0$ and covariance matrices $C_\zeta^{ij}=\E{(\zeta-\zeta_0)^i(\zeta-\zeta_0)^j}$ with respect to the global coordinates $\zeta=(\Omega,\tau,\rho,q_1,q_2,q_3)$ as defined in \cite{lu97} and used in \cite{lu03}, are given. Here, $\theta=(\Omega,\tau,\rho)$ are Twist, Tilt and Roll angles but differ from our choice of angles. The $q=(q_1,q_2,q_3)$ gives the translation vector with respect to the mid-frame $R\ts{m}$. The conversion formulas are,

1028: \begin{equation}

1029: \begin{aligned}

1030: R(\zeta) =&\exp((\Omega/2-\arctan(\tau /\rho))\epsilon_3)\exp(

1031:       \sqrt{\rho^2 + \tau^2}\epsilon_2)\\

1032:      &\exp((\Omega/2 + \arctan(\tau /\rho))\epsilon_3),\\

1033: R\ts{m}(\zeta)=&\exp((\Omega/2-\arctan(\tau /\rho))\epsilon_3)

1034: \exp(\sqrt{\rho^2 + \tau^2}/2\epsilon_2)\\

1035: &      \exp((\arctan(\tau /\rho))\epsilon_3)\text{, and}\\

1036: p(\zeta)=&R\ts{m}(\zeta)q,

1037: \end{aligned}

1038: \end{equation}

1039: together determining the frame conformation $g(\zeta)$.

1040: % and the volume element is $\frac{\sin\sqrt{\rho^2 + \tau^2}}{\sqrt{\rho^2 + \tau^2}}d^6\zeta$.

1041: We checked that the variation of the volume element in the region of noticeable probability around $g_0$ is small compared to the variations in the probability density. Therefore neglecting the former, we get $g_0=g(\zeta_0)$. In linear order around the equilibrium position, we can then transform the covariance matrix $C_\zeta$ given in $\zeta$-coordinates to exponential coordinates using just the Jacobian matrix $J_0$ of the coordinate transition map $\zeta\mapsto \xi(\zeta)=\log(g(\zeta))$. This gives $C=J_0C_\zeta J_0^\Tp$. We have calculated $J_0=\frac{\partial\xi}{\partial\zeta}\evat{\zeta_0}$  analytically. Its $3\times 3$ blocks are

1042: \begin{equation}

1043: \begin{aligned}

1044: \frac{\partial\omega^i}{\partial\theta^j}=&1/2 \tr (\epsilon_{i}R^\Tp\partial_{\theta^j}R)\\

1045: \frac{\partial\omega^i}{\partial q^j}=&0\\

1046: \frac{\partial v^i}{\partial\theta^j}=&(R^\Tp\partial_{\theta^j}{R\ts{m}}q)^i\\

1047: \frac{\partial (v)}{\partial (q)}=&R^\Tp{R\ts{m}}.

1048: \end{aligned}

1049: \end{equation}

1050: % Their lengthy explicit formulas are available as a supplement / upon request (TODO). ??

1051: All coarse graining calculations presented in this article use the matrices $C$ converted in this way as a starting point.
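Written out in block form, the transformation $C=J_0C_\zeta J_0^\Tp$ has a simple structure. In the following sketch the block names $J_{\omega\theta}$, $J_{v\theta}$, $J_{vq}$ for the nonzero blocks of $J_0$, and $C_{\theta\theta}$, $C_{\theta q}$ for the blocks of $C_\zeta$, are introduced only for illustration. Since $\partial\omega/\partial q=0$, the Jacobian is lower block--triangular, and
\begin{equation*}
\begin{aligned}
J_0&=\begin{pmatrix}J_{\omega\theta}&0\\ J_{v\theta}&J_{vq}\end{pmatrix},\\
C_{\omega\omega}&=J_{\omega\theta}C_{\theta\theta}J_{\omega\theta}^\Tp,\\
C_{\omega v}&=J_{\omega\theta}\bigl(C_{\theta\theta}J_{v\theta}^\Tp+C_{\theta q}J_{vq}^\Tp\bigr).
\end{aligned}
\end{equation*}
The rotational block of $C$ thus depends only on the angular block of $C_\zeta$, whereas the rotation--translation coupling mixes angular and translational input covariances.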

1052:

1053: %current knowledge on symmetries: TODO

1054: The exponential coordinates of the equilibrium conformations have the usual symmetries under strand change and reading direction reversal: Denote by $\compl\sigma$ the  sequence complementary to  $\sigma$, e.g. $\compl{\mathrm{AG}}=\mathrm{CT}$, and let $\strandchange=\diag(-1,1,1,-1,1,1)$. Then as $\sigma\rightarrow \compl\sigma $, $\xi_0=\mathrm{(Ti_0,Ro_0,Tw_0,Sh_0,Sl_0,Ri_0)} \rightarrow \strandchange \xi_0$. Due to the $\xi_0$ dependent coordinate conversion above, the body--frame covariance matrix does \emph{not} obey the corresponding symmetries, $C\nrightarrow \strandchange C\strandchange$. While this may seem a serious drawback of the coordinate system we use here, it turns out that in the on--axis, shear and helical phase averaged covariance matrices, the strand--exchange symmetry is re-established. Therefore, our coarse-grained results are indeed independent of the reading sense.

1055:

1056: \section{Volume element}\label{sec:volel}

1057:

1058: In our coordinates, $\ln A(\xi)=-\tfrac{1}{6}\norm\omega^2 + O(\norm\omega^4)$, so that in a Gaussian approximation,

1059: \begin{equation}\label{eqn:pAcorrection}

1060: % A(\xi) = \frac{(2-2\cos\norm\omega)^2}{\norm\omega^4}

1061: % d\xi^1d\xi^2\cdots d\xi^6.

1062: p(\xi)dV_\xi \propto  e^{-\frac{1}{2}\xi^i(\beta {S_\sigma}_{ij}+\bar{A}_{ij})\xi^j}\,d^6\xi,\;

1063: \bar{A}=\begin{pmatrix}

1064:  \tfrac{1}{3}I_3& 0_3 \\

1065: 0_3 &  0_3

1066: \end{pmatrix}.

1067: \end{equation}

1068: Here, $I_3$ and $0_3$ are the $3\times 3$ identity and zero matrices, respectively.

1069: In DNA, the distributions $p(\xi)$ of single steps are very narrow. Therefore when computing moments, in particular the covariance matrix $C^{ij}=\E{\xi^i\xi^j}$, we can extend the integration boundaries to infinity with negligible error. Performing the integral we then get the relation $\beta S +\bar{A} = C^{-1}$. Since $\beta S \gg\bar{A}$, in making the approximation $\beta S = C^{-1}$, we introduce an error of less than 1\% for typical B-DNA steps. I.e.~the stiffness matrix $\beta S$ is indeed given by the inverse of the covariance.
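The origin of $\bar{A}$ and the size of the correction can be sketched as follows. To quadratic order, $\ln A=-\tfrac{1}{6}\norm\omega^2=-\tfrac{1}{2}\,\omega^i(\tfrac{1}{3}\delta_{ij})\omega^j$, which is the rotational block $\tfrac{1}{3}I_3$ of $\bar{A}$. For an order--of--magnitude estimate we borrow the smallest coarse--grained rotational stiffness of table \ref{tab:twostepfull}, $\beta \hav S^{11}\approx 111\,\text{rad}^{-2}$, as a proxy for single--step values, which is an assumption made only for this estimate:
\begin{equation*}
\frac{\bar{A}_{11}}{\beta S_{11}}\approx\frac{1/3}{111}\approx 0.3\% ,
\end{equation*}
consistent with the bound of 1\% quoted above.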

1070:

1071:

1072: \input{cg.bbl}

1073:

1074:

1075: \end{document}

1076:

1077: % \bibliographystyle{unsrt}

1078: % \bibliography{NilsLitStandard}

1079:

1080: