cond-mat0307006/part1
1: \documentclass[english,12pt]{article}
2: \usepackage[T1]{fontenc}
3: \usepackage[latin1]{inputenc}
4: \usepackage{babel}
5: \usepackage{cite}
6: 
7: \begin{document}
8: 
9: \title{\textbf{Energetics of Protein Thermodynamic Cooperativity: Contributions
10: of Local and Nonlocal Interactions}}
11: \author{Michael Knott, H\"useyin Kaya and Hue Sun Chan\footnote
12: {E-mail: chan@arrhenius.med.toronto.edu; Tel: +1~416~978~2697; Fax:
13: +1~416~978~8548; Mailing address: Department of Biochemistry,
14: University of Toronto, Medical Sciences Building, 1 King's College
15: Circle, Toronto, Ontario M5S 1A8, Canada.} \\
16: \normalsize{Protein Engineering Network of Centres of Excellence (PENCE),}\\
17: \normalsize{Department of Biochemistry, and} \\
18: \normalsize{Department of Medical Genetics \& Microbiology,} \\
19: \normalsize{Faculty of Medicine, University of Toronto,} \\
20: \normalsize{Toronto, Ontario M5S 1A8, Canada}}
21: 
22: \maketitle
23: 
24: \noindent
25: \textbf{Keywords:} calorimetry / G\=o models / two-state cooperativity/\\ 
26: single-domain proteins / radius of gyration
27: 
28: \section*{Abstract}
29: 
30: The respective roles of local and nonlocal interactions in the
31: thermodynamic cooperativity of proteins are investigated using
32: continuum (off-lattice) native-centric G\=o-like models with
33: a coarse-grained C$_\alpha$ chain representation. We study a
34: series of models in which the (local) bond- and torsion-angle terms
35: have different strengths relative to the (nonlocal)
36: pairwise contact energy terms. Conformational distributions in
37: these models are sampled by Langevin dynamics. Thermodynamic
38: cooperativity is characterized by the experimental criteria
39: requiring the van't Hoff to calorimetric enthalpy ratio
40: $\Delta H_{\rm vH}/\Delta H_{\rm cal}\approx 1$ (the calorimetric
41: criterion), as well as a two-state-like variation of the average
42: radius of gyration upon denaturation. We find that both local
43: and nonlocal interactions are critical for thermodynamic cooperativity.
44: Chain models with either much weakened local conformational propensities 
45: or much weakened favorable nonlocal interactions are significantly less 
46: cooperative than chain models with both strong local propensities and 
47: strong favorable nonlocal interactions. These findings are compared with 
48: results from a recently proposed lattice model with a local-nonlocal 
49: coupling mechanism; their relationship with experimental measurements of 
50: protein cooperativity and chain compactness is discussed.
51: 
52: \pagebreak
53:  
54: \section{Introduction}
55: 
56: How a globular protein can fold reliably into a particular three dimensional 
57: conformation \emph{in vitro}, without the participation of molecular 
58: chaperones, is a central puzzle in biophysics. If we wish not only to 
59: predict the folded state of a protein, but also to understand the folding 
60: phenomenon in terms of physical processes, we need to use physics-based 
61: methods: we run computer simulations of self-contained polymer models
62: \cite{Cha02}
63: that attempt to mimic the behavior of real protein molecules. A complete
64: quantum mechanical simulation, which would include the solvent molecules
65: in addition to all the atoms in the protein molecule, is not yet possible.
66: But in attempting to design simplified models, we face the problem
67: of how to simplify: which characteristics are essential and which
68: can be neglected? What effective energy functions does this imply
69: for the simplified system? 
70: 
71: Part of this general question is addressed in this article. Folding 
72: experiments on small globular proteins have long shown evidence of 
73: thermodynamic and kinetic cooperativity \cite{Jac91,Bak00}, which indicates a
74: phenomenon similar to a first order phase transition between native and 
75: denatured states. As our group has argued recently
76: \cite{Cha00,Kay00a,Kay00b,Kay02,Kay03a}, this 
77: observation 
78: can be exploited to constrain the set of possible simplified models and 
79: interaction schemes: for a particular simplified model to be a quantitatively
80: accurate representation of protein thermodynamics and kinetics, it is
81: essential that, when appropriately applied to a small globular protein,
82: it can produce the experimentally observed generic cooperative behavior.
83: 
84: Such constraints turn out to be rather stringent. It is nontrivial
85: to construct model interaction schemes that can produce proteinlike
86: cooperativities \cite{Cha00,Kay00a,Kay00b,Kay02,Kay03a}. 
87: A case in point is a class of common G\=o-like \cite{Tak75} 
88: models \cite{Kay03a,Cle00}. Their potential functions are native-centric, in
89: that they are 
90: explicitly biased to favor a given native structure. G\=o-like modeling
91: of proteins has provided important physical insights 
92: \cite{Cle00,Mic99,Zho99,Por01}. These include
93: an increasing number of elegant elucidations of functional protein dynamics 
94: under native conditions \cite{Hal97,Kes02,Isi02,Mic02,Jac01}. 
95: As for global folding and unfolding of 
96: proteins (in contrast to their near-native dynamics), a detailed discussion 
97: of the merits and limitations of G\=o-like approaches can be found in 
98: Ref.~\cite{Kay03a}. Notably, common G\=o-like models do not appear capable of
99: producing
100: simple two-state folding/unfolding kinetics. Instead, their chevron plots 
101: exhibit severe rollovers, which are typical of the class of folding
102: kinetics that is 
103: customarily referred to as non-two-state \cite{Kay03a,Kay03b}. 
104: Nonetheless, common
105: three-dimensional G\=o-like protein models seem sufficient to produce 
106: apparent two-state thermodynamic behavior \cite{Kay00a,Kay03a}, although their 
107: two-dimensional counterparts fail to do so \cite{Cha00}.
108: 
109: In the present investigation, we limit our scope to thermodynamic
110: cooperativity. Specifically, we aim to explore how protein 
111: thermodynamic two-state-like behavior is affected by the relative
112: strengths 
113: of local interactions (between residues close together along the chain 
114: sequence) and nonlocal interactions (between 
115: residues far apart along the chain sequence). The respective roles
116: of local and nonlocal interactions are an issue of long standing interest
117: in the study of protein energetics \cite{Go_78,Dil90,Abk95,Cha98,Bal99,Por01-2},
118: and the effect of analogous interactions on the phase diagram of lattice
119: polymers has also been investigated \cite{Deg75,Don96,Bas97,Doy98}. 
120: Here the issue is addressed
121: by varying the potential function in a series of coarse-grained G\=o-like 
122: models, which represent the protein as a string of C$_\alpha$ positions
123: in continuum space and which are simulated using Langevin dynamics. In
124: view
125: of the limitations of common G\=o models \cite{Kay03a}, the present study
126: should
127: be viewed as a first step in tackling the issue of local vs. nonlocal
128: interactions in cooperative continuum protein models. To assess
129: the robustness of our conclusions, results from the continuum Langevin
130: models 
131: are also compared with results from lattice model simulations.
132: 
133: We begin in section 2 by providing details of the models. An outline of the 
134: thermodynamics involved in interpreting the simulations is given
135: in section 3. Our findings are presented in section 4, and we conclude 
136: in section 5 with a discussion of the implications of our results.
137: \\
138: 
139: \section{Models and simulation details}
140: 
141: \subsection{Continuum models}
142: 
143: For the present continuum G\=o-like models we use a representation,
144: introduced by Clementi \emph{et al.} \cite{Cle00}, of the 64-residue truncated
145: form of chymotrypsin inhibitor 2 (CI2). The native contact set 
146: corresponds to NCS2 in Ref.~\cite{Kay03a}.
147: 
148: We use an energy function that is similar to one used previously 
149: \cite{Kay03a,Cle00,Kog01}.
150: The potential energy function \( V \), from which the conformational force
151: is derived, is given by 
152: \begin{equation}
153: \label{fda}
154: V=V_{\rm stretching}+V_{\rm bending}+V_{\rm torsion}
155: +V_{\rm native}+V_{\rm nonnative},
156: \end{equation}
157: where\begin{equation}
158: \label{fdb}
159: V_{\rm stretching}=\sum _{i=1}^{N-1}k_{l}\left( l^{i}-l^{i}_{0}\right) ^{2}
160: \end{equation}
161: contains a summation over the virtual bonds between pairs of residues,
162: \begin{equation}
163: \label{fdc}
164: V_{\rm bending}=\sum _{i=1}^{N-2}\varepsilon _{\theta }(\theta ^{i}-\theta
165: ^{i}_{0})^{2}
166: \end{equation}
167: involves a summation over the virtual bond angles between triplets of residues,
168: and
169: \begin{equation}
170: \label{fdd}
171: V_{\rm torsion}=\sum _{i=1}^{N-3}\left\{ \varepsilon _{\phi }^{(1)}\left[ 1-\cos
172: \left( \phi ^{i}-\phi ^{i}_{0}\right) \right] +\varepsilon _{\phi }^{(3)}\left[ 1-\cos 3\left( \phi ^{i}-\phi ^{i}_{0}\right) \right] \right\} 
173: \end{equation}
174: represents the virtual torsional potential between quadruplets of residues.
175: The latter contains a term with a single minimum as well as the traditional
176: three-minimum term \cite{Bra67}. 
177: [We note that there is an apparent typographical error in the corresponding
178: $V_{\rm torsion}$ in \cite{Cle00}, which effectively lists these terms
179: as $1+\cos\left( \phi ^{i}-\phi ^{i}_{0}\right)$ and 
180: $1+\cos 3\left( \phi ^{i}-\phi ^{i}_{0}\right)$. But such terms would
181: fold the chain into the mirror image of the PDB structure.]
182: \( V_{\rm stretching} \), \( V_{\rm bending} \)
183: and \( V_{\rm torsion} \) together account for the local interactions
184: (between residues that are separated by no more than three places
185: along the chain), which include local conformational propensities for
186: the native structure. The local interactions are expressed in this way
187: because it biases the local geometry of the chain. The fourth term,
188: \begin{equation}
189: \label{fde}
190: V_{\rm native}=\sum _{\left| i-j\right| \geq 4}\varepsilon
191: _{\rm native}\left[5\left(
192: \frac{r_{0}^{ij}}{r^{ij}}\right) ^{12}-6\left( \frac{r_{0}^{ij}}{r^{ij}}\right)
193: ^{10}\right], 
194: \end{equation}
195: sums over the pairwise interactions between residues that are regarded
196: as being in contact in the native structure; this accounts for the
197: nonlocal interactions (between residues that are separated by four
198: or more places along the chain). Finally,
199: \begin{equation}
200: \label{fdf}
201: V_{\rm nonnative}=\sum _{\left| i-j\right| \geq q}
202: \varepsilon \left( \frac{r_{\rm rep}}{r^{ij}}\right) ^{12}
203: \end{equation}
204: contains repulsive pairwise interactions between other pairs of residues,
205: in order to ensure the self-avoidance of the chain. 
206: 
207: The chain contains \( N \) residues. \( l^{i} \) is the length of
208: virtual bond \( i \), \( \theta ^{i} \) is a bond angle, \( \phi ^{i} \)
209: is a dihedral angle and \( r^{ij} \) is the distance between two
210: residues \( i \) and \( j \). The corresponding values in the native
211: structure are \( l^{i}_{0} \), \( \theta ^{i}_{0} \), \( \phi ^{i}_{0} \)
212: and \( r^{ij}_{0} \). The range \( r_{\rm rep} \) of the repulsive potential
213: between pairs of residues that are not bonded and do not interact
214: via a native contact interaction is set to 4~\AA. Length is expressed
215: in units of \AA, and energy in units of \( \varepsilon  \),
216: the energy parameter of the nonnative repulsive interaction, so that
217: \( \varepsilon  \) itself is unity. \( k_{l} \), \( \varepsilon _{\theta } \),
218: \( \varepsilon _{\phi }^{(1)} \), \( \varepsilon _{\phi }^{(3)} \)
219: and \( \varepsilon _{\rm native} \) are also parameters of the potential energy 
220: function. \( k_{l} \) is fixed at \( 100\varepsilon /\textrm{\AA} \),
221: but the other parameters can be varied; 
222: \( \varepsilon _{\theta }=20\varepsilon _{\phi }^{(1)} \)
223: and \( \varepsilon _{\phi }^{(3)}=0.5\varepsilon _{\phi }^{(1)} \)
224: are defined in terms of \( \varepsilon _{\phi }^{(1)} \), which can
225: be varied to test the effect of changing the strength of the local
226: interactions, while \( \varepsilon _{\rm native} \) can also be varied (see
227: below) in order to test the effect of changing the strength of the
228: non-local interactions. The energy of the system is thus controlled
229: by three parameters \( \varepsilon  \), \( \varepsilon _{\rm native} \)
230: and \( \varepsilon _{\phi }^{(1)} \). All interaction parameters
231: are taken to be temperature independent in the present study.
232: 
233: Apart from the variable parameters, this energy function differs from
234: the similar function used in \cite{Kay03a,Cle00,Kog01}
235: in two further important ways, 
236: as follows. (1) For \( r^{ij}/r_{0}^{ij}<\sqrt{5/6} \),
237: we set \( \varepsilon _{\rm native}=\varepsilon  \), while for \(
238: r^{ij}/r_{0}^{ij}\geq \sqrt{5/6} \),
239: we set \( \varepsilon _{\rm native}=\varepsilon _{a} \). Then we can
240: vary the native interaction parameter \( \varepsilon _{a} \), in
241: order to test the effect of changing the strength of the non-bonded
242: attractive interactions between residues, while the short-range repulsive
243: part of \( V_{\rm native} \) maintains the self-avoidance of the chain.
244: (2) The value \( q \), which is the smallest number of places along
245: the chain by which two residues can be separated if they are to interact
246: by \( V_{\rm nonnative} \), can be set either to \( q=4 \) (in order
247: to eliminate any double counting of local interactions, in situations
248: where \( \varepsilon ^{(1)}_{\phi } \) is not being varied) or to
249: \( q=2 \) (in order to allow \( \varepsilon _{\phi }^{(1)} \) to
250: decrease without compromising the self-avoidance of the chain).
251: 
252: The equation of motion of each residue is
253: \begin{equation}
254: \label{fdg}
255: m\frac{\partial v^{i}(t)}{\partial t}=F_{\rm conf}^{i}(t)-m\gamma v^{i}(t)+\eta
256: ^{i}(t),
257: \end{equation}
258: where \( m \) is the mass of a residue (set to unity), \( \gamma  \)
259: is the coefficient of friction, \( t \) is time, and \( v^{i}(t) \),
260: \( F_{\rm conf}^{i}(t) \) and \( \eta ^{i}(t) \) represent each of the
261: three components of the velocity, conformational force and random
262: force, respectively \cite{Vei97}. 
263: The random force is given by\begin{equation}
264: \label{fdh}
265: \eta ^{i}(t)=\sqrt{\frac{2m\gamma k_{B}T}{\delta t}}\xi ^{i},
266: \end{equation}
267: where \( \delta t \) is the integration time step and \( \xi ^{i} \)
268: is a random variable taken from a Gaussian distribution with zero
269: mean and unit variance. The most appropriate time scale can be estimated
270: \cite{Vei97} by \( \tau =\sqrt{m_{0}a_{0}^{2}/\varepsilon _{0}} \),
271: where \( m_{0} \), \( a_{0} \) and \( \varepsilon _{0} \) are the
272: mass, length and energy scales, respectively. We set \( m_{0}=m=1 \),
273: \( \varepsilon _{0}=\varepsilon =1 \) and \( a_{0}=4 \)~\AA
274: (the latter is approximately the length of a virtual bond between two
275: residues and is also the range \( r_{\rm rep} \) of the repulsive interaction),
276: and so \( \tau =4 \). 
277: We define the integration time step \( \delta t=0.005\tau  \)
278: and the coefficient of friction \( \gamma =0.05\tau ^{-1} \) in terms
279: of this time scale. The velocity-verlet algorithm 
280: \cite{Kay03a,Vei97,All87} is used to integrate the equations of motion.
281: 
282: 
283: \subsection{Lattice models}
284: 
285: The lattice models considered here are 27mers with a maximally
286: compact native (ground-state) conformation. Details of the models have 
287: been described elsewhere \cite{Kay03c,Kay03d}. 
288: We compare three native-centric 
289: interaction scenarios which have varying degrees, and different mechanisms, of 
290: thermodynamic cooperativity. As an example of a particular native conformation
291: to which these three scenarios can be applied, we choose the one in 
292: Ref.~\cite{Kay03d} with 
293: relative contact order $0.410$. In scenario (i), which corresponds to the 
294: common G\=o model, the native contact interactions are pairwise additive. 
295: In scenario (ii), we add an extra favorable energy $E_{\rm gs}$ for the 
296: native structure as a whole (as defined by equation~5 in Ref.~\cite{Kay03c}).
297: Scenario (iii) introduces, in place of the extra favorable energy, a coupling 
298: between the strength of the contact interaction and the local geometry: two 
299: residues which are in contact in the native state will interact strongly 
300: only when the local geometries of the protein chain around the residues are 
301: the same as those in the native state, as described in 
302: Ref.~\cite{Kay03d}. We 
303: characterize this mechanism as local-nonlocal coupling or ``a cooperative 
304: interplay between favorable nonlocal interactions and local conformational 
305: preferences'' \cite{Kay03d}. In this scenario, the strength of the native
306: contact 
307: interaction is reduced by an 
308: attenuation factor \( a \) when the local geometry is nonnative. The common
309: (uncoupled) G\=o model is 
310: equivalent to \( a=1 \) (no attenuation), while \( a=0 \) implies complete 
311: coupling; $a=0$ is used here. Under scenarios (i) and (iii), the native 
312: state has an energy of $-28$ units, while the extra favorable native
313: energy in scenario (ii) changes the energy of the native state to $-42$ units.
314: Standard Monte Carlo methods are used for conformational sampling 
315: \cite{Kay03c,Kay03d}.
316: The permitted chain moves are end flips, corner flips, crankshafts
317: and rigid rotations. Each attempted move is counted as one simulation time
318: step, irrespective of whether the move is accepted by the Metropolis
319: criterion.
320: \\
321: 
322: 
323: \section{Thermodynamics}
324: 
325: All simulations are performed at constant temperature, with no
326: explicit consideration of pressure. This is because the focus of
327: the present study is protein behavior under atmospheric pressure,
328: and the contribution of a \( PV \) term to protein 
329: energetics is small under these conditions \cite{Cha00}.  
330: Therefore, for our present
331: purposes, we can
332: consider the Helmholtz and Gibbs free energies to be equivalent. 
333: 
334: \subsection{Calculation of the heat capacity}
335: 
336: The specific heat capacity \( C_{V}(T) \) of the model protein is 
337: given by the standard relation
338: \begin{equation}
339: \label{ffo}
340: C_{V}(T)=\frac{1}{k_{B}T^{2}}\left[ \left\langle E^{2}(T)
341: \right\rangle -\left\langle E(T) \right\rangle^{2}\right] ,
342: \end{equation}
343: where \( k_{B} T \) is the Boltzmann constant multiplied by the absolute
344: temperature, 
345: \( \left\langle X(T) \right\rangle \) denotes the Boltzmann average 
346: of quantity \( X \) at temperature \( T \), and the total energy
347: \( E \) is the sum \( V+E_{K} \) of potential and kinetic energies.
348: We compute the averages by standard histogram sampling techniques 
349: \cite{Kay03a,Cle00}.
350: 
351: In lattice studies, the kinetic energy \( E_{K} \) is not treated. 
352: Therefore, \( E \) in Eq.~(\ref{ffo}) has traditionally been taken, in protein
353: modeling, to be the
354: potential energy term \( V \). This procedure 
355: has often been extended to continuum model studies, in Ref.~\cite{Kay03a} 
356: for example, 
357: although \( E_{K} \) is accessible and well-defined in off-lattice models. 
358: However, $\langle E^2\rangle$ $-$ $\langle E\rangle^2$ $\ne$
359: $\langle V^2\rangle$ $-$ $\langle V\rangle^2$ in general. The two quantities
360: would be equal if $E_K$ were a constant, but that would be unphysical.
361: In this study, we have calculated \( C_{V}(T) \) using Eq.~(\ref{ffo}) both 
362: with $E=V+E_{K}$ and with the substitution $E\rightarrow V$. 
363: The results are not identical: an example is given in Fig.~1.
364: 
365:  Fig.~1 shows that the difference between heat capacity values obtained 
366: using the two methods is small around the transition midpoint \( T_{m} \).
367: This is because any energy added to the system during the unfolding
368: transition contributes mostly to the potential rather than to the kinetic energy. 
369: The difference is less negligible for the ``shoulders'' on either side 
370: of the heat capacity peak. At very low temperatures, including the kinetic 
371: energy contribution can lead to a smaller heat capacity, because the 
372: molecule at this temperature is in a relatively fixed state: nearly all of 
373: the kinetic energy is accounted for by the oscillation of pairs of residues 
374: about the minima of their mutual (bonded or non-bonded) interaction 
375: energies. The potential energy and the kinetic energy associated with 
376: these oscillations both fluctuate, but their sum fluctuates much less, and 
377: so the fluctuations in total energy are smaller than the fluctuations in 
378: potential energy, with the result that the calculated heat capacity is 
379: smaller when \( E_{K} \) is taken into account. Overall, Fig.~1 indicates
380: that while the difference between the heat capacities calculated using the two 
381: different methods is not negligible, it is not drastic. Probably this
382: is because \( E_{K} \), while not invariant, fluctuates much less than 
383: \( V \). For this reason, we do not expect conclusions drawn from previous 
384: calculations of heat capacities \cite{Kay03a}, which used \( V \), to be 
385: changed 
386: greatly by calculations using \( E \). Nonetheless, we do expect a proper 
387: account of the kinetic contributions to protein heat capacities
388: to be important in addressing the contribution of bond vector motions
389: to the heat capacity \cite{Yan97}. 
390: 
391: Since the \( PV \) term is neglected in the present formulation, 
392: \( C_{V}(T) \) is effectively equal to \( C_{P}(T) \), which is generally 
393: measured by calorimetry (and which can be expressed in a form similar to 
394: Eq.~(\ref{ffo}), but with the enthalpy \( H \) taking the place of the 
395: energy \( E \) \cite{Cha00,Kay00a}). 
396: Therefore, we may refer to the quantity computed 
397: using Eq.~(\ref{ffo}) simply as heat capacity. All subsequent heat capacity 
398: curves shown in this article for the continuum models are obtained using 
399: the total energy \( E=V+E_K \).
400: 
401: \subsection{The free energy}
402: 
403: The Helmholtz free energy of the model system is \( F(T) = -k_{B}T\ln Z(T) \), 
404: where \( Z(T) \) is the partition function at temperature \( T\). It 
405: follows that, in the vicinity of the simulation temperature \( T_{\rm sim} \),
406: the Helmholtz free energy of the model system at temperature \( T \), relative to
407: its value at the simulation temperature, may be approximated using the formula:
408: \begin{equation}
409: \label{ffna}
410: \frac{\Delta F(T)}{k_{B}T}=
411: \frac{F(T)}{k_{B}T}-\frac{F(T_{\rm sim})}{k_{B}T_{\rm sim}}
412: =-\ln \left\{ \sum_{i}p(E_{i};T_{\rm sim})\exp 
413: \left( E_{i}\left[ \frac{1}{k_{B}T_{\rm sim}}-\frac{1}{k_{B}T}\right] 
414: \right) \right\}, 
415: \end{equation}
416: where the sum is performed over sets of microstates in different energy
417: ranges $E_i$. \( p(E_{i};T_{\rm sim}) \) is the probability density at 
418: the simulation temperature, and is estimated directly from the Langevin 
419: dynamics simulations.
420: 
421: The inset of Fig.~1 provides an example of \( \Delta F(T) \), showing that 
422: the gradient of the free energy with 
423: respect to $T$ changes rather abruptly around the transition 
424: temperature \( T_{m} \) (vertical dotted lines). The transition
425: temperature \( T_{m} \) corresponds to the temperature at the peak of 
426: the heat capacity curve, which was denoted by \( T_{\rm max} \) in 
427: Ref.~\cite{Kay00a}. 
428: Apparently, below \( T_{m} \), the system spends most time in states 
429: in the vicinity of the bottom of the native basin, and so the 
430: changes in \( F \) with respect to temperature are dominated by the 
431: behavior of these states. However, as the temperature 
432: increases past \( T_{m} \), the system, and therefore the rate of 
433: change of \( F(T) \), starts to be dominated by states near
434: the bottom of the denatured basin. As a result, the
435: gradient of \( F(T) \) changes rather suddenly at \( T_{m} \). 
436: While the \( F(T) \) gradient could never be discontinuous because the
437: model system is finite, the kink at \( T_{m} \) does indicate
438: that the transition is two-state-like, and therefore that it is 
439: similar to a first order phase transition.
440: 
441: \subsection{Thermodynamic cooperativity}
442: 
443: The presence of a peak in the heat capacity at a transition temperature
444: \( T_{m} \), as in Fig.~1, indicates that the folding/unfolding transition
445: possesses a degree of thermodynamic cooperativity. As our group has
446: argued, the degree of thermodynamic cooperativity in protein models 
447: can be quantified by the ratio 
448: \( \kappa_{2}=\Delta H_{\rm vH}/\Delta H_{\rm cal} \) of the van't Hoff 
449: enthalpy \( \Delta H_{\rm vH} \) to the calorimetric enthalpy 
450: \( \Delta H_{\rm cal} \) of the transition. This ratio is closely
451: related to that determined experimentally by differential scanning
452: calorimetry \cite{Pri74}. In model studies, the calorimetric enthalpy 
453: $\Delta H_{\rm cal}$ may be determined from an integral of the heat 
454: capacity across the transition region,
455: \begin{equation}
456: \label{ffua}
457: \Delta H_{\rm cal}=\int dT C_{P}(T),
458: \end{equation}
459: while the van't Hoff enthalpy is equal to twice the maximum standard
460: deviation of the enthalpy distribution at the transition midpoint,
461: \begin{equation}
462: \label{ffub}
463: \Delta H_{\rm vH}=2\sqrt{k_{B}T_{m}^{2}C_{P}^{\rm max}},
464: \end{equation}
465: where \( C_{P}^{\rm max} \) is the peak value of the heat capacity. We
466: have followed standard usage in this section by expressing \( \kappa _{2} \)
467: in terms of \( H \) and \( C_{P} \). However, as mentioned in the
468: previous section, simulations produce values for \( E \) and \( C_{V} \),
469: which for the present application are essentially equivalent 
470: to \( H \) and \( C_{P} \). 
471: 
472: As has been pointed out \cite{Kay00a}, comparison of simulation heat capacity 
473: scans to experiment is often complicated by the fact that the heat capacity
474: tails which we observe in simulations would, if they occurred in a real 
475: system, be swamped by the solvent contribution and ignored by the
476: common procedure of using empirical baseline subtraction to calculate
477: $\Delta H_{\rm vH}/\Delta H_{\rm cal}$. In other words, tail contributions 
478: that arise from conformational transitions may be masked by solvent 
479: contributions in real data analysis \cite{Kay00a,Kay02}. Therefore, for
480: completeness,
481: we also perform empirical baseline subtractions on our simulated heat 
482: capacity scans, producing a revised ratio \( \kappa^{({\rm s})}_{2} \) 
483: (defined in Ref.~\cite{Kay00a}) to facilitate comparison with experiment.
484: 
485: \subsection{Radius of gyration}
486: 
487: The radius of gyration \( R_g \) of a particular conformation of the
488: protein is an indicator of its compactness. It is defined by
489: \begin{equation}
490: \label{ffv}
491: R_g^{2}=\frac{1}{N}\sum _{i=1}^{N}\left| r_{i}-\left\langle r\right\rangle \right| ^{2},
492: \end{equation}
493: where \textbf{\( N \)} is the number of residues, \( r_{i} \)
494: is the position of the \( i \)th residue, and $\langle r\rangle$ is the
495: average position (centroid) of the given conformation. The Boltzmann
496: average \( \left\langle R_g\right\rangle =\left\langle \sqrt{R_g^{2}}
497: \right\rangle \) over a 
498: given conformational ensemble is obtained by standard histogram techniques.
499: Two-state-like behavior requires a steplike sigmoidal change in 
500: \( \left\langle R_g\right\rangle  \) upon denaturation at \( T_{m} \), with
501: little postdenaturational expansion of the chain \cite{Kay00a}.
502: 
503: \section{Results and discussion}
504: 
505: To study the effect of local vs. nonlocal interactions, we first
506: vary the strength $\varepsilon_\phi^{(1)}$ of the local interactions while 
507: keeping the strength of the nonlocal interactions fixed in the continuum CI2 
508: construct (Figs.~2, 3). The heat capacity scans, for four scenarios (four
509: models) with different values for $\varepsilon_\phi^{(1)}$, are shown in Fig.~2.
510: They 
511: all exhibit a fairly sharp peak except for the
512: model with $\varepsilon_\phi^{(1)}=0.25$. The
513: heat capacity peak signifies substantial heat absorption within 
514: a narrow temperature range at the folding/unfolding transition. The absorbed
515: energy propels the chain from its low-energy folded conformations (native 
516: ensemble) to its high-energy unfolded conformations (denatured ensemble). 
517: However, the mere existence of a relatively sharp peak in the heat capacity 
518: function does not necessarily mean that the transition is as cooperative as 
519: those observed in small single-domain proteins. Coil-globule transitions in
520: homopolymers 
521: are not two-state-like, but their calorimetric heat capacity scans can have 
522: very sharp peaks \cite{Tik94}. A more quantitative measure of 
523: thermodynamic cooperativity is the traditional calorimetric two-state 
524: criterion (see above), which has emerged recently as a powerful modeling 
525: tool \cite{Cha00,Kay00a,Kay00b,Kay02,Kay03a,Cri02,Jan02,Pok03,Cle03}. 
526: Ratios of van't Hoff to calorimetric enthalpy were 
527: calculated for the four models as described above; the ranges of the temperature 
528: integrations used in the determination of $\Delta H_{\rm cal}$ [Eq.~(\ref{ffua})]
529: were taken to be equal to the ranges shown in Fig.~2.
530: 
531: The inset of Fig.~2 shows that the $\Delta H_{\rm vH}/\Delta H_{\rm cal}$ 
532: ratio (diamonds) of these models is only weakly dependent on
533: $\varepsilon_{\phi}^{(1)}$ over an extended range of 
534: $\varepsilon_\phi^{(1)}$ values, but that the ratio is significantly smaller
535: when the local interactions are 
536: substantially weaker, at $\varepsilon_\phi^{(1)}=0.25$, than the nonlocal 
537: interactions. As discussed above, quantitative comparisons between 
538: simulated and experimental $\Delta H_{\rm vH}/\Delta H_{\rm cal}$ values 
539: require the introduction of model calorimetric baselines \cite{Kay00a} 
540: similar to 
541: those employed in the interpretation of experimental data. Traditionally, 
542: experimental baselines are designed to remove solvation contributions 
543: (temperature-dependent effective interactions), in order to extract the heat
544: capacity effects 
545: associated with the folding/unfolding transition itself \cite{Pri74}. 
546: The present 
547: models do not contain temperature-dependent interactions. Therefore, the 
548: heat capacity contributions eliminated by the model calorimetric 
549: baselines in Fig.~2 can only originate from vibrational motions and 
550: conformational transitions. Increasingly, it is being recognized 
551: \cite{Kay00a,Yan97,Dra02} 
552: that similar heat capacity contributions from bond vector motions and 
553: more collective conformational transitions might also be ``hidden'' below 
554: traditional baselines constructed for analyzing experimental calorimetric data,
555: although the magnitude of such contributions needs to be elucidated.
556:  The three models in Fig.~2 that are relatively more cooperative
557: (with higher $\kappa_2$ values) all have modified 
558: $\Delta H_{\rm vH}/\Delta H_{\rm cal}$ values ($\kappa_2^{({\rm s})}$,
559: circles in the inset), after empirical baseline subtractions, that are very 
560: close to unity. (We note that the recent determination of 
561: $\Delta H_{\rm vH}/\Delta H_{\rm cal}$ values in an all-atom G\=o model
562: \cite{Cle03}
563: involved baseline subtractions as well: c.f. Fig.~8 of Ref.~\cite{Cle03}.) 
564: 
565: However, a protein chain model's ability to attain a near-unity 
566: $\Delta H_{\rm vH}/\Delta H_{\rm cal}$ ratio after baseline subtractions 
567: does not by itself imply that its thermodynamic behavior is similar to that of 
568: real, small single-domain proteins \cite{Kay00a,Kay00b}. This is because the 
569: heat capacity contributions discarded by certain baselines can actually be
570: symptoms of significant deviations from two-state-like behavior. It has been
571: recognized \cite{Kay00a} that, to clarify this situation, we can use the
572: behavior of the 
573: average radius of gyration $\langle R_g \rangle$ of a protein chain model as
574: an
575: additional evaluation criterion for the model's thermodynamic cooperativity.
576: Small angle X-ray scattering (SAXS) experiments have demonstrated that the
577: average 
578: radius of gyration $\langle R_g \rangle$ of several small single-domain proteins
579: behaves 
580: in an apparently two-state manner \cite{Sos92,Hag98,Mil02}, 
581: showing very little 
582: postdenaturational ($T>T_{m}$) expansion of the chain outside the 
583: transition regime that corresponds to the region of the heat capacity
584: peak. We require chain models 
585: of small single-domain proteins to exhibit similar behavior \cite{Kay00a}. 
586: Now, to further assess the four models in Fig.~2 with different 
587: local interaction strengths, we calculate their average radii of
588: gyration as a function of temperature (Fig.~3). To 
589: ensure adequate sampling, $\langle R_g\rangle$ for each model is obtained 
590: from three different simulation temperatures; the results are thus 
591: displayed as three discontinuous curves. Despite some minor discrepancies
592: (owing to sampling uncertainties) between parts of the $\langle R_g\rangle$ 
593: function deduced from different simulation temperatures for the 
594: $\varepsilon_\phi^{(1)}=0.25$ case, the general trend in Fig.~3 is very
595: clear. Models with weaker local interactions are less cooperative 
596: in that their $\langle R_g\rangle$ curves show more postdenaturational increase
597: than do those of models having stronger local interactions. For instance, 
598: the $\langle R_{g} \rangle$ of the $\varepsilon_{\phi}^{(1)}=0.25$ model
599: increases by $\approx 3.0$~\AA~ between $T \approx 0.82$ (the end of the
600: transition region) and $T \approx 1.11$. In contrast, a similar
601: temperature increase for the 
602: $\varepsilon_\phi^{(1)}=1.00$ model from $T\approx 1.11$ (the end of the
603: transition region) to $T\approx 1.40$ leads to
604: an increase of only $\approx 1.6~$\AA~ in $\langle R_g\rangle$.
605: These observations indicate that two-state-like thermodynamic cooperativity
606: cannot be achieved if the local conformational propensities of a protein 
607: are much weaker than the favorable nonlocal interactions. This confirms a 
608: similar conclusion which was derived recently from a more limited study of a 
609: ``contact dominant model'' \cite{Kay03a}.
610: 
611: We next extend our analysis by applying the same computational
612: procedure to varying the strength $\varepsilon_a$ of the favorable 
613: nonlocal interactions while keeping the strength of the local 
614: interactions fixed. Consistent with the seminal study of G\=o and Taketomi 
615: \cite{Go_78}, Figs.~4 and 5 show that variations in nonlocal $\varepsilon_a$
616: have a more prominent effect on thermodynamic cooperativity than 
617: variations in local $\varepsilon_\phi^{(1)}$. 
618: While the peak heat capacity values for the three models in Fig.~2 with 
619: $\varepsilon_\phi^{(1)}\ge 0.5$ are similar, the peak heat capacity
620: values for the three models in Fig.~4 with $\varepsilon_a\ge 0.5$
621: show a significant monotonic increase with $\varepsilon_a$. In addition,
622: for the $\varepsilon_a=0.5$ model in Fig.~4, the difference
623: between unity and the $\Delta H_{\rm vH}/\Delta H_{\rm cal}$ ratio 
624: after baseline subtraction is not negligible ($\kappa_2^{({\rm s})}=0.91$).
625: Despite these differences, the trends in Figs.~4,~5 are in large measure 
626: similar to those in Figs.~2,~3. In particular, Fig.~4 shows that the model with
627: $\varepsilon_a=0.25$, like the $\varepsilon_\phi^{(1)}=0.25$ case in Fig.~2,
628: has a significantly
629: lower $\Delta H_{\rm vH}/\Delta H_{\rm cal}$ ratio than the other
630: three models considered in the same figure. The 
631: $\langle R_g\rangle$ data in Fig.~5 shows that thermodynamic cooperativity 
632: increases with $\varepsilon_a$, as manifested in a smaller
633: amount of postdenaturational
634: conformational expansion with increasing $\varepsilon_a$; this is comparable
635: to the effect of increasing $\varepsilon_{\phi}^{(1)}$ in Fig.~3.
636: Taken together, 
637: the results in Figs.~2--5 suggest that a high degree of thermodynamic 
638: cooperativity, similar to that in real, small single-domain proteins,
639: requires both strong local and strong nonlocal interactions. Apparently,
640: a high degree of thermodynamic cooperativity is {\it incompatible} with either 
641: a much weakened local conformational preference relative to the favorable 
642: nonlocal interactions ($\varepsilon_\phi^{(1)}\ll \varepsilon_a$) or much 
643: weakened favorable nonlocal interactions relative to the local conformational 
644: preference ($\varepsilon_a\ll \varepsilon_\phi^{(1)}$).
645: 
646: Although three-dimensional G\=o-like models with strong local 
647: and nonlocal interactions appear to satisfy the thermodynamic criterion 
648: of calorimetric two-state cooperativity, it has recently been noted that 
649: they are unable to produce simple two-state folding/unfolding kinetics 
650: \cite{Kay02,Kay03a}.
651: This is because the thermodynamic cooperativity of these models is not
652: sufficiently high. As a result, and in spite of the native-centric nature
653: of the common pairwise additive G\=o-like interactions, kinetic trapping 
654: becomes significant under strongly native conditions, leading to folding 
655: rate slow-downs and chevron rollovers \cite{Kay03b}. More recent lattice 
656: model 
657: investigations indicate that simple two-state folding/unfolding kinetics 
658: require a high degree of thermodynamic cooperativity that may be
659: characterized as ``near-Levinthal'' \cite{Kay03c}, necessitating many-body 
660: interactions beyond those postulated by the common G\=o model 
661: \cite{Kay03c,Kay03d}.
662: 
663: In view of this recent development, and to facilitate the construction and
664: investigation of continuum models that incorporate these new ideas,
665: it is instructive to compare in more detail the thermodynamic behavior of 
666: the common lattice G\=o construct (with only pairwise additive contact
667: energies) with the behavior of models which have many-body interactions and
668: enhanced cooperativity. We also wish to investigate whether results obtained from
669: lattice models supply additional support for the conclusions which we have derived
670: from our continuum model results.
671: To this end, Figs.~6--8 compare three 27mer lattice models. 
672: 
673: Because of their intrinsic restrictions on conformational possibilities,
674: it is more straightforward to construct cooperative lattice models 
675: than to construct off-lattice continuum models that are similarly cooperative.
676: Recently, using evidence from kinetic simulations of chevron plots, our group has
677: proposed 
678: that a $\Delta H_{\rm vH}/\Delta H_{\rm cal}$ ratio of $\kappa_2>0.9$ before 
679: baseline subtractions [as for models (ii) and (iii) in Fig.~6a] is likely to be 
680: required in order for a lattice protein chain model to produce chevron 
681: plots with linear regimes similar in extent to those observed for real, small
682: single-domain 
683: proteins \cite{Kay03c}. 
684: However, this numerical criterion is not readily generalizable
685: to off-lattice continuum models. This is because the heat capacity effects
686: of bond vibrations and kinetic energy have to be taken into account in
687: continuum models, whereas these effects are absent in lattice models. Thus, in
688: the characterization of a model's thermodynamic cooperativity, more detailed
689: information concerning, for example, the behavior of the average radius of
690: gyration, has to be relied upon more heavily for continuous models (see above)
691: than for lattice models.
692: 
693:  For the three lattice models studied here, the $\langle R_g\rangle$ plots 
694: in Fig.~6b show that, while the postdenaturational conformational expansion 
695: of the common lattice G\=o construct (solid curve in Fig.~6b) is considerably 
696: milder than that of its continuum counterpart (solid curves in Figs.~3, 5),
697: the more cooperative lattice models with many-body interactions 
698: exhibit much less (dotted curve in Fig.~6b) or nearly non-existent
699: (dashed curve in Fig.~6b) postdenaturational conformational expansion.
700: The fluctuations in $E$ and $R_g$ near the transition midpoint, shown in 
701:  Figs.~7 and 8, indicate further that the transitions between the native 
702: and denatured ensembles are sharper and more two-state-like for the more
703: cooperative models (ii) and (iii) with many-body interactions [parts (b) and (c) 
704: of Figs.~7, 8] than for the common G\=o model [parts (a) of Figs.~7, 
705: 8]. The corresponding fluctuations in the fractional number of native 
706: contacts $Q$ \cite{Kay03c,Kay03d} (data not shown) were also found 
707: to exhibit a trend 
708: very similar to that of the energy fluctuations in Fig.~7.
709: 
710: The lattice model results, shown in Figs.~6--8, are compatible with the
711: conclusion, reached above on the basis of continuum model results, that both
712: local and nonlocal interactions are important for thermodynamic cooperativity.
713: The common lattice G\=o model of scenario (i) includes only nonlocal interactions,
714: analogous to the interactions encoded by $V_{\rm native}$ in the off-lattice
715: model. Scenario (iii), which takes account also of the local geometry of the
716: chain, displays greater cooperativity than scenario (i).
717: 
718: Higher resolution data such as that in Figs.~7 and 8 opens up future 
719: avenues for the assessment of different mechanisms of cooperativity using
720: comparisons between model
721: predictions and experimental measurements of, for example, conformational
722: sizes and fluctuations \cite{Cho02,Shi02,Gol03}. 
723: It is noteworthy that in the model (ii) 
724: scenario, with an extra favorable energy for the native structure as a whole,
725: the native ensemble does not exhibit much energetic or conformational
726: variation (horizontal line segments at low $E$ and low $R_g$ values in Figs.~7b 
727: and 8b). On the other hand, in the model (iii) scenario with local-nonlocal
728: coupling, there is considerable variation in the native ensemble
729: (c.f. low $E$
730: and low $R_g$ fluctuations in Figs.~7c and 8c). Yet the variation
731: in the denatured ensemble is smaller in model (iii) than in model (ii)
732: (c.f. high $E$ and high $R_g$ fluctuations in parts (b) and (c) of 
733:  Figs.~7, 8), resulting in a more two-state-like average $\langle R_g\rangle$
734: transition for model (iii) than for model (ii), manifested in a 
735: near-immediate postdenaturational ($T>T_{m}$) saturation of the dashed 
736: curve for model (iii) in Fig.~6b compared to a more gradual 
737: postdenaturational saturation of the dotted curve for model (ii) in the 
738: same figure. All these differences in conformational properties are in 
739: principle detectable through experiments on real proteins. Hence, future 
740: experimental efforts along the lines suggested here would help to verify 
741: or falsify different proposed scenarios and interaction mechanisms 
742: \cite{Kay03c,Kay03d,Jew03,Cha03} 
743: in the endeavor to decipher the physical origins of 
744: cooperativity in real proteins. 
745: 
746: \section{Conclusions}
747: 
748: The present study suggests strongly that both local conformational
749: preferences and favorable nonlocal interactions are crucial to
750: protein thermodynamic cooperativity. 
751: This result points to a useful constraint on simplified models of protein 
752: molecules: they should take account both of local and of nonlocal
753: interactions. 
754: 
755: As emphasized above, the scope 
756: of the present study is limited. Only native-centric interaction 
757: schemes are considered; and here we have elected only one particular 
758: physically plausible way to classify energy contributions into ``local'' 
759: and ``nonlocal'' terms in the continuum models. 
760: In addition, by using a native-centric interaction scheme both for local and
761: for nonlocal interactions, we have avoided the possibility of energetic 
762: frustration, which might be significant if a more realistic interaction 
763: scheme were used \cite{Pac03,HG03}. To further elucidate the 
764: answers to the questions we have posed, much remains to be investigated.
765: 
766: Nonetheless,
767: our results show clearly that a high degree of thermodynamic cooperativity is
768: compatible neither with a much weakened local interaction
769: nor with a much weakened nonlocal interaction, indicating that both
770: local and nonlocal interactions are important components in protein
771: energetics \cite{Uve02}. This finding is consistent with the notion that
772: a cooperative interplay between local and nonlocal interactions 
773: \cite{Cha00,Kay00b,Kay02,Kay03c,Kay03d} 
774: is a critical ingredient underlying the apparent simple two-state
775: cooperativity of real, small single-domain proteins. 
776: 
777: With regard to G\=o-like
778: native-centric modeling (see, e.g., discussion in 
779: Refs.~\cite{Kay03a,Cle00,Mic99,Plo01,Cie02,Cie03}), 
780: we observe that significant differences in model 
781: predictions can result from different G\=o-like interaction schemes,
782: even though all of the schemes are designed to bias the chain towards the 
783: same native structure. This underscores our point that requiring a
784: consistent account of cooperativity can be a more productive approach
785: to protein modeling than simply designing a model heteropolymer to 
786: fold to a target structure \cite{Kay03a}. 
787: In this context, the present coarse-grained 
788: representations constitute only a first step in the understanding of protein 
789: cooperativity. Ultimately, atomistic origins of local and nonlocal
790: interactions such as sidechain packing \cite{Kli98,Li_01,Fav02,Zho02} 
791: must be taken into
792: account in an effort to provide the necessary physical underpinning 
793: for the cooperative mechanisms proposed here.
794: 
795: 
796: \section*{Acknowledgments}
797: The research reported here was partially supported by the Canadian 
798: Institutes of Health Research (CIHR grant no. MOP-15323), PENCE, 
799: a Premier's Research Excellence Award from the Province of Ontario, and 
800: the Ontario Centre for Genomic Computing at the Hospital for Sick 
801: Children in Toronto. H. S. C. holds a Canada Research Chair in Biochemistry.
802: 
803: 
804: \vfill\eject
805: 
806: \begin{thebibliography}{99}
807: 
808: \bibitem{Cha02} Chan HS, Kaya H, Shimizu S. 
809: In: Jiang T, Xu Y, Zhang MQ, editors. Current Topics in Computational
810: Molecular Biology. Cambridge, MA: The MIT Press; 2002. p 403--447.
811: \bibitem{Jac91}Jackson SE, Fersht AR. Biochemistry 1991;30:10428--10435.
812: \bibitem{Bak00}Baker D. Nature 2000;405:39--42.
813: \bibitem{Cha00} Chan HS. Proteins 2000;40:543--571.
814: \bibitem{Kay00a} Kaya H, Chan HS. Proteins 2000;40:637--661 
815:                       [Erratum: Proteins 2001;43:523].
816: \bibitem{Kay00b} Kaya H, Chan HS. Phys Rev Lett 2000;85:4823--4826.
817: \bibitem{Kay02} Kaya H, Chan HS. J Mol Biol 2002;315:899--909.
818: \bibitem{Kay03a} Kaya H, Chan HS. J Mol Biol 2003;326:911--931.
819: \bibitem{Tak75} Taketomi H, Ueda Y, G\=o N. 
820:                Int J Pept Protein Res 1975;7:445--459.
821: \bibitem{Cle00}
822: Clementi C, Nymeyer H, Onuchic JN. J Mol Biol 2000;298:937--953.
823: \bibitem{Mic99} Micheletti C, Banavar JR, Maritan A, Seno F. 
824:                   Phys Rev Lett 1999;82:3372--3375.
825: \bibitem{Zho99} Zhou Y, Karplus M. Nature 1999;401:400--403.
826: \bibitem{Por01} Portman JJ, Takada S, Wolynes PG. 
827:                  J Chem Phys 2001;114:5082--5096.
828: \bibitem{Hal97}Haliloglu T, Bahar I, Erman B. 
829:                   Phys Rev Lett 1997;79:3090--3093.
830: \bibitem{Kes02} Keskin O, Bahar I, Flatow D, Covell DG, Jernigan RL. 
831:                 Biochemistry 2002;41:491--501.
832: \bibitem{Isi02} Isin B, Doruker P, Bahar I. Biophys J 2002;82:569--581.
833: \bibitem{Mic02} Micheletti C, Lattanzi G, Maritan A. 
834:                  J Mol Biol 2002;321:909--921.
835: \bibitem{Jac01}Jacobs DJ, Radar AJ, Kuhn LA, Thorpe MF. 
836:                 Proteins 2001;44:150--165.
837: \bibitem{Kay03b} Kaya H, Chan HS. Phys Rev Lett 2003;90:258104.
838: \bibitem{Go_78} G\=o N, Taketomi H. Proc Natl Acad Sci USA 1978;75:559--563.
839: \bibitem{Dil90} Dill KA. Biochemistry 1990;29:7133--7155.
840: \bibitem{Abk95} Abkevich VI, Gutin AM, Shakhnovich EI. 
841: J Mol Biol 1995:252:460--471.
842: \bibitem{Cha98} Chan HS. Nature 1998;392:761--763.
843: \bibitem{Bal99}Baldwin RL, Rose GD. Trends Biochem Sci 1999;24:26--33.
844: \bibitem{Por01-2}Portman JJ, Takada S, Wolynes PG. 
845:                  J Chem Phys 2001;114:5069--5081.
846: \bibitem{Deg75}de Gennes PG. J Phys Lett (Paris) 1975;36:L55--L57.
847: \bibitem{Don96}Doniach S, Garel T, Orland H. J Chem Phys 1996;105:1601--1608.
848: \bibitem{Bas97}Bastolla U, Grassberger P. J Stat Phys 1997;89:1061--1078.
849: \bibitem{Doy98}Doye JPK, Sear RP, Frenkel D. J Chem Phys 1998;108:2134--2142.
850: \bibitem{Kog01}Koga N, Takada S. J Mol Biol 2001;313:171--180.
851: \bibitem{Bra67}Brant DA, Miller WG, Flory PJ. J Mol Biol 1967;23:47--65.
852: \bibitem{Vei97}Veitshans T, Klimov D, Thirumalai D. Fold Des 1997:2:1--22.
853: \bibitem{All87}Allen MP, Tildesley DJ. Computer Simulation of Liquids.
854: Oxford: The Oxford University Press; 1987.
855: \bibitem{Kay03c}Kaya H, Chan HS, Proteins 2003; in press.
856: \bibitem{Kay03d}Kaya H, Chan HS, Proteins 2003; in press.
857: \bibitem{Yan97}Yang D, Mok YK, Forman-Kay JD, Farrow NA, Kay LE.
858:                  J Mol Biol 1997;272:790--804.
859: \bibitem{Pri74}Privalov PL, Khechinashvili NN. 
860:                       J Mol Biol 1974;86:665--684.
861: \bibitem{Tik94}Tiktopulo EI, Bychkova VE, Ri{\v c}ka J, Ptitsyn OB.
862:                  Macromolecules 1994;27:2879--2882.
863: \bibitem{Cri02}Crippen GM, Chhadjer M. J Chem Phys 2002;116:2261--2268.
864: \bibitem{Jan02}Jang H, Hall, CK, Zhou Y. Biophys J 2002;82:646--659.
865: \bibitem{Pok03}Pokarowski P, Kolinski A, Skolnick J.
866:                     Biophys J 2003;84:1518--1526.
867: \bibitem{Cle03}Clementi C, Garcia AE, Onuchic JN. 
868:                     J Mol Biol 2003;326:933--954.
869: \bibitem{Dra02}Dragan AI, Privalov PL. J Mol Biol 2002;321:891--908.
870: \bibitem{Sos92}Sosnick TR, Trewhella J. Biochemistry 1992;31:8329--8335.
871: \bibitem{Hag98}Hagihara Y, Hoshino M, Hamada D, Kataoka M, Goto Y.
872:               Fold Des 1998;3:195--201.
873: \bibitem{Mil02}Millet IS, Townsley LE, Chiti F, Doniach S, Plaxco KW.
874:                 Biochemistry 2002;41:321--325.
875: \bibitem{Cho02}Choy WY, Mulder FAA, Crowhurst KA, Muhandiram DR, 
876:                    Millett IS, Doniach S, Forman-Kay JD, Kay LE.
877:                    J Mol Biol 2002;316:101--112.
878: \bibitem{Shi02}Shimizu S, Chan HS. Proteins 2002;49:560--566.
879: \bibitem{Gol03}Goldenberg DP. J Mol Biol 2003;326:1615--1633.
880: \bibitem{Jew03}Jewett AI, Pande VS, Plaxco KW. J Mol Biol 2003;326:247--253.
881: \bibitem{Cha03}Chan HS, Shimizu S, Kaya H. Methods Enzymol 2003: in press.
882: \bibitem{Pac03}Vendruscolo M, Paci E. Curr Opin Struct Biol 2003;13:82--87.
883: \bibitem{HG03}Head-Gordon T, Brown S. Curr Opin Struct Biol 2003;13:160--167.
884: \bibitem{Uve02}Uversky VN, Fink AL. FEBS Lett 2002;515:79--83.
885: \bibitem{Plo01}Plotkin SS. Proteins 2001;45:337--345.
886: \bibitem{Cie02}Cieplak M, Hoang TX. Int J Mod Phys C 2002;13:1231--1242.
887: \bibitem{Cie03}Cieplak M, Hoang TX. Biophys J 2003;84:475--488.
888: \bibitem{Kli98}Klimov DK, Thirumalai D. Fold Des 1998;3:127--139.
889: \bibitem{Li_01}Li L, Shakhnovich EI. 
890:                Proc Natl Acad Sci USA 2001;98:13014--13018.
891: \bibitem{Fav02}Favrin G, Irb\"ack A, Wallin S. Proteins 2002;47:99--105.
892: \bibitem{Zho02}Zhou Y, Linhananta A. J Chem Phys 2002;117:8983--8995.
893: 
894: \end{thebibliography}
895: 
896: \vfill\eject
897: 
898: \noindent
899: {\large\bf Figure Captions}
900: 
901: \noindent
902: FIGURE 1
903: 
904: Heat capacity as a function of temperature, for one particular set
905: of interaction parameters. Solid curve: heat capacity calculated using
906: the total energy; dashed curve: heat capacity calculated 
907: using only the potential energy. 
908: Parameters are \( \varepsilon _{\phi }^{(1)}=1.00 \),
909: \( \varepsilon _{a}=1.00 \), \( q=4 \); 
910: simulation temperature \( T_{\rm sim}=1.02 \).
911: Inset: free energy as a function of temperature, for the same model,
912: showing a sharp change in gradient around \( T_{m} \).
913: The vertical dotted lines in the figure and the inset mark the transition
914: midpoint temperature \( T_{m} \).
915: \\
916: 
917: \noindent
918: FIGURE 2
919: 
920: Heat capacity as a function of temperature, for varying local interaction
921: energy \( \varepsilon _{\phi }^{(1)} \). Other parameters 
922: \( \varepsilon_{a}=1.00 \) and \( q=2 \) are fixed. ({\it From left to right})
923: dotted curve: \( \varepsilon _{\phi }^{(1)}=0.25 \), \( T_{m}=0.74 \); 
924: short dashed curve: \( \varepsilon _{\phi }^{(1)}=0.50 \), \( T_{m}=0.84 \);
925: long dashed curve: \( \varepsilon _{\phi }^{(1)}=0.75 \), \( T_{m}=0.94 \);
926: solid curve: \( \varepsilon _{\phi }^{(1)}=1.00 \), \( T_{m}=1.03 \).
927: These scans are obtained by histogram techniques from simulations performed
928: at $T_{\rm sim}=$ $0.73$, $0.84$, $0.94$, and $1.03$ respectively.
929: The $\Delta H_{\rm vH}/\Delta H_{\rm cal}$ cooperativity coefficients 
930: \( \kappa_2  \) without baseline subtractions are $0.33$, $0.43$, $0.44$, 
931: and $0.44$ respectively. Modified cooperativity coefficients 
932: \( \kappa_2^{({\rm s})} \) after subtraction of the baselines
933: (indicated by thin lines in the figure) for $\varepsilon_{\phi }^{(1)}=$
934: $0.50$, $0.75$, and $1.00$ are $0.97$, $0.98$, and $0.99$ respectively.
935: No value for \( \kappa_2^{({\rm s})} \) was calculated for 
936: \( \varepsilon_{\phi }^{(1)}=0.25 \) because the shape of its heat capacity 
937: curve does not suggest any clear choice of baselines that are intuitively
938: more reasonable than others. The inset shows \( \kappa_2  \) (diamonds) and 
939: \( \kappa_2^{({\rm s})} \) (circles) as functions of 
940: \( \varepsilon_{\phi }^{(1)} \).
941: \\
942: 
943: \noindent
944: FIGURE 3
945: 
946: Average radius of gyration as a function of temperature, for varying
947: local interaction energy \( \varepsilon _{\phi }^{(1)} \); other
948: parameters \( \varepsilon _{a}=1.00 \) and \( q=2 \) are fixed, as in Fig.~2.
949: The correspondence between line styles and \( \varepsilon _{\phi }^{(1)} \)
950: values is identical to that in Fig.~2.
951: For each value of \( \varepsilon _{\phi }^{(1)} \), simulations were
952: performed at three different values of $T_{\rm sim}$ to ensure adequate
953: sampling
954: across the entire temperature range shown. ({\it From left to right})
955: for \( \varepsilon _{\phi }^{(1)}=0.25 \) (dotted curves),
956: \( T_{\rm sim}=0.73 \), $0.93$, $1.13$;
957: for \( \varepsilon _{\phi }^{(1)}=0.50 \) (short dashed curves),
958: \( T_{\rm sim}=0.84, \) $1.04$, $1.24$; 
959: for \( \varepsilon _{\phi }^{(1)}=0.75 \) (long dashed curves),
960: \( T_{\rm sim}=0.94, \) $1.14$, $1.34$; and
961: for \( \varepsilon _{\phi }^{(1)}=1.00 \) (solid curves),
962: \( T_{\rm sim}=1.03 \), $1.23$, and $1.43$.
963: \\
964: 
965: \noindent
966: FIGURE 4
967: 
968: Heat capacity as a function of temperature, for varying nonlocal interaction
969: energy \( \varepsilon _{a} \). Other parameters 
970: \( \varepsilon _{\phi }^{(1)}=1.00 \) and \( q=4 \) are fixed. 
971: ({\it From left to right})
972: dotted curve: \( \varepsilon _{a}=0.25 \), \( T_{m}=0.43 \);
973: short dashed curve: \( \varepsilon _{a}=0.50 \), \( T_{m}=0.65 \); 
974: long dashed curve: \( \varepsilon _{a}=0.75 \), \( T_{m}=0.84 \); 
975: solid curve: \( \varepsilon _{a}=1.00 \), \( T_{m}=1.02 \).
976: These scans are obtained by histogram techniques from simulations performed
977: at $T_{\rm sim}=$ $0.42$, $0.64$, $0.84$, and $1.02$ respectively.
978: The $\Delta H_{\rm vH}/\Delta H_{\rm cal}$ cooperativity coefficients 
979: \( \kappa_2  \) without baseline subtractions are $0.28$, $0.40$, $0.44$, 
980: and $0.46$ respectively. Modified cooperativity coefficients 
981: \( \kappa_2^{({\rm s})} \) after subtraction of the baselines
982: (indicated by thin lines in the figure) for $\varepsilon _{a}=$
983: $0.50$, $0.75$, and $1.00$ are $0.91$, $1.00$, and $0.99$ respectively.
984: No value for \( \kappa_2^{({\rm s})} \) was calculated for 
985: \( \varepsilon _{a}=0.25 \) for the same reason that no 
986: \( \kappa_2^{({\rm s})} \) was provided for 
987: \( \varepsilon_{\phi }^{(1)}=0.25 \) in Fig.~2.
988: The inset shows \( \kappa_2  \) (diamonds) and \( \kappa_2^{({\rm s})} \) 
989: (circles) as functions of \( \varepsilon _{a} \).
990: \\
991: 
992: \noindent
993: FIGURE 5
994: 
995: Average radius of gyration as a function of temperature, for varying
996: nonlocal interaction energy \( \varepsilon _{a} \). Other parameters
997: \( \varepsilon _{\phi }^{(1)}=1.00 \) and \( q=4 \) are fixed, as in Fig.~4.
998: The correspondence between line styles and \( \varepsilon _{a} \)
999: values is identical to that in Fig.~4.
1000: For each value of \( \varepsilon _{a} \), simulations were performed
1001: at three different values of $T_{\rm sim}$ to ensure adequate sampling across
1002: the whole range shown.  ({\it From left to right}) 
1003: for \( \varepsilon _{a}=0.25 \) (dotted curves), \( T_{\rm sim}=0.42 \),
1004: $0.62$, $0.82$;
1005: for \( \varepsilon _{a}=0.50 \) (short dashed curves), \( T_{\rm sim}=0.64 \),
1006: $0.84$, $1.04$; 
1007: for \( \varepsilon _{a}=0.75 \) (long dashed curves), \( T_{\rm sim}=0.84, \)
1008: $1.04$, $1.24$; and 
1009: for \( \varepsilon _{a}=1.00 \) (solid curves), \( T_{\rm sim}=1.02 \),
1010: $1.22$, and $1.42$.
1011: \\
1012: 
1013: \noindent
1014: FIGURE 6
1015: 
1016: Thermodynamic cooperativities of the three representative 27mer
1017: lattice models described in the text. Model definitions and simulation 
1018: details are given in Refs.~[29,30]. Heat capacity (a) and average 
1019: radius of gyration (b) are determined by histogram techniques based 
1020: upon Monte Carlo sampling performed at $T_{\rm sim}=T_{m}$.
1021: (a) Heat capacity of (i) the common G\=o model with pairwise additive 
1022: contact energy (solid curve, {\it left}); (ii) the model that assigns an 
1023: extra favorable energy to the native structure as a whole (dotted curve, 
1024: {\it right}); and (iii) the model with local-nonlocal coupling (dashed curve, 
1025: {\it middle}). The transition temperatures
1026: for the models are $T_{m}=$ $0.701$ (i), $1.13$ (ii), and $0.755$ (iii).
1027: Their $\Delta H_{\rm vH}/\Delta H_{\rm cal}$ without baseline subtractions
1028: are $\kappa_2=$ $0.86$, $0.98$, and $0.99$, and the corresponding ratios
1029: after subtracting the baselines shown are $\kappa_2^{({\rm s})}=$ 
1030: $1.00$, $1.00$, and $1.00$ respectively. (b) Average radius of gyration 
1031: as a function of model temperature for the three models [represented by 
1032: the same line styles as in (a)].
1033: \\
1034: 
1035: \noindent
1036: FIGURE 7
1037: 
1038: Representative trajectories of the three models in Fig.~6 at their
1039: respective transition temperatures.  Variations in the potential energy of models 
1040: (i)--(iii) are shown in (a)--(c) respectively.
1041: \\
1042: 
1043: \noindent
1044: FIGURE 8
1045: 
1046: Same as Fig.~7, except that variations in the radius of gyration are shown here.
1047: 
1048: \end{document}
1049: 
1050: 
1051: