1: \documentclass[aps,10pt,nofootinbib]{revtex4}
2: \usepackage{epsfig,citesort,float, verbatim}
3:
4: \begin{document}
5:
6: \begin{center}
7: {\Large \bf Folding of Cu, Zn superoxide dismutase and\\
8: Familial Amyotrophic Lateral Sclerosis}
9:
10: \vspace*{1cm}
11:
12: {Sagar D. Khare$^*$, Feng Ding$^{\dagger}$ and Nikolay V. Dokholyan$^*$\footnote{Corresponding author, Email: dokh@med.unc.edu}}
13: \vspace*{0.5cm}
14:
15: {$^*$Department of Biochemistry and Biophysics, \\ University of
16: North Carolina at Chapel Hill, \\ School of Medicine, Chapel Hill, NC
17: 27599, USA\\
18: $^{\dagger}$Center for Polymer Studies, Department of Physics,\\ Boston
19: University, Boston, MA 02215, USA}
20:
21: \date{\today}
22: \end{center}
23: \smallskip
24:
25: %\maketitle
26: \vspace*{2cm}
27:
28: \centerline{\Large\bf Summary}
29:
30:
31: {\bf Cu,Zn superoxide dismutase (SOD1) has been implicated in the familial form of the neurodegenerative disease Amyotrophic Lateral Sclerosis (ALS). It has been suggested that mutant mediated SOD1 misfolding/aggregation is an integral part of the pathology of ALS. We study the folding thermodynamics and kinetics of SOD1 using a hybrid molecular dynamics approach. We reproduce the experimentally observed SOD1 folding thermodynamics and find that the residues which contribute the most to SOD1 thermal stability are also crucial for apparent two-state folding kinetics. Surprisingly, we find that these residues are located on the surface of the protein and not in the hydrophobic core. Mutations in some of the identified residues are found in patients with the disease. We argue that the identified residues may play an important role in aggregation. To further characterize the folding of SOD1, we study the role of cysteine residues in folding and find that non-native disulfide bond formation may significantly alter SOD1 folding dynamics and aggregation propensity.}
32: %\maketitle
33: \medskip
34:
35:
36: \noindent {\bf Keywords:} Cu,Zn superoxide dismutase, Familial Amyotrophic Lateral Sclerosis, molecular dynamics simulations, aggregation, misfolding.
37: \clearpage
38:
39: %INTRODUCTION
40: \section{Introduction}
41:
42: ALS is the most common motor neuron disease in human adults
43: that is characterized by selective motor neuron death \cite{Cleveland01, Beckman01, Rowland01, Siddique91}. In
44: approximately 10\% of the cases of ALS, the disease is inherited, thus
45: called Familial ALS (FALS) \cite{Cleveland01}.
46: Mutations in the cytoplasmic enzyme SOD1 were identified
47: as the primary cause of approximately 20\% of FALS cases
48: \cite{Rosen93, Gurney94}. More than 90 mutations have been identified so far.
49: The FALS mutations are scattered throughout the primary sequence and
50: three-dimensional structure of SOD1, which exists as a homo-dimer in
51: native state (each monomer is 153 amino acids in length)
52: \cite{Gaudette00}. One hypothesis to explain the toxic gain-of-function
53: of the mutants is that the toxicity is derived from intracellular
54: aggregates and/or failure of SOD1 degradation, which is supported by the observation that in both mice and
55: cell culture models, death of mutant neurons is preceded by formation
56: of cytoplasmic aggregates containing mutant SOD1 \cite{Johnston00, Bruijn98, Durham97}. In addition, SOD1 knockout mice do not develop motor neuron disease \cite{Reaume96}.
57: Aggregation or misfolding is therefore a characteristic of
58: SOD1 mediated FALS. Toxicity may arise through (i) aberrant chemistry
59: due to misfolded mutants \cite{Bromberg99,Cleveland99, Elliot01}, (ii) saturation
60: of essential cellular machinery such as chaperones and proteasome
61: components \cite{Tran99, Bruening01}, or (iii) oligomeric forms of the aggregate may themselves be toxic to the cells, as has been suggested for other
62: neurodegenerative diseases \cite{Walsh02}.
63:
64: Aggregation of proteins is mediated by a variety of factors including
65: native state stability \cite{Chiti00, Canet02}, \(\beta\)-sheet propensity \cite{Ding02b}, net charge and overall hydrophobicity \cite{Chiti02a}. In addition, folding and aggregation are kinetically competing processes \cite{Chiti02b, Ding02b, Dobson99}. Thus, it is crucial to study the folding thermodynamics and kinetics of SOD1 in order to understand the causes of SOD1 misfolding, aggregation and toxicity. A microscopic picture of SOD1 folding and/or aggregation dynamics of SOD1 is not available from either theory or experiment. We reconstruct the SOD1 folding mechanism from molecular dynamics simulations.
66:
67: Neither the structure of SOD1 aggregates nor the mechanism of
68: aggregation is fully understood. Several experimental studies have
69: characterized the folding thermodynamics of SOD1 and the associated
70: FALS mutants \cite{Hallewell91, McRee90, Parge92, Lepock90, Stroppolo00,
71: Mei92, Rodriguez02}. Lepock et al. \cite{Lepock90} and Rodriguez et
72: al. \cite{Rodriguez02} measured the stability of mutants in differential
73: scanning calorimetry studies of the irreversible unfolding of the
74: distinctly metallated species of the enzyme, and showed that mutants
75: are destabilized relative to the wild type. In addition, a
76: crystal structure of the G37R mutant \cite{Hart98} shows higher atomic
77: displacement parameters (B-factors) for the side-chains compared to
78: the wild type, but the backbone conformation is not significantly
79: different, thus indicating greater molecular flexibility in some
80: portions of the structure. It was recently shown that the for some mutants, the apo-state (without metals) of the mutants is markedly destabilized compared to that of the wild-type, while the holo-state (with metals) of both mutants and wild-type is not significantly affected \cite{Lindberg02}. It is, therefore, possible that the apo-state of SOD1 --- the subject of our studies --- is implicated in misfolding and aggregation.
81:
82: Most Cu, Zn superoxide dismutases, including human SOD1, have been shown to undergo irreversible aggregation on exposure to temperatures higher than their respective melting temperatures. The presence of {\em free} cysteines --- those that do not form disulfide bonds in the native state --- is known to be involved in this phenomenon. It is well-established that improper disulfide bond formation, concomitant with cysteine oxidation, possibly enhanced by metals in SOD1, is a cause of heat-denatured aggregation \cite{Perry87}. In the case of bovine SOD1, substitution by site-directed mutagenesis of free cysteine residues was found to greatly increase the reversibility of denaturation without substantially affecting the conformational stability \cite{McRee90, Lepock90}. Yeast SOD1 \cite{Lepock85} and {\em E. coli} SOD1 \cite{Battistoni98}, neither of which contain free cysteines, show partial and full reversibility respectively upon denaturation. However, SOD1 from {\em P. leiognathi} also denatures irreversibly despite having no free cysteines \cite{Bourne96}. Clearly, the presence of free cysteines is one of the crucial factors (but not the only factor) responsible for irreversible denaturation. One plausible reason for the effect of free cysteines is that the formation of improper disulfide bonds makes SOD1 more prone to kinetic traps. Therefore, we also study the effect of disulfide bonds on SOD1 folding reversibility.
83:
84: An atomic resolution simulation of the folding process using traditional molecular mechanics force-fields is difficult
85: by direct computational approaches because of the
86: vast dimensionality of the protein conformational space \cite{Karplus94a}.
87: Simplified models such as the G\={o} model
88: \cite{go81, Abe81} provide a
89: powerful alternative to study folding because of their ability to
90: simulate folding on reasonable time scales and to
91: reproduce the basic thermodynamic and kinetic properties of
92: proteins both on- \cite{Shimada01, Clementi00, Abkevich94} and off-lattices \cite{Dokholyan98b,
93: Dokholyan02b}. In the G\={o} model, the energy of the protein is expressed as a sum of pairwise native contact energies. A native contact exists between amino acids if they are closer to each other than a cut-off distance in the native state. Thus, in the G\={o} approximation, folding can be construed as a transition from a state with no or a small number of native contacts to a state with all native contacts.
94: Despite their apparent simplicity, native topology-based approaches
95: have yielded results in agreement with experiments, in particular for small proteins (\(<100\) amino
96: acids) \cite{Borreguero02, Ding02a} reflecting the underlying simplicity in the folding of these proteins. However, a G\={o} model of longer proteins may lead to discrepancy in protein folding dynamics between simulation and experiment because sequence-specific interactions may significantly alter the protein folding dynamics. Thus, we develop methodology to incorporate sequence specific information of the relative contribution of various amino-acids into the G\={o} model of SOD1.
97:
98:
99: %RESULTS
100: \section{Results}
101:
102: We perform DMD simulations using the unscaled G\={o} model, to determine the thermodynamic properties of a model
103: of SOD1 (as described in Methods). The temperature dependence of
104: average potential energy is shown in Fig.~1a. There is a sigmoidal increase in the
105: potential energy with increasing temperature. At low temperatures, SOD1
106: is present mostly in its native state, as evidenced by the small ($\leq 2.
107: 83~$ \AA) root-mean-square deviation, $RMSD$, from the native state (Fig.~1b), and the small
108: fluctuations in the radius of gyration, \(R_{g}\), around its native state value (14.2 \AA) (data not shown).
109: At high temperatures, SOD1 is unfolded with the $RMSD$ from the native state being greater than 40 \AA, demonstrating the loss of any structural similarity with the native state.
110: Also, $R_g$ is approximately three times its value in the native state (data not shown), indicating that the average distance between any two amino acids is approximately three times that in the native state.
111:
112: In a typical trajectory (Fig.~2a) at the temperature corresponding to the mid-point of the abrupt change in potential energy (Fig.~1a) dependence on temperature (defined as the the folding
113: transition temperature $T_F$), we observe three distinct states: folded, unfolded and intermediate. A histogram of potential energies of
114: protein conformations near \(T_{F}\) is, therefore, tri-modal (Fig.~2c), instead of the
115: expected bi-modal for a two-state protein, indicating the presence of
116: a folding intermediate. The detection of the intermediate state is in contrast with the experimental observation of the two-state folding dynamics of the SOD1 monomer \cite{Stroppolo00}. Thus, the {\em unscaled} G\={o} model for SOD1 does not accurately reproduce the folding dynamics of SOD1.
117: Next, we calculate the pairwise interaction energies in the native state of SOD1 monomer and then employ the {\em scaled} G\={o} model, to uncover SOD1 folding thermodynamics and kinetics.
118:
119: {\bf A few contacts make a large thermodynamic contribution to SOD1 native state stability.} We decompose the stability of SOD1 into pairwise contributions (as described in Methods) that are plotted in
120: the lower triangle in Fig. 3 (the energy map).
121: In the energy map, some contacts, corresponding to both short-range and long-range interactions, have significantly stronger CHARMM interaction energies (from three to five times) than average.
122: We find that there are 16 long-range interactions that have large (about five times larger than the average) interaction energies: Lys3-Glu21, Lys3-Glu153, Lys30-Glu100, Ser34-Asp96, Lys36-Asp92, Glu40-Lys91, Glu49-Arg115, Arg69-Glu77, His71-Asp83, Arg79-Asp83, Arg79-Asp101, His80-Asp83, Asp101-Val103, Glu121-Ser142, Asp125-Lys128, Glu133-Lys136.
123: The residues involved in the strong long-range contacts are highlighted in Fig.
124: 4 and are involved in contacts between oppositely charged amino
125: acid pairs on the surface of the protein. Out of the 16
126: identified interactions, 6 have been found to be directly disrupted in patients
127: with FALS as a result of point mutations (listed in Table I) in the residues making these contacts \footnote{http://www.alsod.org}. In most cases, the mutations
128: correspond to the change of a charged amino acid for an
129: oppositely charged or a neutral one, suggesting an important role
130: these key contacts play in SOD1 folding.
131:
132: {\bf Scaling thermodynamically important contacts modulates folding kinetics in the G\={o} model.} We test the importance of the identified amino acids in DMD
133: simulations by scaling their contact strength in the G\={o} potential. We increase the
134: depth of the interaction well of the identified key contacts to five times their original
135: value in the {\em unscaled} G\={o} model ($b=5$).
136: Using this {\em scaled} G\={o} model,
137: we perform DMD simulations to calculate the thermodynamic properties of the SOD1.
138: We find that the potential energy, $RMSD$ and $R_g$ (data not shown) exhibit the same sigmoidal trend as
139: for the {\em unscaled} G\={o} model, but the folding transition temperature increases to approximately 1.1
140: from its earlier value of 1.0. Remarkably, a typical
141: trajectory near the folding transition temperature ($T_F=1.15$) shows (Fig.~2b) that the intermediate state is no longer populated and the
142: protein only exists in either the folded or the unfolded state (Fig.~2b and d) near $T_F$.
143: Thus, by strengthening these {\em key contacts} we reproduce the experimentally observed two state folding of SOD1 monomer \cite{Mei92, Stroppolo00}. As control, we randomly select several combinations of sixteen contacts and scale them by the same procedure. In such control simulations, we do not observe such effect on the folding kinetics (Fig.~2b), thus demonstrating that the effect of strengthening the identified contacts is specific.
144: Importantly, this result uncovers a crucial connection between the folding thermodynamics and kinetics
145: of SOD1: in our model, the residues that contribute the most to the SOD1 thermodynamic stability of the native state are also crucial for keeping the folding kinetics two-state.
146:
147: {\bf Improper disulfide bonds cause kinetic trapping of SOD1.} Human SOD1 is known to aggregate
148: irreversibly on heat denaturation \cite{Rodriguez02} {\em in vitro}. One of the reasons for the irreversibility is improper disulfide bond formation
149: between free cysteines. There
150: are four cysteine residues in the SOD1 monomer, at positions 6, 57, 111 and
151: 146 of which residues 57 and 146 form a disulfide linkage in the native state. We study the effect of non-native disulfide bond formation
152: by starting from a fully unfolded conformation,
153: generated at T=2.3 (\(\gg T_{F}\)), and cooling the system to low
154: temperature T=0.4 (\(\ll T_{F}\)) slowly, while allowing the possibility of interactions
155: between any of the four cysteines to form improper disulfide
156: bond(s). We model disulfide bonds by an attractive potential ($b=2$, $\gamma=-1$ in Eq.~(2)). As a
157: control, we repeat the annealing procedure without these non-native
158: disulfide bonds ($b=1$, $\gamma=1$ in Eq.~(2)). We observe that out of 20 trajectories with improper
159: disulfide interactions, 4 result in SOD1 structures that are {\em trapped} in non-native conformations
160: while all of the 20 control trajectories fold to the correct native structure on the same time scale, implying that occurrence of improper disulfide linkage increases the number of kinetic traps for the SOD1 chain {\em en route} to its native state. The above observation is statistically significant (the probability of observing this result by chance: $p$-value $\approx$0.01). The increased number of kinetic traps makes intermediate states more populated and interaction of these intermediate states may make SOD1 prone to aggregation. We conclude that free cysteine residues play an important part in the kinetics of folding, and possibly, aggregation.
161:
162: \section{Discussion}
163:
164: Experimental evidence suggests that the unfolding of dimeric
165: superoxide dismutase occurs in a three state equilibrium \cite{Mei92, Stroppolo00}.
166:
167: \begin{equation}
168: D{\rightleftharpoons}2M{\rightarrow}U
169: \label{eq:sucks}
170: \end{equation}
171: where D, M and U are SOD1 dimer, monomer and unfolded state respectively.
172: In order to understand the effect of mutations on SOD1 folding, it is
173: important to study the effect of mutation on both processes in Eq.~(\ref{eq:sucks}). Here, we study the second process (\(M{\rightarrow}U\))
174: and report thermodynamically and kinetically important residues
175: for SOD1 monomer folding.
176:
177: Using our hybrid method, which incorporates molecular mechanics energetics into the G\={o} model, we identify the key residues involved in the
178: folding of the protein. Surprisingly, the key residues are found on the
179: surface of the protein and not in its hydrophobic core (Fig.~4). By comparing
180: the identified residues with a comprehensive list of mutations associated with FALS
181: \footnote{http://www.alsod.org}, we find that, 6 out of the 16 contacts
182: identified in our simulations are found to be implicated in the
183: disease. In Table I, we list those FALS mutations, that
184: disrupt contacts identified from our simulations. We find that these mutations
185: replace a charged residue in the wild-type protein to a neutral or
186: oppositely charged residue in the mutant, thereby disrupting the electrostatic interaction.
187: The presence of mutations disrupting these contacts further underscores the
188: importance of these key contacts and indicates that the disruption of
189: even one of these contacts may potentially result in protein
190: misfolding.
191:
192: There have been no experimental studies characterizing SOD1 monomer folding kinetics. However, other members of the immunoglobulin-like (Ig-like) superfold family \cite{Murzin95}, to which SOD1 belongs, are experimentally well-characterized. Clarke and colleagues have established that the folding of pathways of Ig-like proteins share some common features \cite{Clarke99}. They observe that for five structurally similar Ig-like proteins, the folding rates are correlated with thermal stabilities, suggesting a similar folding pathway for all members of the family. Thus, for Ig-like proteins, the kinetic importance of residues in folding ($\phi$-values \cite{Fersht95}) is largely determined by the topology of the fold. We, therefore, expect that the residues important for SOD1 folding are in structurally equivalent positions to other members of the Ig-like super-fold family, whose folding kinetics is experimentally well-studied \cite{Clarke99}. We choose Tnf3, the fibronectin type III domain of human tenascin, for which, based on $\phi$-values, eight residues in the hydrophobic core were identified as part of the folding nucleus \cite{Hamill00}. We identify residues in SOD1 that are structurally equivalent to the nucleus residues in Tnf3 by performing a structural alignment of the two proteins using DALI \cite{Holm95} and list these residues in Table II. We discover that two of the residues corresponding to this putative nucleus correspond to the strongly interacting key residues identified by our simulations, while three others are nearest neighbors of other residues we identify. This is significant because (i) mutations in the nucleus residues, being in proximity to the identified key residues, may disrupt the network of interactions of key residues and affect folding kinetics and (ii) in the G\={o} model entropic contributions are included as effective energetic interactions. Therefore, the identified key residues may be kinetically crucial because of the entropic contribution of the corresponding region of the structure (which is reflected by high $\phi$-values of the neighboring nucleus residues). Therefore, kinetic evidence further supports our hypothesis that residues important for SOD1 native thermodynamics are important for folding kinetics.
193:
194: Richardson and Richardson have pointed out \cite{Richardson02} that the interactions between
195: edges of \(\beta\)-strands are crucial to regulate protein aggregation propensity.
196: Naturally occurring \(\beta\)-sheet
197: proteins prevent aggregation by protecting the edges of
198: \(\beta\) strands through a multitude of fold-specific mechanisms. In
199: particular, for Ig-like \(\beta\)-sandwiches,
200: strategically placed side-chain charges at the edges and
201: loop-crossing are dominant mechanisms which shield the edges of the
202: sheets. A recently solved crystal structure of mutant apo-SOD1 supramolecular assembly \cite{Elam03} identified three regions of the SOD1 structure where non-native gain of interaction was observed leading to amyloid like arrangement of SOD1 in crystals:
203: the cleft between strands S5 and S6 (residues 83-100),
204: the zinc loop (residues 65-78) and the electrostatic loop (residues 125-142). It was noted that disorder in the zinc and electrostatic loops promoted non-native interactions with the S5-S6 cleft, which is the interface between the two $\beta$-sheets of SOD1 and hence a likely site for edge-to-edge aggregation \cite{Richardson02}. We find that most of the key contacts for proper folding are made by residues in the three regions identified above. In particular, we see that contacts on both the edge-strands S5 and S6 play a crucial role in folding kinetics. The contacts made by residues Asp92, Asp96 and Lys91 which form the turn in the S5-S6 hairpin are crucial to seal the interface between the two $\beta$-sheets and likely prevent gain of non-native interactions, as shown in Fig.~5.
205: Thus, our simulations constitute another piece in the growing body of evidence which implicates edge protection as the dominant mechanism by which natural proteins avoid aggregation. We postulate that the disruption of key interactions is a plausible
206: scenario for mutant mediated
207: aggregation of SOD1. The re-organization of the S5-S6 edge may be the dominant feature which distinguishes the aggregation-prone misfolded state of SOD1, from the native state. The disruption of charge-charge interactions of
208: the side-chains of the identified key residues, which stabilize the $\beta$-barrel architecture of SOD1, might expose the edges of \(\beta\)-strand main-chain to interact with other structures, thus leading to
209: aggregation. Wang and Hecht \cite{Wang02} have shown that strategically designed mutations in \(\beta\)-strand
210: overhangs disfavor aggregation of normally
211: aggregating proteins, further supporting our hypothesis.
212:
213: We show that the presence of oxidisable cysteines enhances the
214: kinetic trapping during folding. In the presence of metals, cysteine
215: oxidation is enhanced suggesting a plausible cause
216: for the misfolding of SOD1. However, it has recently been suggested \cite{Lindberg02} that
217: misfolding of SOD1 likely takes place in the apo-enzyme state, when the metals have not yet been delivered to SOD1. In the reducing environment of the cell, cysteine-mediated misfolding/aggregation of the apoenzyme is, therefore, not a likely mechanism of SOD misfolding/aggregation. However, our observations do suggest a generic mechanism for SOD1 misfolding especially for mutations that do not affect the native state conformational stability \cite{Rodriguez02}. In the scenario similar to the one we have presented for free cysteines, other mutations may affect the folding kinetics of SOD1 by introducing non-native interactions along the folding pathway, leading to kinetic trapping without substantially affecting the native state stability.
218:
219: %METHODS
220: \section{Methods}
221: {\bf The {\em unscaled} G\={o} model.} We use the Discrete Molecular Dynamics (DMD) algorithm \cite{Dokholyan98b, Zhou97, Smith97} to study the folding thermodynamics of SOD1. DMD has recently been used to study the folding kinetics \cite{Borreguero02, Ding02a, Zhou99a, Zhou99b} and aggregation of proteins \cite{Ding02b, Smith01}. We model the SOD1 chain by the beads-on-a-string model developed by Ding et al. \cite{Ding02a} with beads corresponding to all $C_{\alpha}$ and $C_{\beta}$ atoms and constraints between neighboring beads to mimic real protein flexibility. We study the folding of one monomer derived from the crystal structure of SOD1 (Protein data bank access code: 1SPD). We use the G\={o} potential \cite{go81} to model the interaction energy, $V_{ij}$, between $C_{\beta}$ atoms ($C_{\alpha}$ for Gly) $i$ and $j$ of SOD1:
222: \begin{eqnarray}
223: V_{ij} \, = \, \left \{
224: \begin{array}{ll}
225: +\infty, & \mbox{ $|r_i-r_j| \leq r_0$ }\\
226: \gamma u_{ij}, & \mbox{ $r_0<|r_i-r_j| \leq r_1$ }\\
227: 0, & \mbox{ $|r_i-r_j| > r_1$ }
228: \end{array}
229: \right. \, ,
230: \label{eq:potential}
231: \end{eqnarray}where $|r_i-r_j|$ is the distance between atoms $i$ and $j$. The parameters $r_0$ and $r_1$ are the hard-core diameter (set to 3.25 \AA) and the cut-off distance (7.5 \AA) between $C_{\beta}$ atoms ($C_{\alpha}$ for glycine) in the native state respectively. We assign attractive interaction for native contacts ($\gamma= -1$), defined to exist between pairs of residues whose $C_{\beta}$ atoms ($C_{\alpha}$ for glycine) are closer than 7.5 \AA ~in the native state, and repulsive interaction ($\gamma= 1$) for non-native contacts, defined to exist between $C_{\beta}$ atoms ($C_{\alpha}$ for glycine) which are farther than 7.5 \AA ~in the native state.
232:
233: The depth of the attractive square-well $u_{ij}$ in Eq.~(\ref{eq:potential}) determines the strength of the attractive interaction between two amino acids. First,
234: we consider the case where all contacts are equally strong, i.e. the
235: depth of each attractive well is $u$. We call this case the {\em unscaled} G\={o}
236: model and use it to perform DMD simulations of SOD1. We find that our model undergoes a collapse transition to its native state through a meta-stable intermediate (see Results) in contrast with the experimental observation of two-state folding dynamics of SOD1 monomer \cite{Mei92, Stroppolo00}. A possible reason for the existence of meta-stable intermediates is that in the unscaled G\={o} model, all native contacts have equal strength. While such an approximation holds for smaller proteins (such as Src SH3 \cite{Borreguero02}) with relatively low number of contacts ($\sim162$), SOD1 monomer has 495 native contacts and the assumption of equal contact strength overestimates the population of intermediate species. Thus, in order to include sequence specificity of native interactions, we increase the strength of few specific native contacts
237: according to an all-atom model of SOD1 described below by making the depth of the
238: attractive well of some chosen native contacts $bu_{ij}$, where $b$ is the scale factor. We
239: call this the {\em scaled} G\={o} model. By scaling G\={o} potential according to an all atom model of the protein, we
240: effectively include the sequence information.
241:
242:
243: {\bf The {\em scaled} G\={o} model.} To determine the relative strength of each individual native contact in SOD1, we use an effective energy function to decompose the total energy of the protein as a sum of pairwise terms \cite{Paci02}:
244: \begin{equation}
245: E = \sum_{i} \sum_{j>i} E_{ij}\, ,
246: \end{equation} where the summation is taken is over all pairs of residues of SOD1 and \(E_{ij}\) is the energy of interaction between residues $i$ and $j$. To determine the terms $E_{ij}$, we perform simulations with the molecular mechanics force field CHARMM \cite{Brooks83} to construct an all
247: atom model of the SOD1 monomer in the native state, starting from its crystal
248: structure (PDB accession code:1SPD). Recently, higher resolution crystal structures of apo-SOD1 have appeared in the literature~\cite{Strange03}, but our results are not qualitatively affected by starting with the higher resolution structure. We add hydrogen atoms and remove any steric
249: clashes occurring in the structure by energy minimization. The interactions which contribute the most to the relative contact interaction strength are the non-bonded interactions and the solvent-induced interaction screening \cite{Paci02}:
250: \begin{equation}
251: E_{ij} = E_{ij}^{non-bonded} + G_{ij}\, .
252: \end{equation}We compute \(E_{ij}^{non-bonded}\) as the sum of the Lennard-Jones and electrostatic interaction energies. The solvation free energy, $G_{ij}$, measures
253: the screening of the pairwise interaction due to polarization of the
254: solvent.
255:
256: We calculate $G_{ij}$ using an implicit description of the solvent \cite{Sharp94}.
257: We approximate the solvent as a continuous dielectric medium with a
258: dielectric constant (\(\epsilon = 80\)) of water and the protein as a
259: continuous medium with dielectric constant (\(\epsilon =
260: 1\)) of vacuum.
261: We compute the solvation energy by solving the
262: Poisson-Boltzmann equation using the finite difference method \cite{Nicholls91}
263: available in CHARMM. Following Dominy {\em et al.}~\cite{Dominy02}, in order to decompose the solvation energy into pairwise
264: contributions, we eliminate all charges
265: in the protein except those corresponding to residue $i$ and
266: calculate its ``self'' solvation energy \(U_{i}\). Similarly, we compute $U_{j}$ and $U_{ij}$ when charges corresponding
267: to both $i$ and $j$ are present. Then, the solvent mediated screening of the interaction is:
268: \begin{equation}
269: G_{ij} = U_{ij} - { [U_{i} + U_{j}] }\, .
270: \end{equation}
271: So, we calculate the effective pairwise energy
272: contribution of a contact in the folded state.
273:
274: The $E_{ij}$ values that we thus obtain correspond to the folded state of the protein. In order to calculate the pairwise contribution of residues to the stability, we subtract the value of the contact energy in the unfolded state from the $E_{ij}$ values obtained above. The unfolded state is an ensemble of conformations. It has been noted that there is considerable native like structure in the unfolded state \cite{Zagrovic02,Shortle01, Elcock99}, meaning that the average environment of a contact is likely the same as in the folded state, but this contact is formed less frequently. Therefore, we model the unfolded state by simulating the {\em unscaled} G\={o} model near $T_{F}$ and estimating the frequency of native contact formation. The total contribution of a given contact to the stability can be written as:
275:
276: \begin{equation}
277: \Delta E_{ij}=E_{ij}(1-p_{ij})
278: \end{equation}where $E_{ij}$ is the contact energy in the folded state and $p_{ij}$ is the probability of forming the contact in the unfolded state. The energy and contact maps are plotted in Fig.~3.
279:
280: \section{Conclusions}
281:
282: We present the folding thermodynamic and kinetic analysis of SOD1 using a hybrid molecular dynamics method. We show that our simple model qualitatively
283: reproduces the folding of SOD1 observed in
284: experiments. We find that interactions at the edges of SOD1 \(\beta\)
285: sheets and the zinc, electrostatic and cross-over loops
286: are the key interactions that modulate the folding of SOD1
287: suggesting that mutants destabilizing these interactions may make
288: the enzyme more prone to aggregation. In particular, we identify the S5-S6 cleft (residues 83-100) as a likely site for reorganization to yield an aggregation prone misfolded state, an observation consistent with recent experimental evidence \cite{Elam03} and the {\em negative design} paradigm \cite{Richardson02}. We also provide a plausible
289: explanation for the phenomenon of increased reversibility of folding
290: upon mutation of free cysteine residues. We find good agreement between the positions identified by us to be crucial for folding kinetics and those identified by experiments \cite{Hamill00} on other members of the super-fold family (Ig-like proteins) to which SOD1 belongs. Further, we find that mutations at six of the residue contacts we identify are implicated in FALS.
291:
292: \section{Acknowledgements}
293: We thank S. V. Buldyrev for his help with DMD simulations,
294: B. N. Dominy for help with CHARMM calculations and J. M. Borreguero, L. J. Hayward, B. Kuhlman and Z. Xu for helpful
295: discussions. We acknowledge the support of the UNC-CH Research Council Grant. SDK acknowledges the support of UNC Molecular and Cellular Biophysics Training Program and Scholars for Tomorrow Fellowship of UNC Graduate School.
296:
297: %REFERENCES.
298: \bibliographystyle{jmb}
299: \bibliography{ndbiblio}
300:
301: \pagebreak
302:
303: %TABLES
304:
305: \begin{table}
306: \caption{Mutations in the identified kinetically important residues also found in patients with FALS. In the first column, 6 of the 16 contacts that we identify are shown. The second column lists mutations that disrupt these contacts and are found in FALS patients.}
307: \begin{tabular}{|l|l|l|}
308: \hline
309: Contact& Mutation(s)\\ \hline Lys3-Glu21&Glu21Lys, Glu21Gly\\\hline
310: Glu49-Arg115&Glu49Lys\\\hline Asp96-Ser34&Asp96Asn\\\hline Lys30-Glu100&Glu100Lys, Glu100Gly\\\hline Arg79-Asp101&Asp101Gly, Asp101Asn\\\hline Glu49-Arg115& Arg115Gly\\ \hline
311: \end{tabular}
312: \label{t:1}
313: \end{table}
314:
315: \clearpage
316: %\clearpage
317:
318: \begin{table}
319: \caption{Comparison of kinetically important residues. Kinetically important in SOD1 were determined by (i) structural comparison with the protein 1TEN \cite{Hamill00} using DALI \cite{Holm95} and (ii) by our simulations. Nucleus represents high $\phi$-value residues in 1TEN, equivalent positions refer to structurally equivalent positions determined by DALI alignment and key residues in the vicinity of equivalent positions identified by our simulations are shown in column 3. N/A refers to no equivalent residue found.}
320: \begin{tabular}{|l|l|l|}
321: \hline
322: Nucleus (1TEN) & Equivalent Position (1SPD) & Key Residue\\ \hline Ile20&Phe31&Lys30\\\hline Tyr36&His48&Glu49\\\hline Ile48&Asp83&Asp83\\\hline Leu50
323: &Asn86&N/A\\\hline Ile59&Asp101&Asp101\\\hline Thr66&His110&N/A\\\hline Tyr68&Ile112&N/A\\\hline Val70&Gly114&Arg115\\ \hline
324: \end{tabular}
325: \label{t:2}
326: \end{table}
327: \clearpage
328:
329: %FIGURES
330: \begin{figure}[ht!]
331: \centerline{ \vbox{ \hbox{\epsfxsize=12cm
332: \epsfbox{newfig3.eps} }}}
333: \label{wall}
334: \caption{SOD1 thermodynamic properties.
335: Dependence on temperature of (a) average energy
336: (b) $RMSD$ in DMD simulations with {\em unscaled} G\={o} potential. The sigmoidal curves in both (a) and (b) show that SOD1 undergoes a collapse transition. Similar curves are obtained for DMD simulations with the {\em scaled} G\={o} potential.}
337: \end{figure}
338:
339: \clearpage
340:
341: \begin{figure}[ht!]
342: \centerline{ \vbox{ \hbox{\epsfxsize=10cm
343: \epsfbox{newunscaled-tf-spa.eps} }}
344: \vbox{ \hbox{\epsfxsize=10cm
345: \epsfbox{newcompare-tf.eps} }}}
346: \vspace*{0.5cm}
347: \centerline{ \vbox{ \hbox{\epsfxsize=10cm
348: \epsfbox{newhisto-unscaled-snap.eps} }}
349: \vbox{ \hbox{\epsfxsize=10cm
350: \epsfbox{newhisto-scaled-snap.eps} }}}
351: \label{scale-unscaled}
352: \caption{Typical DMD trajectories near \(T_{F}\)
353: for (a) unscaled and (b) scaled G\={o} models. One trajectory is shown out of 4 for the unscaled G\={o} model and 5 for the scaled G\={o} model. In (b), a control trajectory with randomly chosen contacts (see text) strengthened is shown in grey. Histograms of energy values near $T_F$, for (c) unscaled and (d) scaled G\={o} models, averaged over 4 and 5 trajectories respectively, are also shown. From (b) and (d) we observe that the
354: intermediate state is not populated in the scaled G\={o} model but is present in the unscaled G\={o} model, indicating that the kinetics of the scaled G\={o} model is indeed two-state as observed in experiments \cite{Mei92, Stroppolo00}.}
355: \end{figure}
356: \clearpage
357:
358: \begin{figure}[ht!]
359: \centerline{ \vbox{ \hbox{\epsfxsize=13cm
360: \epsfbox{newfig2copy.eps} }}}
361: \vspace*{0.5cm}
362: \label{contm}
363: \caption{Contact map (upper triangle) and CHARMM generated energy map (lower triangle) of the SOD1 monomer. The color shading in the energy map is assigned according to the interaction energy values; red indicates repulsive interactions and blue indicates attractive interactions. Energy values are in kcal/mol.}
364: \end{figure}
365: \clearpage
366:
367: \begin{figure}[ht!]
368: \centerline{\vbox{ \hbox{\epsfxsize=10cm
369: \epsfbox{newfig4a_molscript.eps} }}
370: \vbox{ \hbox{\epsfxsize=10cm
371: \epsfbox{newfig4b_molscript.eps} }}}
372: \vspace*{0.5cm}
373: \label{cvcomp}
374: \caption{Two projections (a) and (b) of SOD1 monomer structure highlighting residues making the identified contacts important for the two-state folding of SOD1: Lys3-Glu21, Lys3-Glu153, Lys30-Glu100, Ser34-Asp96, Lys36-Asp92, Glu40-Lys91, Glu49-Arg115, Arg69-Glu77, His71-Asp83, Arg79-Asp83, Arg79-Asp101, His80-Asp83, Asp101-Val103, Glu121-Ser142, Asp125-Lys128, Glu133-Lys136. Figure generated using Kraulis' Molscript program and co-ordinates from PDB structure 1SPD.}
375: \end{figure}
376: \clearpage
377:
378: \begin{figure}[ht!]
379: \centerline{ \vbox{ \hbox{\epsfxsize=13cm
380: \epsfbox{newcloop.eps} }}}
381: \vspace*{0.5cm}
382: \label{cleft}
383: \caption{S5-S6 cleft. The residues in the cleft which are identified as crucial for making the folding kinetics two-state in our model, are highlighted. This cleft represents one of the edges of the $\beta$-barrel and is a possible site for initiation of non-native contact formation as also found in Ref. \cite{Elam03}, leading to aggregation.}
384: \end{figure}
385: \clearpage
386:
387: \end{document}
388: