67e5b56085091082.tex
1: \begin{abstract}
2: The weighted ensemble (WE) simulation strategy provides unbiased sampling of non-equilibrium processes, such as molecular folding or binding, but the extraction of rate constants relies on characterizing steady state behavior.
3: Unfortunately, WE simulations of sufficiently complex systems will not relax to steady state on observed simulation times.  
4: Here we show that a post-simulation clustering of molecular configurations into ``microbins'' using methods developed in the Markov State Model (MSM) community, can yield unbiased kinetics from WE data \emph{before} steady-state convergence of the WE simulation itself. 
5: Because WE trajectories are directional and not equilibrium-distributed, the  history-augmented MSM (haMSM) formulation can be used, which yields the mean first-passage time (MFPT) without bias for arbitrarily small lag times.
6: Accurate kinetics can be obtained while bypassing the often prohibitive convergence requirements of the non-equilibrium weighted ensemble. 
7: We validate the method in a simple diffusive process on a 2D random energy landscape, and then analyze atomistic protein folding simulations using WE molecular dynamics. 
8: We report significant progress towards the unbiased estimation of protein folding times and pathways, though key challenges remain.
9: \end{abstract}
10: