Ensembler: Enabling high-throughput molecular simulations at the superfamily scale

Daniel L. Parton, Patrick B. Grinaway, Sonya M. Hanson, Kyle A. Beauchamp, and John D. Chodera
PLoS Computational Biology 12:e1004728, 2016. [DOI] [PDF] [bioRxiv] / data: [Dryad] / code: [GitHub]

We demonstrate a new tool that enables---for the first time---massively parallel molecular simulation studies of biomolecular dynamics at the superfamily scale, illustrating its application to protein tyrosine kinases, an important class of drug targets in cancer.

A simple method for automated equilibration detection in molecular simulations

John D. Chodera.
J. Chem. Theor. Comput. 12:1799, 2016. [DOI[PDF] / code to reproduce manuscript: [GitHub] / preprint: [bioRxiv] / available in pymbar.timeseries

We present a simple scheme for automatically selecting how much initial simulation data to discard to equilibration or burn-in based on maximizing the number of statistically uncorrelated samples in the dataset.

Keywords: molecular simulation; molecular dynamics; burn-in; equilibration; production; analysis

Modeling error in experimental assays using the bootstrap principle: Understanding discrepancies between assays using different dispensing technologies

Sonya M. Hanson, Sean Ekins, and John D. Chodera.
Journal of Computer Aided Molecular Design 29:1073, 2015. [DOI] [PDF] // IPython notebook [GitHub] // preprint: [bioRxiv]
Inspired by this In the Pipeline blog post

The drug development community faced a puzzling challenge when a disturbing paper published in PLoS One demonstrated results from the same assay performed with different dispensing technologies both varied wildly and significantly different in magnitude of reported potencies. Inspired by a talk given at the 2014 CADD GRC by Cosma Shalizi on bootstrapping to model error, we show how this simple idea can help explain a large amount of the discrepancy in this assay, and provide simple mathematical tools and an IPython notebook illustrating how easy it is to model the error and bias in experimental assays even when other information about assay reliability is unavailable.

Avoiding accuracy-limiting pitfalls in the study of protein-ligand interactions with isothermal titration calorimetry

Sarah E. Boyce, Joel Tellinghuisen, and John D. Chodera.
Manuscript prior to submission. [bioRxiv] [PDF]
Supplementary files: ITC worksheet [PDF] [XLSX] [ODS]
doi:10.1101/023796

We show how to avoid common accuracy-limiting mistakes in isothermal titration calorimetry, and provide a simple spreadsheet to aid in propagating the dominant source of uncertainty (titrant concentration errors) into the resulting thermodynamic parameters.

Keywords: isothermal titration calorimetry; ITC; propagation of error; entropy-enthalpy compensation

Towards Automated Benchmarking of Atomistic Forcefields: Neat Liquid Densities and Static Dielectric Constants from the ThermoML Data Archive

Kyle A. Beauchamp, Julie M. Behr, Ariën S. Rustenburg, Christopher I. Bayly, Kenneth Kroenlein, and John D. Chodera.
J. Phys. Chem. B 119:12912, 2015. [DOI] [PDF] // code: [GitHub] // preprint: [arXiv

Progress in forcefield validation and parameterization has been hindered by the availability of high-quality machine-readable physical property data for small organic molecules. We show how the NIST ThermoML dataset provides a solution to this problem, and demonstrate its utility in benchmarking the GAFF/AM1-BCC small molecule forcefield on neat liquid densities and static dielectric constants to uncover problems in the representation of low-dielectric environments.

Spectral rate theory for two-state kinetics

Jan-Hendrik Prinz, John D. Chodera, and Frank Noé.
Phys. Rev. X 4:011020, 2014. [DOI] [PDF]

We present a new mathematical framework for unifying various two-state rate theories presented in the physical chemistry literature over many decades, and provide a quantitative way to measure reaction coordinate quality.

Time step rescaling recovers continuous-time dynamical properties for discrete-time Langevin integration of nonequilibrium systems

David A. Sivak, John D. Chodera, and Gavin E. Crooks.
J. Phys. Chem. B, 118:6466-6474, 2014. William C. Swope Festschrift issue. [DOI] [PDF]

We derive a simple, easy-to-implement Langevin integrator that has universally useful properties in molecular simulations.

Keywords: velocity Verlet with velocity randomization; VVVR; nonequilibrium integration

Identifying ligand binding sites and poses using GPU-accelerated Hamiltonian replica exchange molecular dynamics

Kai Wang K, John D. Chodera, Yanzhi Yang, and Michael R. Shirts. 
J. Comput. Aid. Mol. Des. 27:989, 2013. [DOI] [PDF]

We show how bound ligand poses can be identified even when the location of the binding sites are unknown using the machinery of alchemical modern free energy calculations on graphics processors. 

Systematic improvement of a classical molecular model of water

Lee-Ping Wang, Teresa L. Head-Gordon, Jay W. Ponder, Pengyu Ren, John D. Chodera, Peter K. Eastman, Todd J. Martinez, and Vijay S. Pande.
J. Phys. Chem. B 117:9956, 2013. [DOI] [PDF]

A new inexpensive polarizable model of liquid water for next-generation forcefields is derived using an automated parameterization engine.

 

Using nonequilibrium fluctuation theorems to understand and correct errors in equilibrium and nonequilibrium discrete Langevin dynamics simulations

David A. Sivak, John D. Chodera, and Gavin E. Crooks.
Phys. Rev. X 3:011007, 2013. [DOI] [PDF]

The finite-timestep errors in molecular dynamics simulations can be interpreted as a form of nonequilibrium work.  We show how this leads to straightforward schemes for correcting for these errors or assessing their impact.

Keywords: velocity verlet with Velocity randomization; VVVR; nonequilibrium free energy; integrator error; nonequilibrium integration

OpenMM 4: A reusable, extensible, hardware independent library for high performance molecular simulation

Peter Eastman, Mark S. Friedrichs, John D. Chodera, Randy J. Radmer, Chris M. Bruns, Joy P. Ku, Kyle A. Beauchamp, T. J. Lane, Lee-Ping Wang, Diwakar Shukla, Tony Tye, Mike Houston, Timo Stich, Christoph Klein, Michael R. Shirts, and Vijay S. Pande.
J. Chem. Theor. Comput. 9:461, 2013. [DOI] [PDF]

We describe the latest version of an open-source, GPU-accelerated library and toolkit for molecular simulation.

The limitations of constant-force-feedback experiments

Phillip J. Elms, John D. Chodera, Carlos J. Bustamante, and Susan Marqusee.
Biophys. J. 103:1490, 2012. [DOI] [PDF]

Popular constant-force-feedback single-molecule experiments can cause severe artifacts in single-molecule force spectroscopy data.  We demonstrate a simple alternative that eliminates these artifacts.

The molten globule state is unusually deformable under mechanical force

Philip J. Elms, John D. Chodera, Carlos Bustamante, and Susan Marqusee.
Proc. Natl. Acad. Sci. USA 109:3796, 2012. [DOI] [PDF]

We measure the physical properties of the molten globule state of apo-myoglobin, and show that it is unusually deformable compared to typical protein native states.

Replica exchange and expanded ensemble simulations as Gibbs sampling: Simple improvements for enhanced mixing

John D. Chodera and Michael R. Shirts.
J. Chem. Phys. 135:194110, 2011. [DOI] [PDF

We show how a simple change to the way exchanges are handled in the popular replica-exchange simulation methodology can enormously increase efficiency at no increase in computational cost.

The ribosome modulates nascent protein folding

Christian M. Kaiser, Daniel H. Goldman, John D. Chodera, Ignacio Tinoco Jr., and Carlos Bustamante.
Science 334:1723, 2011. [DOI] [PDF]

Using single-molecule force spectroscopy, we show how the ribosome itself modulates the folding dynamics of nascent protein chains emerging from the exit tunnel.

Nonequilibrium candidate Monte Carlo is an efficient tool for equilibrium simulation

Jerome P. Nilmeier, Gavin E. Crooks, David D. L. Minh, and John D. Chodera. 
Proc. Natl. Acad. Sci. USA 108:E1009, 2011. [DOI] [PDF]

We present a significant generalization of Monte Carlo methods that provide an enormously useful tool for enhancing the efficiency of molecular simulations and enabling molecular design.

Keywords: NCMC; Monte Carlo; Metropolis-Hastings; acceptance rates; molecular dynamics