Molecular dynamics simulations necessarily use a finite timestep, which introduces error or bias in the sampled configuration space density that grows rapidly with increasing timestep. For the first time, we show how to compute a natural measure of this error---the KL divergence---in both phase and configuration space for a widely used family of Langevin integrators, and show that VRORV is generally superior for simulation of molecular systems.
Yosi Shamay, Janki Shah, Mehtap Işık, Aviram Mizrachi, Josef Leibold, Darjus F. Tschaharganeh, Daniel Roxbury, Januka Budhathoki-Uprety, Karla Nawaly, James L. Sugarman, Emily Baut, Michelle R. Neiman, Megan Dacek, Kripa S. Ganesh, Darren C. Johnson, Ramya Sridharan, Karen L. Chu, Vinagolu K. Rajasekhar, Scott W. Lowe, John D. Chodera, and Daniel A. Heller.
Nature Materials, in press. [DOI] [PDF] [Supporting Info] [nano-drugbank]
In a collaboration with the Heller Lab at MSKCC, we show how indocyanine nanoparticles can package insoluble selective kinase inhibitors with high mass loadings and efficiently deliver them to tumors.
Kevin Hauser, Christopher Negron, Steven K. Albanese, Soumya Ray, Thomas Steinbrecher, Robert Abel, John D. Chodera, and Lingle Wang.
Manuscript prior to publication: [bioRxiv] [input files and analysis scripts]
In our first collaborative paper with Schrödinger, we present the first comprehensive benchmark assessing the ability for alchemical free energy calculations to predict clinical mutational resistance or susceptibility to targeted kinase inhibitors using the well-studied kinase Abl, the target of therapy for chronic myelogenous leukemia (CML).
We show how NCMC can be used to implement an efficient osmostat in molecular dynamics simulations to model realistic fluctuations in ion environments around biomolecules, and illustrate how the local salt environment around biological macromolecules can differ substantially from bulk.
Emily F. Ruff, Joseph M. Muretta, Andrew Thompson, Eric W. Lake, Soreen Cyphers, Steven K. Albanese, Sonya M. Hanson, Julie M. Behr, David D. Thomas, John D. Chodera, and Nicholas M. Levinson.
eLife, in press. [bioRxiv]
We show that, contrary to the canonical belief that activation shifts DFG-out to DFG-in populations, phosphorylation of AurA does not shift DFG-in/out equilibrium but instead remodels the conformational distribution of the DFG-in state.
Daniel L. Parton, Sonya M. Hanson, Lucelenie Rodríguez-Laureano, Steven K. Albanese, Scott Gradia, Chris Jeans, Markus Seeliger, and John D. Chodera.
Manuscript prior to publication: [bioRxiv]
Interactive data browser: [github.io]
Plasmids available via AddGene
Human kinase catalytic domains---the therapeutic target of selective kinase inhibitors used in the treatment of cancer and other diseases---are notoriously difficult and expensive to express in insect or human cells. Here, we utilize the phosphatase co-expression technology developed by Markus Seeliger (now at Stony Brook) to develop a library of human kinase catalytic domains for facile and inexpensive expression in bacteria.
Nonequilibrium candidate Monte Carlo can be used to accelerate the sampling of ligand binding modes by orders of magnitude over instantaneous Monte Carlo.
Peter Eastman, Jason Swails, John D. Chodera, Robert T. McGibbon, Yutong Zhao, Kyle A. Beauchamp, Lee-Ping Wang, Andrew C. Simmonett, Matthew P. Harrigan, Bernard R. Brooks, Vijay S. Pande. PLoS Computational Biology 13:e1005659, 2017. [DOI] [bioRxiv] [website] [GitHub]
We describe the latest version of OpenMM, a GPU-accelerated framework for high performance molecular simulation applications.
Guilherme Duarte Ramos Matos, Daisy Y. Kyu, Hannes H. Loeffler, John D. Chodera, Michael R. Shirts, David Mobley
Journal of Chemical Engineering Data, Journal of Chemical Engineering Data 62:1559, 2017. [DOI] [bioRxiv] [GitHub]
We review alchemical methods for computing solvation free energies and present an update (version 0.5) to the FreeSolv database of experimental and calculated hydration free energies of neutral compounds.
At low pH, metabolic enzymes lactate dehydrogenase and malate dehydrogenase undergo shifts in substrate utilization that have high relevance to cancer metabolism due to surprisingly simple protonation state effects.
Xu Jianing, Pham CG, Albanese SK, Dong Yiyu, Oyama T, Lee CH, Rodrik-Outmezguine V, Yao Z, Han S, Chen D, Parton DL, Chodera JD, Rosen N, Cheng EH, and Hsieh J. Journal of Clinical Investigation 126:3526, 2016. [DOI] [PDF]
In work with the James Hsieh lab at MSKCC, we examine the surprising origin of how different clinically-identified cancer-associated mutations in MTOR activate the kinase through distinct mechanisms.
Ariën S. Rustenburg, Justin Dancer, Baiwei Lin, Jianweng A. Feng, Daniel F. Ortwine, David L. Mobley, and John D. Chodera.
Journal of Computer-Aided Molecular Design 30:945, 2016. [DOI] [bioRxiv] [PDF] // data: [GitHub]
Solicited manuscript for special issue of the Journal of Computer Aided Molecular Design on the SAMPL5 Challenge.
The SAMPL Challenges have driven predictive physical modeling for ligand:protein binding forward by focusing the community on a series of blind challenges that evaluate performance on blind datasets, focus attention on current challenges for physical modeling techniques, and provide high-quality experimental datasets to the community after the challenge is over. For many years, challenges focused around hydration free energies have proven to be extremely useful, with theory now able to determine when experiment is wrong. To replace these challenges, since no more hydration free energy data is being measured, we proposed to use the partition or distribution coefficients of small druglike molecules between aqueous and apolar phases. We report the collection of cyclohexane-water partition data for a set of compounds used to drive the SAMPL5 distribution coefficient challenge, providing the experimental data, methodology, and insight for future iterations of this challenge.
We demonstrate a new tool that enables---for the first time---massively parallel molecular simulation studies of biomolecular dynamics at the superfamily scale, illustrating its application to protein tyrosine kinases, an important class of drug targets in cancer.
We present a simple scheme for automatically selecting how much initial simulation data to discard to equilibration or burn-in based on maximizing the number of statistically uncorrelated samples in the dataset.
Keywords: molecular simulation; molecular dynamics; burn-in; equilibration; production; analysis
Sonya M. Hanson, Sean Ekins, and John D. Chodera.
Journal of Computer Aided Molecular Design 29:1073, 2015. [DOI] [PDF] // IPython notebook [GitHub] // preprint: [bioRxiv]
Inspired by this In the Pipeline blog post
The drug development community faced a puzzling challenge when a disturbing paper published in PLoS One demonstrated results from the same assay performed with different dispensing technologies both varied wildly and significantly different in magnitude of reported potencies. Inspired by a talk given at the 2014 CADD GRC by Cosma Shalizi on bootstrapping to model error, we show how this simple idea can help explain a large amount of the discrepancy in this assay, and provide simple mathematical tools and an IPython notebook illustrating how easy it is to model the error and bias in experimental assays even when other information about assay reliability is unavailable.
We show how to avoid common accuracy-limiting mistakes in isothermal titration calorimetry, and provide a simple spreadsheet to aid in propagating the dominant source of uncertainty (titrant concentration errors) into the resulting thermodynamic parameters.
Keywords: isothermal titration calorimetry; ITC; propagation of error; entropy-enthalpy compensation
Kyle A. Beauchamp, Julie M. Behr, Ariën S. Rustenburg, Christopher I. Bayly, Kenneth Kroenlein, and John D. Chodera.
J. Phys. Chem. B 119:12912, 2015. [DOI] [PDF] // code: [GitHub] // preprint: [arXiv]
Progress in forcefield validation and parameterization has been hindered by the availability of high-quality machine-readable physical property data for small organic molecules. We show how the NIST ThermoML dataset provides a solution to this problem, and demonstrate its utility in benchmarking the GAFF/AM1-BCC small molecule forcefield on neat liquid densities and static dielectric constants to uncover problems in the representation of low-dielectric environments.