Development and benchmarking of Open Force Field 2.0.0---the Sage small molecule force field

Boothroyd S, Behara PK, Madin OC, Hahn DF, Jang H, Gapsys V, Wagner JR, Horton JT, Dotson DL, Thompson MW, Maat J, Gokey T, Wang L-P, Cole DJ, Gilson MK, Chodera JD, Bayly CI, Shirts MR, Mobley DL
Journal of Chemical Theory and Computation 19:3251, 2023 [DOI] [chemRxiv] [GitHub] [examples]

We present a new generation of small molecule force field for molecular design from the Open Force Field Initiative fit to both quantum chemical and experimental liquid mixture data

Improving force field accuracy by training against condensed-phase mixture properties

Boothroyd S, Madin OC, Mobley DL, Wang L-P, Chodera JD, and Shirts MR
Journal of Chemical Theory and Computation 18:3577, 2022 [DOI] [GitHub]

We use a new automated framework for physical property evaluation and fitting to show how molecular mechanics force fields can be systematically improved by fitting to condensed phase properties.

SAMPL7 protein-ligand challenge: A community-wide evaluation of computational methods against fragment screening and pose-prediction

Grosjean H, Isik M, Aimon A, Mobley D, Chodera JD, von Delft F, and Biggin PC
Journal of Computer-Aided Molecular Design 36:291, 2022 [DOI]

We field a blind community challenge to assess how well state of the art computational chemistry methods can predict the binding modes of small druglike fragments to a protein target for which no chemical matter is known, PHIP2, using fragment screening at the Diamond Light Source.

The Open Force Field Evaluator: An automated, efficient, and scalable framework for the estimation of physical properties from molecular simulation

Simon Boothroyd, Lee-Ping Wang, David L. Mobley, John D. Chodera, and Michael R. Shirts

Preprint ahead of submission: [ChemRxiv]

We describe a new software framework for automated evaluation of physical properties for the benchmarking and optimization of small molecule force fields according to best practices.

Best practices for constructing, preparing, and evaluating protein-ligand binding affinity benchmarks

David F Hahn, Christopher I Bayly, Hannah E Bruce Macdonald, John D Chodera, Antonia SJS Mey, David L Mobley, Laura Perez Benito, Christina EM Schindler, Gary Tresadern, Gregory L Warren
Preprint ahead of publication: [arXiv] [GitHub]

This living best practices paper for the Living Journal of Computational Molecular Sciences describes the current community consensus in how to curate experimental benchmark data for assessing predictive affinity models for drug discovery, how to prepare these systems for affinity calculations, and how to assess the results to compare performance.

Overview of the SAMPL6 pKa challenge: evaluating small molecule microscopic and macroscopic pKa predictions

Mehtap Işık, Ariën S Rustenburg, Andrea Rizzi, Marilyn R Gunner, David L Mobley, John D Chodera
Journal of Computer-Aided Molecular Design 35:131, 2021
[DOI] [bioRxiv] [GitHub] [manuscript and figure sources]

The SAMPL6 pKa challenge assessed the ability of the computational chemistry community to predict macroscopic and microscopic pKas for a set of druglike molecules resembling kinase inhibitors. This paper reports on the overall performance and lessons learned, including the surprising finding that many tools predict reasonably accurate macroscopic pKas corresponding to the wrong microscopic protonation sites.

Development and benchmarking of Open Force Field v1.0.0, the Parsley small molecule force field

Yudong Qiu, Daniel Smith, Simon Boothroyd, Hyesu Jang, Jeffrey Wagner, Caitlin C Bannan, Trevor Gokey, Victoria T Lim, Chaya Stern, Andrea Rizzi, Xavier Lucas, Bryon Tjanaka, Michael R Shirts, Michael Gilson, John D. Chodera, Christopher I Bayly, David Mobley, Lee-Ping Wang
Preprint ahead of publication: [chemRxiv] [force fields] [Open Force Field Initiative]

We present a new, modern small molecule force field for molecular design from the Open Force Field Initiative, a large industry-academic collaboration that focuses on open science, open data, and modern open source infrastructure.

Assessing the accuracy of octanol-water partition coefficient predictions in the SAMPL6 Part II log P Challenge

Mehtap Işık, Teresa Danielle Bergazin, Thomas Fox,  Andrea Rizzi, John D. Chodera, and David L. Mobley.
Journal of Computer Aided Molecular Design, 34:335, 2020. [DOI] [PDF] [bioRxiv] [GitHub]

We report the performance assessment of the 91 methods that were submitted to the SAMPL6 blind challenge for predicting octanol-water partition coefficient (logP) measurements. The average RMSE of the most accurate five MM-based, QM-based, empirical, and mixed approach methods based on RMSE were 0.92±0.13, 0.48±0.06, 0.47±0.05, and 0.50±0.06, respectively.

The SAMPL6 SAMPLing challenge: Assessing the reliability and efficiency of binding free energy calculations

Andrea Rizzi, Travis Jensen, David R. Slochower, Matteo Aldeghi, Vytautas Gapsys, Dimitris Ntekoumes, Stefano Bosisio, Michail Papadourakis, Niel M. Henriksen, Bert L. de Groot, Zoe Cournia, Alex Dickson, Julien Michel, Michael K. Gilson, Michael R. Shirts, David L. Mobley, and John D. Chodera
Journal of Computer Aided Molecular Design 34:601, 2020. [DOI] [PDF] [bioRxiv] [GitHub]

To assess the relative efficiencies of alchemical binding free energy calculations, the SAMPL6 SAMPLing challenge asked participants to submit predictions as a function of computer effort for the same force field and charge model. Surprisingly, we found that most molecular simulation codes cannot agree on the binding free energy was, even for the same force field.

Octanol-water partition coefficient measurements for the SAMPL6 Blind Prediction Challenge

sampl6-part2-logP.png

Mehtap Işık, Dorothy Levorse, David L. Mobley, Timothy Rhodes, and John D. Chodera.
Journal of Computer Aided Molecular Design
34:405, 2020. [DOI] [bioRxiv] [data] [GitHub]

We describe the design and data collection (and associated challenges) for the SAMPL6 part II logP octanol-water blind prediction challenge, where the goal was to benchmark the accuracy of force fields for druglike molecules (here, molecules resembling kinase inhibitors).

Toward learned chemical perception of force field typing rules

Camila Zanette, Caitlin C. Bannan, Christopher I. Bayly, Josh Fass, Michael K. Gilson, Michael R. Shirts, John Chodera, and David L. Mobley
Journal of Chemical Theory and Computation, 15:402, 2019. [DOI] [ChemRxiv] [GitHub]

We show how machine learning can learn typing rules for molecular mechanics force fields within a Bayesian statistical framework.

Overview of the SAMPL6 host-guest binding affinity prediction challenge

Andrea RizziSteven MurkliJohn N. McNeillWei YaoMatthew SullivanMichael K. Gilson, Michael W. Chiu, Lyle IsaacsBruce C. GibbDavid L. Mobley*, John D. Chodera*
* denotes co-corresponding authors
Journal of Computer-Aided Molecular Design special issue on SAMPL6, 32:937, 2018. [DOI] [bioRxiv] [GitHub]

We present an overview of the host-guest systems and participant performance for the SAMPL6 host-guest blind affinity prediction challenges, assessing how well various physical modeling approaches were able to predict ligand binding affinities for simple ligand recognition problems where receptor sampling and protonation state effects are eliminated due to the simplicity of supramolecular hosts. We find that progress is now stagnated likely due to force field limitations.

Binding Modes of Ligands Using Enhanced Sampling (BLUES): Rapid Decorrelation of Ligand Binding Modes Using Nonequilibrium Candidate Monte Carlo

Samuel Gill, Nathan M. Lim, Patrick Grinaway, Ariën S. Rustenburg, Josh Fass, Gregory Ross, John D. Chodera, and David Mobley.
Journal of Physical Chemistry B 22:5579, 2018. [DOI] [ChemRxiv] [GitHub]

Nonequilibrium candidate Monte Carlo can be used to accelerate the sampling of ligand binding modes by orders of magnitude over instantaneous Monte Carlo.

Approaches for calculating solvation free energies and enthalpies demonstrated with an update of the FreeSolv database

Guilherme Duarte Ramos Matos, Daisy Y. Kyu, Hannes H. Loeffler, John D. Chodera, Michael R. Shirts, David Mobley
Journal of Chemical Engineering Data 62:1559, 2017. [DOI] [bioRxiv] [GitHub]

We review alchemical methods for computing solvation free energies and present an update (version 0.5) to the FreeSolv database of experimental and calculated hydration free energies of neutral compounds.

Measuring experimental cyclohexane-water distribution coefficients for the SAMPL5 challenge

Ariën S. Rustenburg, Justin Dancer, Baiwei Lin, Jianweng A. Feng, Daniel F. Ortwine, David L. Mobley, and John D. Chodera.
Journal of Computer-Aided Molecular Design 30:945, 2016. [DOI] [bioRxiv] [PDF] // data: [GitHub]
Solicited manuscript for special issue of the Journal of Computer Aided Molecular Design on the SAMPL5 Challenge.

The SAMPL Challenges have driven predictive physical modeling for ligand:protein binding forward by focusing the community on a series of blind challenges that evaluate performance on blind datasets, focus attention on current challenges for physical modeling techniques, and provide high-quality experimental datasets to the community after the challenge is over. For many years, challenges focused around hydration free energies have proven to be extremely useful, with theory now able to determine when experiment is wrong. To replace these challenges, since no more hydration free energy data is being measured, we proposed to use the partition or distribution coefficients of small druglike molecules between aqueous and apolar phases. We report the collection of cyclohexane-water partition data for a set of compounds used to drive the SAMPL5 distribution coefficient challenge, providing the experimental data, methodology, and insight for future iterations of this challenge.