Abstract
Common methods for assigning atom-centered partial charges in computational chemistry, such as RESP and AM1-BCC, rely on quantum mechanical or semiempirical calculations of the molecule of interest, which are expensive to compute and dependent on the choice of input molecular conformer(s). Graph neural network (GNN) based continuous atom embeddings have been shown to be a fast and flexible solution for partial charge assignment, but those developed so far for condensed phase modeling have usually been trained to reproduce AM1-BCC charges, which themselves seek to reproduce the HF/6-31G(d) molecular electrostatic potential. Here, we investigate the suitability of various common charge assignment schemes, including ESP and atoms-in-molecule (AIM) based approaches, as training targets for new GNN-based charge models. We show that the strengths of both approaches can be combined by cotraining GNN models to AIM charges and molecular dipoles and electrostatic potentials. We collect a data set of quantum mechanical AIM properties computed at a high level of theory (ωB97X-D/def2-tzvpp), in both vacuum and implicit solvent, and train new GNN charge models to each. Charges can be scaled between the vacuum and solvated charge sets, and combined with Lennard-Jones parameters optimized using the Open Force Field infrastructure, to yield force fields that are suitably polarized for condensed phase modeling. We further demonstrate that the charge models may be applied to explore electrostatics-driven structure-activity relationships in medicinal chemistry. The charge models are freely available at: https://github.com/cole-group/nagl-mbis/.