Asnaashari, K and Krems, R V (2022) Gradient domain machine learning with composite kernels: improving the accuracy of PES and force fields for large molecules. Machine Learning: Science and Technology, 3 (1). 015005. ISSN 2632-2153
Asnaashari_2022_Mach._Learn.__Sci._Technol._3_015005.pdf - Published Version
Download (1MB)
Abstract
The generalization accuracy of machine learning models of potential energy surfaces (PES) and force fields (FF) for large polyatomic molecules can be improved either by increasing the number of training points or by improving the models. In order to build accurate models based on expensive ab initio calculations, much of recent work has focused on the latter. In particular, it has been shown that gradient domain machine learning (GDML) models produce accurate results for high-dimensional molecular systems with a small number of ab initio calculations. The present work extends GDML to models with composite kernels built to maximize inference from a small number of molecular geometries. We illustrate that GDML models can be improved by increasing the complexity of underlying kernels through a greedy search algorithm using Bayesian information criterion as the model selection metric. We show that this requires including anisotropy into kernel functions and produces models with significantly smaller generalization errors. The results are presented for ethanol, uracil, malonaldehyde and aspirin. For aspirin, the model with composite kernels trained by forces at 1000 randomly sampled molecular geometries produces a global 57-dimensional PES with the mean absolute accuracy 0.177 kcal mol−1 (61.9 cm−1) and FFs with the mean absolute error 0.457 kcal mol−1 Å−1.
Item Type: | Article |
---|---|
Subjects: | East India library > Multidisciplinary |
Depositing User: | Unnamed user with email support@eastindialibrary.com |
Date Deposited: | 06 Jul 2023 04:34 |
Last Modified: | 26 Jun 2024 11:12 |
URI: | http://info.paperdigitallibrary.com/id/eprint/1564 |