New publication - Investigating the Error Imbalance of Large-Scale Machine Learning Potentials in Catalysis
Posted January 28, 2025 at 02:41 PM | categories: publication, news | tags:
Updated January 28, 2025 at 03:19 PM
The paper “Investigating the Error Imbalance of Large-Scale Machine Learning Potentials in Catalysis” by Kareem Abdelmaqsoud et al. explores the persistent error imbalance in machine learning potentials (MLPs) trained on the Open Catalyst 2020 (OC20) dataset. While MLPs have significantly accelerated catalyst discovery by approximating density functional theory (DFT) calculations, the study finds that non-metals exhibit disproportionately high prediction errors compared to intermetallics and metalloids. The authors investigate two primary sources of this imbalance: (1) DFT convergence errors and (2) surface reconstruction inconsistencies. They determine that while DFT convergence errors exist, they do not significantly impact MLP performance. However, surface reconstructions introduce inconsistencies in adsorption energy referencing, making them difficult for MLPs to model accurately. By removing these reconstructions from validation sets, the study achieves a 35% reduction in mean absolute error (MAE) and a more balanced error distribution. The paper suggests an alternative approach: training MLPs on total energies rather than adsorption energies, which eliminates referencing issues and maintains comparable accuracy . This research provides valuable insights for improving MLP robustness in catalysis and optimizing large-scale computational datasets.
@article{abdelmaqsoud-2024-inves-error, author = {Kareem Abdelmaqsoud and Muhammed Shuaibi and Adeesh Kolluru and Raffaele Cheula and John R. Kitchin}, title = {Investigating the Error Imbalance of Large-Scale Machine Learning Potentials in Catalysis}, journal = {Catalysis Science & Technology}, volume = {14}, number = {20}, pages = {5899-5908}, year = 2024, doi = {10.1039/d4cy00615a}, url = {http://dx.doi.org/10.1039/d4cy00615a}, DATE_ADDED = {Sat Aug 17 09:45:44 2024}, }
Copyright (C) 2025 by John Kitchin. See the License for information about copying.
Org-mode version = 9.8-pre