Using results from one code block in another in org-mode

| categories: elisp, orgmode, emacs | tags:

One really great feature of org-mode is that you have many options for passing data between code blocks. In this post we look at some of these options, using emacs-lisp as the language. These examples run in a session, where you can keep variables in memory between blocks and use them in subsequent blocks.

Here we set a variable to a value.

(setq some-variable 42)
42

Then later in another block we can use that variable:

(+ some-variable 1)
43

While you are in the session, some-variable can be used. If you want some mind-bending trouble: the emacs-lisp session is global, so you can access some-variable even from another buffer! Don't do that. When you close emacs, this variable disappears, and all that is left are the results above.
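
For example, a block evaluated from a completely different org buffer in the same Emacs session still sees it (a sketch of the hazard, not a recommendation):

#+BEGIN_SRC emacs-lisp
;; in some other buffer, same Emacs session
some-variable  ; => 42
#+END_SRC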

There is another way to pass information from one block to another, using named src blocks and variables in the block header. This lets you pass data between blocks by name, and as you will see later, you can even access the results by name from other files.

1 :var

First, we give our src block a name like this:

#+name: block-1
#+BEGIN_SRC emacs-lisp
(current-time-string)
#+END_SRC

When we run this, the results will have a name too.

(current-time-string)
Tue Feb 12 08:19:23 2019

Now, we can use the named result as input to a new block using the :var header.

#+BEGIN_SRC emacs-lisp :var input=block-1
(format "We got %S in block-1" input)
#+END_SRC

When we run this block, emacs will run block-1 and put its output into the variable input, which we use inside the code block.

(format "We got %S in block-1" input)
We got "Tue Feb 12 08:20:44 2019" in block-1

Some things to note:

  1. Every time you run this, block-1 gets rerun.
  2. The results in this block are not the same as in block-1.
  3. The results in block-1 are not changed when you run the second block.

You may not want to rerun block-1 each time; maybe it is an expensive calculation, or maybe it should not be changed. You can prevent this behavior by using the :cache header.

2 :cache

If you specify :cache yes, then org-mode stores a hash of the code block with the results, and if the code block hasn't changed, it should not run again.

#+name: block-2
#+BEGIN_SRC emacs-lisp :cache yes
(current-time-string)
#+END_SRC
(current-time-string)
Tue Feb 12 08:06:22 2019

Now, when we use block-2 as input to a new block, we see the output is the same as the output from block-2.

(format "We got %S in block-2" input)
We got "Tue Feb 12 08:06:22 2019" in block-2

Ok, but what if my results are too large to put in the buffer, or too complex for text? You still have some options.

3 :wrap

Suppose we generate some JSON in one block, and we want to use it in another block, while still seeing the JSON in the buffer as an intermediate result. We can wrap the output in a json block like this.

(require 'json)
(json-encode `(("date" . ,(current-time-string))))

{"date":"Tue Feb 12 08:30:20 2019"}

Then, we can simply use that output as the input to a new block.

(format "We got %S in json" input)
We got "{\"date\":\"Tue Feb 12 08:30:20 2019\"}
" in json

This is admittedly still pretty simple, text-based data. It is probably not a good idea to do this with binary data.

Note you can refer to this result even in another org-file:

#+BEGIN_SRC emacs-lisp :var input=./2019-02-12.org:json
input
#+END_SRC

#+RESULTS:
: {"date":"Tue Feb 12 08:30:20 2019"}

4 :file

It may be that your data is too large to conveniently keep in your org-file, or maybe it is binary data. No problem: just put it into an external file using the :file header. It looks like this:

#+name: block-3
#+BEGIN_SRC emacs-lisp :cache yes :file block-3
(require 'json)
(json-encode `(("date" . ,(current-time-string))))
#+END_SRC

#+RESULTS[a14d376653bd8c40a0961ca95f21d8837dddec66]: block-3
[[file:block-3]]

Note that you have to provide a file name for this. Sometimes that is nice, if you want a human-recognizable file to send to someone, but it would also be nice if there were an automatic naming scheme, e.g. based on a SHA-1 hash of the src block.
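
Here is a sketch of what such a scheme might look like (hypothetical; my/auto-file-name is not a built-in org feature):

#+BEGIN_SRC emacs-lisp
;; Hypothetical helper: name the output file after the sha1 hash of
;; the body of the src block at point.
(defun my/auto-file-name ()
  (concat (sha1 (org-element-property :value (org-element-at-point))) ".dat"))
#+END_SRC

You could then imagine a header like :file (my/auto-file-name), since babel header arguments can be elisp forms.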

(require 'json)
(json-encode `(("date" . ,(current-time-string))))

block-3

Now you can use other tools to check out the file. Here we can still use simple shell tools.

cat block-3

The output of block-3 is a file name; here we just echo the input variable to see it:

input
/Users/jkitchin/Box Sync/kitchingroup/jkitchin/journal/2019/02/12/block-3

So you can use it in a new block to read the data in, and then do something new with it.

(with-temp-buffer
  (insert-file-contents input)
  (format "We got %S in block-3" (json-read-from-string (buffer-string))))
We got ((date . "Tue Feb 12 08:46:55 2019")) in block-3

5 "remote" data

The blocks do not have to be in order. If you want, you can put your blocks in an appendix, and keep only analysis blocks here that use them. That way, the blocks here stay short and readable, while the longer, more complex blocks live elsewhere and do not clutter your document.
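
The analysis block below presumably gets its input through a :var header pointing at a named block in the appendix, along these lines (appendix-data is a placeholder name; the appendix block would need a matching #+name: line, which is not shown in this post):

#+BEGIN_SRC emacs-lisp :var input=appendix-data
;; ... body as shown below
#+END_SRC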

(with-temp-buffer
  (insert-file-contents input)
  (format "We got %S in the appendix data" (json-read-from-string (buffer-string))))
We got "{\"date\":\"Tue Feb 12 09:11:12 2019\"}" in the appendix data

6 Manually saving data in files

Note you can also manually save data in a file, for example:

(require 'json)
(let ((f "block-4.json"))
  (with-temp-file f
    (prin1
     (json-encode `(("date" . ,(current-time-string))))
     (current-buffer)))
  f)
block-4.json

We put the filename last in the block so that it is the value the block returns, and we don't have to manually type it in the next block. You know, try not to repeat yourself…

This just shows we did write out to our file:

cat block-4.json

And we read the file in here, using the filename from block-4 as an input variable (i.e., with a :var input=block-4 header).

(with-temp-buffer
  (insert-file-contents input)
  (format "We got %S in block-4" (json-read-from-string (buffer-string))))
We got "{\"date\":\"Tue Feb 12 08:51:25 2019\"}" in block-4

7 An appendix for data

(require 'json)
(let ((f "appendix.json"))
  (with-temp-file f
    (prin1
     (json-encode `(("date" . ,(current-time-string))))
     (current-buffer)))
  f)
appendix.json

8 Caveats

Using org-mode like this is almost always about finding the right tradeoffs in what is persistent and where it is stored. Not all of the intermediate data/calculations need to be stored; if they are really cheap, you can just run the code blocks again. If they are really small, i.e. easy for you to read in a few lines, you can store them in the document. If they are really large, you can store them in a file.

The beauty of having everything in an org-file is you have a single file that is easy to transport. When the files get too large though, it can become impractical, e.g. emacs may slow down if you try to put thousands of lines of xml data into the buffer. Then, you have to make some decisions about what to keep, where to keep it, and in what form to keep it.

For short projects where you only need a single compute session, having everything in memory may be fine. For longer projects, say one that is long enough that you will close all the buffers, and possibly restart emacs in between working on it, you have to make some decisions about what to save from each block so you can continue the work in the next session. Again, you have to decide what to save, where to save it, and in what form.

Once you start saving data outside the org-file, it becomes less portable, or at least trickier to move, because you also need to move all the data files to keep it intact. I have explored the concept of an org-archive in the past, where you get a list of all the files linked in the org-file, but so far that has only been worked out in some small proofs of concept.

Not all languages are the same in org-mode. Not all of them support sessions, for example, and they may not all work like the examples here. The scimax iPython modifications do not behave like the examples above. That is probably due to bugs I have inadvertently introduced, and in the future I will try to make them work like emacs-lisp does above.

Overall, org-mode has one of the most flexible and powerful systems for passing and reusing data in documents I have ever seen. It is not perfect, and in such a powerful system there are many unexplored or lightly traveled corners that may have hazards in them. It still seems pretty promising though.

Copyright (C) 2019 by John Kitchin. See the License for information about copying.

org-mode source

Org-mode version = 9.2.1


2018 in a nutshell for the Kitchin Research Group

| categories: news | tags:

The majority of this year was spent finishing my sabbatical in the Google Accelerated Science Research group. I finished that up in August, and have returned to Pittsburgh now. I spent that time learning about differentiable programming and machine learning applications in science and engineering. It was a great learning year for me, and the beginning of some new research directions.

It wasn't all work: I was able to bike over 3000 miles while we lived in California, and spent lots of weekends at beaches, state parks, San Francisco, and many other beautiful places. There is a lot I miss from my sabbatical, but I am mostly glad to be back home.

1 An all new research group

In 2017, the last of my students finished, and my group temporarily shrank to zero for a semester. That luckily coincided with the beginning of my sabbatical, which allowed me to focus exclusively on starting new research directions. Toward the end of last year, three new PhD students joined my group. I did not take any new PhD students this year, but several new MS students joined the group at the end of 2018:

  1. Siddhant Lambor (co-advised with Prof. Lynn Walker)
  2. Gautham Swaminathan (co-advised with Prof. Lynn Walker)
  3. Siddarth Achar
  4. Senhong Liu
  5. Dingqi Nai
  6. Qiong Wang

These students will all work on some aspect of machine learning in formulation research, design of experiments, and molecular simulation. It should be an exciting year!

2 Publications

2018 was a light year for publications, largely due to my group shrinking to zero and my being on sabbatical. I wrote a nice perspective article on machine learning in catalysis:

kitchin-2018-machin-learn-catal
This perspective article describes how machine learning is being used in catalysis research and opportunities for further research.

And several publications from past M.S. students got finished and published:

thirumalai-2018-inves-react
This paper shows that many single atom alloys have unique electronic structure features that are responsible for their special catalytic properties.
gao-2018-model-pallad
We used DFT calculations to build a neural network potential to model adatom diffusion on a metal surface.
wang-2018-densit-funct
We used DFT calculations to build a neural network potential to model zirconia polymorphs, oxygen vacancy formation and diffusion.

larsen-2017-atomic-simul
This is a modern update on the Atomic Simulation Environment Python software. We have been using and contributing to this software for about 15 years now! It was technically published in 2017, but it was the most cited article in J. Phys.: Cond. Matt. in 2018!

Citations on our past work continue to grow.

Bibliography

  • [kitchin-2018-machin-learn-catal] John Kitchin, Machine Learning in Catalysis, Nature Catalysis, 1(4), 230-232 (2018). link. doi.
  • [thirumalai-2018-inves-react] Hari Thirumalai & John Kitchin, Investigating the Reactivity of Single Atom Alloys Using Density Functional Theory, Topics in Catalysis, 61(5-6), 462-474 (2018). link. doi.
  • [gao-2018-model-pallad] Tianyu Gao & John Kitchin, Modeling Palladium Surfaces With Density Functional Theory, Neural Networks and Molecular Dynamics, Catalysis Today, 312, 132-140 (2018). link. doi.
  • [wang-2018-densit-funct] Chen Wang, Akshay Tharval & John Kitchin, A Density Functional Theory Parameterised Neural Network Model of Zirconia, Molecular Simulation, 44(8), 623-630 (2018). link. doi.
  • [larsen-2017-atomic-simul] Ask Hjorth Larsen, Jens Jørgen Mortensen, Jakob Blomqvist, Ivano E Castelli, Rune Christensen, Marcin Dułak, Jesper Friis, Michael N Groves, Björk Hammer, Cory Hargus, Eric D Hermes, Paul C Jennings, Peter Bjerre Jensen, James Kermode, John R Kitchin, Esben Leonhard Kolsbjerg, Joseph Kubal, Kristen Kaasbjerg, Steen Lysgaard, Jón Bergmann Maronsson, Tristan Maxson, Thomas Olsen, Lars Pastewka, Andrew Peterson, Carsten Rostgaard, Jakob Schiøtz, Ole Schütt, Mikkel Strange, Kristian S Thygesen, Tejs Vegge, Lasse Vilhelmsen, Michael Walter, Zhenhua Zeng & Karsten W Jacobsen, The Atomic Simulation Environment-A Python Library for Working With Atoms, Journal of Physics: Condensed Matter, 29(27), 273002 (2017). link.

3 New courses

On my return from my sabbatical, I taught a course that was new to me, 06-623 Mathematical Modeling of Chemical Engineering Processes. I taught this course in Python, and it was a tour of mathematical and scientific programming that started with "Hello world" and ended with machine learning. We traveled through differential equations, nonlinear algebra, optimization, and regression along the way, using numpy and scipy. It was a fun class, and I look forward to teaching it again next Fall. You can see the course notes at https://github.com/jkitchin/f18-06623. I ran this course using Jupyter notebooks (of course I wrote the notes in org-mode and used the Jupyter notebook exporter I wrote to make these!) and Box.com. It worked, but wasn't my favorite. I will try to go back to Emacs+org-mode for this next year.

This coming spring will be another new course: our junior Transport Lab course.

4 Emacs, org-mode and scimax

org-ref has been downloaded close to 40K times! I thought it passed 40K early in December, but MELPA shows it just under 40K today. They switched servers recently, so maybe some statistics were lost. If you count the MELPA-stable downloads, it is still over 40K. There are now over 50 contributors to org-ref besides me! That is pretty awesome, and hopefully speaks to the number of people interested in using Emacs for scientific publishing.

This fall I picked up scimax development again after my sabbatical break, and have a few notable improvements I will launch in 2019. These include:

  • Many improvements to ipython in scimax:
    • Code completion
    • Inspection
    • More jupyter-like features (?, ??) and key-bindings
    • export to Jupyter notebooks
    • src-block keymaps
    • and more.
  • A new editmarks package that will allow persistent comments, highlights, and track-change mode.
  • Improved support for literate programming in org-mode including jump to the definition in org-mode.

Scimax is an increasingly important project to me, and in 2019 I am going to work on some ways that will make it easier for me to spend more time on it in the future.

5 Online activity

5.1 kitchingroup.cheme.cmu.edu

Since I was on sabbatical, it was a low-volume blogging year, with only 22 posts. Traffic to the blog was nonetheless up from last year. I suspect I will blog more this year.

5.2 Github

I was even less active on GitHub in 2018 than in 2017. You can see the activity picked back up this past fall as I returned to my day job as a professor. I expect 2019 will pick back up as usual.

5.3 YouTube

Our YouTube traffic is down this year compared to last year. It is still always interesting to see the spikes in traffic on the org-mode is awesome video. Maybe it got mentioned on Hacker News or something. I only made one video last year; I took a break while on sabbatical, and was busy this fall with a new course. Maybe 2019 will be a better year for that. I have some plans for new videos in the new year on ipython, and some updates in scimax.

We did cross 1000 subscribers this year. That doesn't qualify my channel for monetization yet; you also need 4000 watch hours in the past year, and last year we only had 1466 hours, so we are not that close. Why am I interested in this? I am actively looking for ways to support scimax development, and this could be one way to do that.

5.4 Twitter

It wasn't super easy to get all the Twitter data; I had to manually download the information for each month. Now that I have it, I did some analysis, so here it is. First we look at how many tweets, likes, retweets, etc. there were last year:

import csv
import glob

tweets = 0
impressions = 0
texts = []
times = [] # times of the tweets
likes = 0
retweets = 0
replies = 0

for csvfile in glob.glob('*.csv'):
    with open(csvfile) as f:
        reader = csv.DictReader(f)
        for row in reader:
            tweets += 1
            impressions += float(row['impressions'])
            texts += [row['Tweet text']]
            times += [row['time']]
            likes += float(row['likes'])
            replies += float(row['replies'])
            retweets += float(row['retweets'])

print(f'''{tweets} Tweets with {int(impressions)} impressions.
There were {int(likes)} likes, {int(retweets)} retweets, and {int(replies)} replies.''')

471 Tweets with 282655 impressions. There were 1089 likes, 220 retweets, and 341 replies.

Next, we look at the time distribution of these tweets. It seems like this should be easier to do than the code below (it probably is in pandas; see the sketch after the plot).

import datetime
import matplotlib.pyplot as plt
import numpy as np

x = np.array([datetime.datetime.strptime(time[0:-6], "%Y-%m-%d %H:%M")
              for time in times])

months = np.zeros(12)
for time in x:
    months[time.month - 1] += 1

plt.bar(np.arange(12), months)
plt.xticks(np.arange(12), ['January', 'February', 'March',
                           'April', 'May', 'June',
                           'July', 'August', 'September',
                           'October', 'November', 'December'],
           rotation=45);
plt.ylabel('Tweet count')
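
As noted above, this is probably easier in pandas. Here is a minimal sketch of that route, assuming the times list from the first block (pandas is not otherwise used in this post, and months with no tweets would simply be absent from the bar chart):

import pandas as pd

# Parse the timestamps (dropping the trailing UTC offset, as above)
# and count tweets per month.
ts = pd.to_datetime(pd.Series(times).str[:-6], format='%Y-%m-%d %H:%M')
ts.dt.month.value_counts().sort_index().plot.bar(ylabel='Tweet count')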

6 Summary and outlook

2018 was a pretty good year. I took a break from several things I had spent a lot of time on in the past, and learned some new things that I spend my time on now. I am looking forward to getting back to some of the old things, especially scimax development. I still think it is a key component of modern scientific research and publishing, and that it has an important role in education. Stay tuned!

Copyright (C) 2018 by John Kitchin. See the License for information about copying.

org-mode source

Org-mode version = 9.1.14


Line integrals in Python with autograd

| categories: autograd, integration, python | tags:

A line integral is an integral of a function along a curve in space. We usually represent the curve by a parametric equation, e.g. \(\mathbf{r}(t) = [x(t), y(t), z(t)] = x(t)\mathbf{i} + y(t)\mathbf{j} + z(t)\mathbf{k}\). So, in general the curve will be a vector function, and the function we want to integrate will also be a vector function.

Then, we can write the line integral definition as:

\(\int_C \mathbf{F}(\mathbf{r})\cdot d\mathbf{r} = \int_a^b \mathbf{F}(\mathbf{r}(t)) \cdot \mathbf{r}'(t)\,dt\), where \(\mathbf{r}'(t) = \frac{d\mathbf{r}}{dt}\). This integrand is a scalar function, because of the dot product.

The following examples are adapted from Chapter 10 of Advanced Engineering Mathematics by Kreyszig.

The first example is the evaluation of a line integral in the plane. We want to evaluate the integral of \(\mathbf{F}(\mathbf{r})=[-y, -xy]\) on the curve \(\mathbf{r}(t)=[-\sin(t), \cos(t)]\) from \(t=0\) to \(t=\pi/2\). The answer in the book is given as 0.4521. Here we evaluate this numerically, using autograd for the relevant derivative. Since the curve has multiple outputs, we have to use the jacobian function to get the derivatives. After that, it is a simple bit of matrix multiplication and a call to the quad function.

import autograd.numpy as np
from autograd import jacobian
from scipy.integrate import quad

def F(X):
    x, y = X
    return -y, -x * y

def r(t):
    return np.array([-np.sin(t), np.cos(t)])

drdt = jacobian(r)

def integrand(t):
    return F(r(t)) @ drdt(t)

I, e = quad(integrand, 0.0, np.pi / 2)
print(f'The integral is {I:1.4f}.')
The integral is 0.4521.


We get the same result as the analytical solution.

The next example is in three dimensions. Find the line integral along \(\mathbf{r}(t)=[\cos(t), \sin(t), 3t]\) of the function \(\mathbf{F}(\mathbf{r})=[z, x, y]\) from \(t=0\) to \(t=2\pi\). The solution is given as 21.99.

import autograd.numpy as np
from autograd import jacobian
from scipy.integrate import quad

def F(X):
    x, y, z = X
    return [z, x, y]

def C(t):
    return np.array([np.cos(t), np.sin(t), 3 * t])

dCdt = jacobian(C, 0)

def integrand(t):
    return F(C(t)) @ dCdt(t)

I, e = quad(integrand, 0, 2 * np.pi)
print(f'The integral is {I:1.2f}.')
The integral is 21.99.


That is also the same as the analytical solution. Note the exact analytical solution is \(7\pi\), and our answer agrees with it to machine precision:

7 * np.pi - I
3.552713678800501e-15

As a final example, we consider an alternate form of the line integral. In this form we do not use a dot product, so the integral results in a vector. This doesn't require anything from autograd, but it does require us to be somewhat clever about how to do the integrals, since quad can only integrate scalar functions. We need to integrate each component of the integrand independently. Here is one approach, where we use lambda functions for each component. You could also manually separate the components.

def F(r):
    x, y, z = r
    return x * y, y * z, z

def r(t):
    return np.array([np.cos(t), np.sin(t), 3 * t])

def integrand(t):
    return F(r(t))

[quad(lambda t: integrand(t)[i], 0, 2 * np.pi)[0] for i in [0, 1, 2]]
[-6.9054847581172525e-18, -18.849555921538755, 59.21762640653615]

The analytical solution in this case was given as:

[0, -6 * np.pi, 6 * np.pi**2]
[0, -18.84955592153876, 59.21762640653615]

which is evidently the same as our numerical solution.

An alternative, somewhat more verbose approach is this vectorized integrate function. We still make temporary functions for integrating, and the vectorization is essentially like the list comprehension above, but we avoid the lambda functions.

@np.vectorize
def integrate(i):
    def integrand(t):
        return F(r(t))[i]
    I, e = quad(integrand, 0, 2 * np.pi)
    return I

integrate([0, 1, 2])
array([ -6.90548476e-18,  -1.88495559e+01,   5.92176264e+01])

1 Summary

Once again, autograd provides a convenient way to compute function jacobians, which makes it easy to evaluate line integrals in Python.

Copyright (C) 2018 by John Kitchin. See the License for information about copying.

org-mode source

Org-mode version = 9.1.14


Using autograd for error propagation

| categories: autograd, uncertainty | tags:

Back in 2013 I wrote about using the uncertainties package to propagate uncertainties. The problem setup was finding the uncertainty in the exit concentration from a CSTR when there are uncertainties in the other parameters. In this problem we were given the following information about the parameters and their uncertainties.

| Parameter | Value | σ    |
|-----------|-------|------|
| Fa0       | 5     | 0.05 |
| v0        | 10    | 0.1  |
| V         | 66000 | 100  |
| k         | 3     | 0.2  |

The exit concentration is found by solving this equation:

\(0 = Fa_0 - v_0 Ca - k Ca^2 V\)

So the question was: what is Ca, and what is the uncertainty in it? Finding Ca is easy with fsolve.

from scipy.optimize import fsolve

Fa0 = 5.0
v0 = 10.0

V = 66000.0
k = 3.0

def func(Ca, v0, k, Fa0, V):
    "Mole balance for a CSTR. Solve this equation for func(Ca)=0"
    Fa = v0 * Ca     # exit molar flow of A
    ra = -k * Ca**2  # rate of reaction of A L/mol/h
    return Fa0 - Fa + V * ra

Ca, = fsolve(func, 0.1 * Fa0 / v0, args=(v0, k, Fa0, V))
Ca
0.0050000000000000001

The uncertainty on Ca is a little trickier. A simplified way to estimate it is:

\(\sigma_{Ca} = \sqrt{(dCa/dv_0)^2 \sigma_{v_0}^2 + (dCa/dk)^2 \sigma_{k}^2 + (dCa/dFa_0)^2 \sigma_{Fa_0}^2 + (dCa/dV)^2 \sigma_{V}^2}\)

We know the \(\sigma_i\) for each input; we just need those partial derivatives. However, we only have the implicit function we used to solve for Ca, and I do not want to do the algebra to solve for Ca explicitly. Luckily, we previously worked out how to get these derivatives from an implicit function using autograd: if \(f(Ca, p)=0\) defines Ca, then \(dCa/dp = -(\partial f/\partial p)/(\partial f/\partial Ca)\). We just need to loop through the arguments, get the relevant derivatives, and accumulate the products of the squared derivatives and squared errors. Finally, take the square root of that sum.

import autograd.numpy as np
from autograd import grad

# these are the uncertainties on the inputs
s = [None, 0.1, 0.2, 0.05, 100]

S2 = 0.0

dfdCa = grad(func, 0)
for i in range(1, 5):
    dfdarg2 = grad(func, i)
    dCadarg2 = -dfdarg2(Ca, v0, k, Fa0, V) / dfdCa(Ca, v0, k, Fa0, V)
    S2 += dCadarg2**2 * s[i]**2

Ca_s = np.sqrt(S2)
print(f'Ca = {Ca:1.5f} +/- {Ca_s}')
Ca = 0.00500 +/- 0.00016776432898276802


That is the same uncertainty estimate the uncertainties package provided. One benefit here is I did not have to do the somewhat complicated wrapping procedure around fsolve that was required with uncertainties to get this. On the other hand, I did have to derive the formula and implement it. It worked fine here, since we have an implicit function and a way to get the required derivatives. It could take some work to do this with the exit concentration of a PFR, which requires an integrator. Maybe that differentiable integrator will come in handy again!
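
For comparison, here is roughly what the wrapping procedure with the uncertainties package looks like (a sketch from memory, not the original 2013 code; uncertainties.wrap propagates the input uncertainties through the float-only function numerically):

import uncertainties as u
from scipy.optimize import fsolve

@u.wrap
def solve_Ca(v0, k, Fa0, V):
    # Solve the same implicit mole balance with plain floats; wrap()
    # handles the uncertainty propagation for us.
    Ca, = fsolve(func, 0.1 * Fa0 / v0, args=(v0, k, Fa0, V))
    return Ca

print(solve_Ca(u.ufloat(10.0, 0.1), u.ufloat(3.0, 0.2),
               u.ufloat(5.0, 0.05), u.ufloat(66000.0, 100.0)))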

Copyright (C) 2018 by John Kitchin. See the License for information about copying.

org-mode source

Org-mode version = 9.1.14


Constrained optimization with Lagrange multipliers and autograd

| categories: autograd, optimization | tags:

Constrained optimization is common in engineering problem solving. A prototypical example (from Greenberg, Advanced Engineering Mathematics, Ch 13.7) is to find the point on a plane that is closest to the origin. The plane is defined by the equation \(2x - y + z = 3\), and we seek to minimize \(x^2 + y^2 + z^2\) subject to the equality constraint defined by the plane. scipy.optimize.minimize provides a pretty convenient interface to solve a problem like this, as shown here.

import numpy as np
from scipy.optimize import minimize

def objective(X):
    x, y, z = X
    return x**2 + y**2 + z**2

def eq(X):
    x, y, z = X
    return 2 * x - y + z - 3

sol = minimize(objective, [1, -0.5, 0.5], constraints={'type': 'eq', 'fun': eq})
sol
    fun: 1.5
    jac: array([ 2.00000001, -0.99999999,  1.00000001])
message: 'Optimization terminated successfully.'
   nfev: 5
    nit: 1
   njev: 1
 status: 0
success: True
      x: array([ 1. , -0.5,  0.5])

I like the minimize function a lot, although I am not crazy about how the constraints are provided. An older alternative interface had one argument for equality constraints and another for inequality constraints. Or, analogous to scipy.integrate.solve_ivp event functions, they could have used function attributes.
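
Here is a sketch of that function-attribute idea (hypothetical; scipy's minimize does not actually support this interface):

# Hypothetical interface sketch: attach the constraint type to the
# function itself, like solve_ivp's event.terminal attribute.
def eq(X):
    x, y, z = X
    return 2 * x - y + z - 3

eq.type = 'eq'

# One might then imagine calling:
# minimize(objective, [1, -0.5, 0.5], constraints=[eq])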

Sometimes it might be desirable to go back to basics though, especially if you are unaware of the minimize function, or perhaps you suspect it is not working right and want an independent answer. Next we look at how to construct this constrained optimization problem using Lagrange multipliers. This converts the problem into an augmented unconstrained optimization problem we can use fsolve on. The gist of this method is that we formulate a new problem:

\(F(X) = f(X) - \lambda g(X)\)

and then solve the resulting simultaneous equations:

\(F_x(X) = F_y(X) = F_z(X) = g(X) = 0\), where \(F_x\) is the derivative of \(F\) with respect to \(x\), and \(g(X)\) is the equality constraint written so it is equal to zero. Since we end up with four equations that equal zero, we can simply use fsolve to get the solution. Many years ago I used a finite difference approximation for the derivatives. Today we use autograd to get the desired derivatives. Here it is.

import autograd.numpy as np
from autograd import grad

def F(L):
    'Augmented Lagrange function'
    x, y, z, _lambda = L
    return objective([x, y, z]) - _lambda * eq([x, y, z])

# Gradients of the Lagrange function
dfdL = grad(F, 0)

# Find L that returns all zeros in this function.
def obj(L):
    x, y, z, _lambda = L
    dFdx, dFdy, dFdz, dFdlam = dfdL(L)
    return [dFdx, dFdy, dFdz, eq([x, y, z])]

from scipy.optimize import fsolve
x, y, z, _lam = fsolve(obj, [0.0, 0.0, 0.0, 1.0])
print(f'The answer is at {x, y, z}')
The answer is at (1.0, -0.5, 0.5)


That is the same answer as before. Note we have still relied on a black-box solver inside fsolve (instead of inside minimize), but it might be clearer what problem we are solving (e.g. finding zeros). It takes a bit more work to set this up, since we have to construct the augmented function, but autograd makes it pretty convenient to set up the final objective function we want to solve.

How do we know we are at a minimum? We can check that the Hessian of the original function we wanted to minimize is positive definite. You can see here that the array is positive definite, i.e. all the eigenvalues are positive. autograd makes this easy too.

from autograd import hessian
h = hessian(objective, 0)
h(np.array([x, y, z]))
array([[ 2.,  0.,  0.],
       [ 0.,  2.,  0.],
       [ 0.,  0.,  2.]])

In case it isn't evident from that structure that the eigenvalues are all positive, here we compute them:

np.linalg.eig(h(np.array([x, y, z])))[0]
array([ 2.,  2.,  2.])

In summary, autograd continues to enable advanced engineering problems to be solved.

Copyright (C) 2018 by John Kitchin. See the License for information about copying.

org-mode source

Org-mode version = 9.1.14
