
Compute forces on the IPU. #66

AlexanderMath opened this issue Sep 4, 2023 · 3 comments · May be fixed by #101

AlexanderMath commented Sep 4, 2023

nanoDFT computes forces on the CPU using def grad(..) on line 230. To run def grad(..) on the IPU, it is sufficient to port lines 269-273 and line 283.

Different strategies for porting lines 269-273:

  1. Compile libcint to Poplar and replace all mol.intor(..) calls with the corresponding Poplar calls (the ERI is the only problematic part).
  2. Use the JAX implementation from D4FT for the forward pass of mol.intor(..) and match up the jax.grad(..) of those forward passes with lines 269-273; see the sketch after this list (pyscfad already matched up libcint with jax.grad on CPU, so their code may be helpful).
  3. Reimplement all integrals from first principles in JAX/tessellate.
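
A minimal sketch of the consistency check behind strategy 2, using a toy 1D Gaussian overlap integral (overlap_1d and overlap_1d_grad_A are hypothetical helpers, not the real libcint integrals): jax.grad(..) of the JAX forward pass should reproduce the derivative that the explicit *_ip-style derivative integrals provide.

```python
import jax
import jax.numpy as jnp

def overlap_1d(A, B, a=0.8, b=1.2):
    # Overlap of two unnormalised 1D Gaussians exp(-a(x-A)^2) and exp(-b(x-B)^2),
    # via the Gaussian product theorem.
    p = a + b
    return jnp.sqrt(jnp.pi / p) * jnp.exp(-a * b / p * (A - B) ** 2)

def overlap_1d_grad_A(A, B, a=0.8, b=1.2):
    # Analytic derivative with respect to the first centre, i.e. the quantity an
    # explicit derivative-integral routine would return.
    p = a + b
    return overlap_1d(A, B, a, b) * (-2.0 * a * b / p) * (A - B)

A, B = 0.3, 1.1
autodiff = jax.grad(overlap_1d)(A, B)   # derivative obtained from the forward pass
analytic = overlap_1d_grad_A(A, B)      # derivative obtained from the explicit formula
assert jnp.allclose(autodiff, analytic)
```

The same kind of check against mol.intor(..)'s derivative integrals would validate a D4FT-based forward pass before wiring it into def grad(..).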

Note: Line 230 uses this theorem to compute gradients. We could use jax.grad(_nanoDFT) instead of the theorem. That would require fixing all calls inside _nanoDFT(..) which don't support derivatives. We currently believe the work involved is the same as fixing def grad(..) (see the strategies above). In other words: the calls inside _nanoDFT that don't support autodiff are exactly the ones whose derivatives are computed explicitly on lines 269-273 and 283.
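
A minimal sketch of the jax.grad route, assuming a toy energy that contains only the nuclear-nuclear repulsion term (already pure JAX); the real _nanoDFT would additionally need every mol.intor(..) call replaced by something differentiable, which is exactly the work listed above.

```python
import jax
import jax.numpy as jnp

def toy_energy(coords, charges):
    # Toy stand-in for _nanoDFT: nuclear-nuclear repulsion only, sum_{i<j} q_i q_j / r_ij.
    diff = coords[:, None, :] - coords[None, :, :]
    n = charges.shape[0]
    dist = jnp.sqrt(jnp.sum(diff ** 2, axis=-1) + jnp.eye(n))  # guard the diagonal
    pair = charges[:, None] * charges[None, :] / dist
    return 0.5 * jnp.sum(pair * (1.0 - jnp.eye(n)))

coords = jnp.array([[0.0, 0.0, 0.0], [0.0, 0.0, 1.4]])  # H2-like geometry (Bohr)
charges = jnp.array([1.0, 1.0])

forces = -jax.grad(toy_energy)(coords, charges)  # forces = -dE/dR, shape (2, 3)
print(forces)
```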

AlexanderMath commented Sep 4, 2023

Note: If we use autodiff/jax.grad we need to store "activations" during jax.lax.fori_loop(0, opts.its, nanoDFT_iteration, ..). We could use jax.checkpoint to store only the density_matrix of shape (N, N) in each iteration, and then, during backprop, keep all activations within a single iteration in memory. This leads to a peak memory consumption of roughly N^2 * num_iterations + floats_within_one_iteration.
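
A minimal sketch of that checkpointing pattern, with scf_like_iteration as a hypothetical stand-in for nanoDFT_iteration. Note that reverse-mode differentiation through jax.lax.fori_loop requires a static trip count (Python int bounds).

```python
import jax
import jax.numpy as jnp

N, num_iterations = 8, 20  # static trip count so fori_loop supports reverse-mode

def scf_like_iteration(density_matrix, hamiltonian):
    # Stand-in for one nanoDFT_iteration: build an effective Hamiltonian from the
    # current density matrix, diagonalise it, and assemble the new density matrix.
    fock = hamiltonian + 0.1 * density_matrix @ density_matrix
    _, vectors = jnp.linalg.eigh(fock)
    occupied = vectors[:, : N // 2]
    return 2.0 * occupied @ occupied.T

# jax.checkpoint around the loop body: only the (N, N) density_matrix carried between
# iterations is stored; activations inside one iteration are recomputed during backprop.
@jax.checkpoint
def body(i, carry):
    density_matrix, hamiltonian = carry
    return scf_like_iteration(density_matrix, hamiltonian), hamiltonian

def energy(hamiltonian):
    carry = (jnp.eye(N), hamiltonian)
    density_matrix, _ = jax.lax.fori_loop(0, num_iterations, body, carry)
    return jnp.sum(density_matrix * hamiltonian)  # toy scalar objective

H = jnp.diag(jnp.arange(N, dtype=jnp.float32))
print(jax.grad(energy)(H).shape)  # (8, 8): backprop through all 20 iterations
```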

Note: It takes 3x more memory to store ERI_grad = mol.intor("int2e_ip") because ERI_grad.shape = (3, N, N, N, N). ERI_grad is only used twice, see lines 244-245 (a similar einsum to how ERI is used):

```python
vj = - jnp.einsum('sijkl,lk->sij', ERI_grad, dm0)  # (3, N, N)
vk = - jnp.einsum('sijkl,jk->sil', ERI_grad, dm0)  # (3, N, N)
```

Since we only have "one iteration of einsums" (as opposed to ~20 in the forward pass), there is no advantage in storing ERI_grad. We might as well compute the needed entries on the fly during the einsums (see the sketch below). This may cause problems with the trick in #65. We may be able to reuse the sparsity pattern found in #63.
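
A minimal sketch of that blocked, on-the-fly contraction. Here eri_grad_slab (a hypothetical helper) just slices a random dense tensor so the example runs end to end; in practice each slab would be produced directly by the integral kernel, so the full (3, N, N, N, N) ERI_grad is never materialised.

```python
import jax
import jax.numpy as jnp

N, block = 16, 4
dm0 = jax.random.normal(jax.random.PRNGKey(0), (N, N))

# Dense reference tensor, only so the sketch is self-contained and checkable.
ERI_grad_full = jax.random.normal(jax.random.PRNGKey(1), (3, N, N, N, N))

def eri_grad_slab(i0, i1):
    # Placeholder: returns the (3, block, N, N, N) slab for orbitals i0:i1. A real
    # implementation would compute these entries on the fly instead of slicing.
    return ERI_grad_full[:, i0:i1]

vj = jnp.zeros((3, N, N))
vk = jnp.zeros((3, N, N))
for i0 in range(0, N, block):
    slab = eri_grad_slab(i0, i0 + block)
    # Same contractions as lines 244-245, restricted to one block of the i index.
    vj = vj.at[:, i0:i0 + block].set(-jnp.einsum('sijkl,lk->sij', slab, dm0))
    vk = vk.at[:, i0:i0 + block].set(-jnp.einsum('sijkl,jk->sil', slab, dm0))

# Agreement with the dense contraction over the full ERI_grad tensor.
assert jnp.allclose(vj, -jnp.einsum('sijkl,lk->sij', ERI_grad_full, dm0), atol=1e-4)
assert jnp.allclose(vk, -jnp.einsum('sijkl,jk->sil', ERI_grad_full, dm0), atol=1e-4)
```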

AlexanderMath commented

Another implementation (watch out for the license): https://github.com/theochem/gbasis

hatemhelal linked a pull request on Sep 20, 2023 that will close this issue.