
Create a torch.nn.Module for the composition weights #269

Closed
PicoCentauri opened this issue Jun 21, 2024 · 1 comment · Fixed by #280
Labels
Discussion Issues to be discussed by the contributors Infrastructure: Miscellaneous General infrastructure issues Priority: Medium Important issues to address after high priority.

Comments

@PicoCentauri
Contributor

Composition weights are used by almost every architecture to subtract the energy contribution arising from the chemical composition of the dataset, making it easier for the actual architecture to learn the targets. In metatrain, there is a utility function to compute these composition weights:

https://github.com/lab-cosmo/metatrain/blob/40d4d6a30d9add0fa4d1e9d2ce9e2635423fcc48/src/metatrain/utils/composition.py#L9
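As a rough illustration of what such a utility does (a hedged sketch, not the actual metatrain code): the composition weights are essentially a least-squares fit of per-structure energies against per-structure atomic counts.

```python
import torch

def composition_weights(counts: torch.Tensor, energies: torch.Tensor) -> torch.Tensor:
    """Sketch of a composition-weight fit.

    counts: (n_structures, n_atomic_types) matrix of atom counts per structure
    energies: (n_structures,) target energies
    """
    # find w minimizing ||counts @ w - energies||
    return torch.linalg.lstsq(counts, energies.reshape(-1, 1)).solution.squeeze(1)

# two toy structures: H2O (2 H, 1 O) and H2O2 (2 H, 2 O)
counts = torch.tensor([[2.0, 1.0], [2.0, 2.0]])
energies = torch.tensor([-76.0, -151.0])
weights = composition_weights(counts, energies)

# the residual energies are what the architecture actually has to learn
residual = energies - counts @ weights
```

After subtracting the composition energy, the residual is close to zero for this toy data, which is exactly why the targets become easier to learn.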

However, so far each architecture has to apply these composition weights on its own by looping over the Systems and creating the output TensorBlocks. See for example the GAP architecture:

https://github.com/lab-cosmo/metatrain/blob/40d4d6a30d9add0fa4d1e9d2ce9e2635423fcc48/src/metatrain/experimental/gap/model.py#L242
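To illustrate the boilerplate each architecture currently repeats, here is a hedged, self-contained sketch of the manual pattern (plain tensors of atomic numbers stand in for Systems; the names are illustrative, not the metatrain API):

```python
import torch

# one fitted weight per atomic type (H, O), e.g. from a least-squares fit
atomic_types = [1, 8]
weights = torch.tensor([-0.5, -75.0])

# stand-ins for Systems: each "system" is just its list of atomic numbers
systems = [torch.tensor([1, 1, 8]), torch.tensor([1, 1, 8, 8])]

# the per-architecture loop this issue wants to factor out
composition_energy = torch.zeros(len(systems))
for i, types in enumerate(systems):
    for j, atomic_type in enumerate(atomic_types):
        composition_energy[i] += weights[j] * (types == atomic_type).sum()
# each architecture then adds composition_energy back onto its raw predictions
```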

I think it would be useful to create a torch.nn.Module for the CompositionEnergy to make this essential part of an architecture easier and reusable for developers. Even though we could, I wouldn't expose this as a public architecture to users right now. The idea of a Module is also in line with the short-range Module discussed in #265, where we concluded to also create a torch.nn.Module.

My idea for the design below is basically copied from the metatensor atomistic tutorial.

from typing import Dict, List, Optional, Union

import torch

from metatensor.torch import Labels, TensorBlock, TensorMap
from metatensor.torch.atomistic import ModelOutput, System

# plus an import of metatrain's Dataset for the compute_weights() signature


class CompositionEnergy(torch.nn.Module):

    def __init__(self, atomic_types):
        super().__init__()
        self.atomic_types = atomic_types

    def compute_weights(
        self, datasets: Union[Dataset, List[Dataset]], property: str
    ) -> None:
        """Calculate the composition weights for a dataset."""
        # basically the code from
        # `metatrain.utils.composition.calculate_composition_weights()`
        ...

    def forward(
        self,
        systems: List[System],
        outputs: Dict[str, ModelOutput],
        selected_atoms: Optional[Labels] = None,
    ) -> Dict[str, TensorMap]:
        # if the model user did not request an energy calculation, we have nothing to do
        if "energy" not in outputs:
            return {}

        # we don't want to worry about selected_atoms yet
        if selected_atoms is not None:
            raise NotImplementedError("selected_atoms is not implemented")

        if outputs["energy"].per_atom:
            raise NotImplementedError("per atom energy is not implemented")

        # compute the energy for each system by adding together the energy for each atom
        energy = torch.zeros((len(systems), 1), dtype=systems[0].positions.dtype)
        for i, system in enumerate(systems):
            energy[i] += 0.0  # do actual calculations here

        # Add metadata to the output
        block = TensorBlock(
            values=energy,
            samples=Labels("system", torch.arange(len(systems)).reshape(-1, 1)),
            components=[],
            properties=Labels("energy", torch.tensor([[0]])),
        )
        return {
            "energy": TensorMap(keys=Labels("_", torch.tensor([[0]])), blocks=[block])
        }

Once we have this Module, an architecture only has to call the compute_weights function once to store the weights and can then use the forward function to apply the composition energies.
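For illustration, a minimal, self-contained stand-in of this workflow (plain tensors replace the metatrain Datasets and metatensor Systems, and everything besides the compute_weights/forward pattern is an assumption for the sake of the example):

```python
from typing import List

import torch

class CompositionEnergy(torch.nn.Module):
    """Toy stand-in for the proposed Module, operating on plain tensors."""

    def __init__(self, atomic_types: List[int]):
        super().__init__()
        self.atomic_types = atomic_types
        # store the weights on the Module so they move with .to()/state_dict()
        self.register_buffer("weights", torch.zeros(len(atomic_types)))

    def compute_weights(self, counts: torch.Tensor, energies: torch.Tensor) -> None:
        # least-squares fit, standing in for calculate_composition_weights()
        self.weights = torch.linalg.lstsq(
            counts, energies.reshape(-1, 1)
        ).solution.squeeze(1)

    def forward(self, systems: List[torch.Tensor]) -> torch.Tensor:
        # each "system" is just a tensor of atomic numbers in this sketch
        energy = torch.zeros(len(systems))
        for i, types in enumerate(systems):
            for j, atomic_type in enumerate(self.atomic_types):
                energy[i] += self.weights[j] * (types == atomic_type).sum()
        return energy

module = CompositionEnergy(atomic_types=[1, 8])
module.compute_weights(
    counts=torch.tensor([[2.0, 1.0], [2.0, 2.0]]),  # H2O, H2O2
    energies=torch.tensor([-76.0, -151.0]),
)
pred = module(systems=[torch.tensor([1, 1, 8])])  # composition energy of H2O
```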

@PicoCentauri PicoCentauri added Priority: Medium Important issues to address after high priority. Discussion Issues to be discussed by the contributors Infrastructure: Miscellaneous General infrastructure issues labels Jun 21, 2024
@frostedoyster
Collaborator

frostedoyster commented Jun 21, 2024

I like it. It would be cool if, for such a lightweight component of a model, we could avoid recomputing the labels at each iteration (but this is an implementation detail)
