Commit

Merge pull request #85 from choderalab/improve-doc
Improve doc
yuanqing-wang authored Oct 6, 2021
2 parents b2f2b67 + ad2fd04 commit 83b9c93
Showing 6 changed files with 127 additions and 9 deletions.
38 changes: 38 additions & 0 deletions .github/workflows/sphinx.yml
@@ -0,0 +1,38 @@
name: "Build Doc"
on:
- push

jobs:
docs:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v2
- name: Set up Python 3.7
uses: actions/setup-python@v2
with:
python-version: 3.7

- uses: conda-incubator/setup-miniconda@v2
with:
installer-url: ${{ matrix.conda-installer }}
python-version: ${{ matrix.python-version }}
activate-environment: test
channel-priority: true
environment-file: devtools/conda-envs/espaloma.yaml
auto-activate-base: false
use-mamba: true

- name: Install package
shell: bash -l {0}
run: |
python -m pip install --no-deps .
- name: Compile
shell: bash -l {0}
run: |
python -m pip install sphinx sphinx-rtd-theme numpydoc
cd docs && make html
- name: Deploy
uses: peaceiris/actions-gh-pages@v3
with:
github_token: ${{ secrets.GITHUB_TOKEN }}
publish_dir: docs/_build/html
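
For local verification, the Compile step above can be reproduced outside CI. Below is a minimal sketch using Sphinx's Python build entry point (it assumes the same dependencies the workflow installs, namely sphinx, sphinx-rtd-theme, and numpydoc, plus the repository's docs/ layout; the file name build_docs.py is purely illustrative):

    # build_docs.py: illustrative local equivalent of the "Compile" step above
    import sys

    from sphinx.cmd.build import build_main

    # same as running: sphinx-build -b html docs docs/_build/html
    exit_code = build_main(["-b", "html", "docs", "docs/_build/html"])
    sys.exit(exit_code)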
19 changes: 12 additions & 7 deletions README.md
@@ -1,20 +1,25 @@
espaloma
espaloma: **E**xtensible **S**urrogate **P**otenti**al** **O**ptimized by **M**essage-passing **A**lgorithms
==============================
[//]: # (Badges)
[![CI](https://github.com/choderalab/espaloma/actions/workflows/CI.yaml/badge.svg?branch=master)](https://github.com/choderalab/espaloma/actions/workflows/CI.yaml)
[![Total alerts](https://img.shields.io/lgtm/alerts/g/choderalab/espaloma.svg?logo=lgtm&logoWidth=18)](https://lgtm.com/projects/g/choderalab/espaloma/alerts/)
[![Language grade: Python](https://img.shields.io/lgtm/grade/python/g/choderalab/espaloma.svg?logo=lgtm&logoWidth=18)](https://lgtm.com/projects/g/choderalab/espaloma/context:python)
[![docs stable](https://img.shields.io/badge/docs-stable-5077AB.svg?logo=read%20the%20docs)](https://www.espaloma.wangyq.net/)

Extensible Surrogate Potential of Ab initio Learned and Optimized by Message-passing Algorithms

Rather than:
Source code for [Wang Y, Fass J, and Chodera JD "End-to-End Differentiable Construction of Molecular Mechanics Force Fields."](https://arxiv.org/abs/2010.01196)

molecule ---(atom typing schemes)---> atom-types ---(atom typing schemes)---> bond-, angle-, torsion-types ---(table lookup)---> force field parameters
![abstract](docs/_static/espaloma_abstract_v2-2.png)

we want to have

molecule ---(graph nets)---> atom-embedding ---(pooling)---> hypernode-embedding ---(feedforward neural networks)---> force field parameters
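
The minimal example added to `docs/getting_started.rst` in this commit spells these stages out with the espaloma API; the following is an abbreviated sketch (the SAGEConv layer and 128-unit widths are the values used there, not requirements):

    import torch
    import espaloma as esp

    # stage I: molecular graph -> per-atom embeddings (graph nets)
    representation = esp.nn.Sequential(
        layer=esp.nn.layers.dgl_legacy.gn("SAGEConv"),
        config=[128, "relu", 128, "relu", 128, "relu"],
    )

    # stages II & III: pool atom embeddings into bond/angle/torsion hypernodes
    # and read out MM parameters with feedforward networks
    readout = esp.nn.readout.janossy.JanossyPooling(
        in_features=128,
        config=[128, "relu", 128, "relu", 128, "relu"],
        out_features={2: {"coefficients": 2}, 3: {"coefficients": 3}, 4: {"k": 6}},
    )

    espaloma_model = torch.nn.Sequential(representation, readout)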

# Paper Abstract
Molecular mechanics (MM) potentials have long been a workhorse of computational chemistry.
Leveraging accuracy and speed, these functional forms find use in a wide variety of applications in biomolecular modeling and drug discovery, from rapid virtual screening to detailed free energy calculations.
Traditionally, MM potentials have relied on human-curated, inflexible, and poorly extensible discrete chemical perception rules (_atom types_) for applying parameters to small molecules or biopolymers, making it difficult to optimize both types and parameters to fit quantum chemical or physical property data.
Here, we propose an alternative approach that uses _graph neural networks_ to perceive chemical environments, producing continuous atom embeddings from which valence and nonbonded parameters can be predicted using invariance-preserving layers.
Since all stages are built from smooth neural functions, the entire process---spanning chemical perception to parameter assignment---is modular and end-to-end differentiable with respect to model parameters, allowing new force fields to be easily constructed, extended, and applied to arbitrary molecules.
We show that this approach is not only sufficiently expressive to reproduce legacy atom types, but that it can learn and extend existing molecular mechanics force fields, construct entirely new force fields applicable to both biopolymers and small molecules from quantum chemical calculations, and even learn to accurately predict free energies from experimental observables.

# Manifest

@@ -48,6 +53,6 @@ This software is licensed under [MIT license](https://opensource.org/licenses/MI

Copyright (c) 2020, Chodera Lab at Memorial Sloan Kettering Cancer Center and Authors:
Authors:
- Yuanqing Wang
- [Yuanqing Wang](http://www.wangyq.net)
- Josh Fass
- John D. Chodera
3 changes: 3 additions & 0 deletions devtools/conda-envs/espaloma.yaml
@@ -4,6 +4,7 @@ channels:
- dglteam
- openeye
- defaults
- anaconda
dependencies:
# Base dependencies
- python
@@ -29,3 +30,5 @@ dependencies:
- nose-timer
- coverage
- qcportal
- sphinx
- sphinx_rtd_theme
Binary file added docs/_static/espaloma_abstract_v2-2.png
2 changes: 1 addition & 1 deletion docs/conf.py
@@ -16,7 +16,7 @@
import os
import sys

sys.path.insert(0, os.path.abspath('../espaloma'))
sys.path.insert(0, os.path.abspath('..'))

import espaloma
from espaloma import mm, nn, data, graphs
74 changes: 73 additions & 1 deletion docs/getting_started.rst
@@ -1,4 +1,76 @@
Getting Started
===============

This page details how to get started with espaloma.
.. image:: _static/espaloma_abstract_v2-2.png

Paper Abstract
--------------
Molecular mechanics (MM) potentials have long been a workhorse of computational chemistry.
Leveraging accuracy and speed, these functional forms find use in a wide variety of applications in biomolecular modeling and drug discovery, from rapid virtual screening to detailed free energy calculations.
Traditionally, MM potentials have relied on human-curated, inflexible, and poorly extensible discrete chemical perception rules (*atom types*) for applying parameters to small molecules or biopolymers, making it difficult to optimize both types and parameters to fit quantum chemical or physical property data.
Here, we propose an alternative approach that uses *graph neural networks* to perceive chemical environments, producing continuous atom embeddings from which valence and nonbonded parameters can be predicted using invariance-preserving layers.
Since all stages are built from smooth neural functions, the entire process---spanning chemical perception to parameter assignment---is modular and end-to-end differentiable with respect to model parameters, allowing new force fields to be easily constructed, extended, and applied to arbitrary molecules.
We show that this approach is not only sufficiently expressive to reproduce legacy atom types, but that it can learn and extend existing molecular mechanics force fields, construct entirely new force fields applicable to both biopolymers and small molecules from quantum chemical calculations, and even learn to accurately predict free energies from experimental observables.

Minimal Example
---------------
::

    import torch, dgl, espaloma as esp

    # retrieve QM dataset used to train OpenFF 1.0.0 ("parsley") small molecule force field
    dataset = esp.data.dataset.GraphDataset.load("parsley").view(batch_size=128)

    # define Espaloma stage I: graph -> atom latent representation
    representation = esp.nn.Sequential(
        layer=esp.nn.layers.dgl_legacy.gn("SAGEConv"),  # use SAGEConv implementation in DGL
        config=[128, "relu", 128, "relu", 128, "relu"],  # 3 layers, 128 units, ReLU activation
    )

    # define Espaloma stage II and III:
    # atom latent representation -> bond, angle, and torsion representation and parameters
    readout = esp.nn.readout.janossy.JanossyPooling(
        in_features=128,
        config=[128, "relu", 128, "relu", 128, "relu"],
        out_features={  # define modular MM parameters Espaloma will assign
            1: {"e": 1, "s": 1},
            2: {"coefficients": 2},  # bond linear combination
            3: {"coefficients": 3},  # angle linear combination
            4: {"k": 6},  # torsion barrier heights (can be positive or negative)
        },
    )

    # compose all three Espaloma stages into an end-to-end model
    espaloma_model = torch.nn.Sequential(
        representation,
        readout,
        esp.mm.geometry.GeometryInGraph(),
        esp.mm.energy.EnergyInGraph(),
        esp.nn.readout.charge_equilibrium.ChargeEquilibrium(),
    )

    # define training metrics
    metrics = [
        esp.metrics.GraphMetric(
            base_metric=torch.nn.MSELoss(),  # use mean-squared error loss
            between=["u", "u_ref"],  # between predicted and QM energies
            level="g",
        ),
        esp.metrics.GraphMetric(
            base_metric=torch.nn.MSELoss(),  # use mean-squared error loss
            between=["q", "q_hat"],  # between predicted and reference charges
            level="n1",
        ),
    ]

    # fit Espaloma model to training data
    results = esp.Train(
        ds_tr=dataset, net=espaloma_model, metrics=metrics,
        device=torch.device("cuda:0"), n_epochs=5000,
        optimizer=lambda net: torch.optim.Adam(net.parameters(), 1e-3),  # use Adam optimizer
    ).run()
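
Once trained, the model can be used to parameterize new molecules. The snippet below is a hypothetical sketch rather than part of the example above: it assumes an ``esp.Graph`` constructor accepting a SMILES string and an ``esp.graphs.deploy.openmm_system_from_graph`` helper, so the exact entry points should be checked against the API reference::

    # hypothetical deployment sketch (verify these entry points against the API docs)
    molecule = esp.Graph("c1ccccc1")      # build an espaloma graph from a SMILES string
    espaloma_model(molecule.heterograph)  # forward pass writes MM parameters into the graph
    system = esp.graphs.deploy.openmm_system_from_graph(molecule)  # export an OpenMM System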



