Skip to content

Physical Encoding Improves OOD Performance in Deep Learning Materials Property Prediction

Notifications You must be signed in to change notification settings

funihang/PhysicalEncoding

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PhysicalEncoding

Physical Encoding Improves OOD Performance in Deep Learning Materials Property Prediction

Nihang Fu, Sadman Sadeed Omee, Jianjun Hu*
Machine Learning and Evolution Laboratory
Department of Computer Science and Engineering
University of South Carolina

Table of Contents

Requirements

Packages requirements

pip install pymatgen
pip install matminer

Datasets

Datasets are from MatBench and Matminer. After processing, we can get datasets as shown below. There are composition- and structure-based datasets. For each, we use different OOD selection methods. The size of each dataset for PV, ER1, ER2, and FF selection methods are shown in the figure. The first column lists the dataset names; the second column indicates the number of materials used in the training process (the training and the ID test sets); the third column specifies the number of materials in the OOD sets. Datasets

Acknowledgement

Datasets in this paper are from MatBench and Matminer.

@article{dunn2020benchmarking,
  title={Benchmarking materials property prediction methods: the Matbench test set and Automatminer reference algorithm},
  author={Dunn, Alexander and Wang, Qi and Ganose, Alex and Dopp, Daniel and Jain, Anubhav},
  journal={npj Computational Materials},
  volume={6},
  number={1},
  pages={138},
  year={2020},
  publisher={Nature Publishing Group UK London}
}

@article{ward2018matminer,
  title={Matminer: An open source toolkit for materials data mining},
  author={Ward, Logan and Dunn, Alexander and Faghaninia, Alireza and Zimmermann, Nils ER and Bajaj, Saurabh and Wang, Qi and Montoya, Joseph and Chen, Jiming and Bystrom, Kyle and Dylla, Maxwell and others},
  journal={Computational Materials Science},
  volume={152},
  pages={60--69},
  year={2018},
  publisher={Elsevier}
}

Experiments in this paper are based on a composition-based network (Roost) and a structure-based network (ALIGNN). You can check source codes here: Roost and ALIGNN

Cite our work



Contact

If you have any problems, please feel free to reach out to us at [email protected].

About

Physical Encoding Improves OOD Performance in Deep Learning Materials Property Prediction

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published