Skip to content

Latest commit

 

History

History
57 lines (47 loc) · 4.09 KB

index.md

File metadata and controls

57 lines (47 loc) · 4.09 KB
title layout classes author_profile header excerpt feature_row
BLASTNet
splash
wide
false
overlay_image
/assets/img/blastnet_banner.png
Bearable Large Accessible Scientific Training Network-of-Datasets
title btn_class btn_label url excerpt
Publications
btn--primary
Cite us
/pub
<nav><ul><li><a href="https://zenodo.org/records/8034232">BLASTNet Zenodo</a></li><li><a href="https://arxiv.org/abs/2309.13457">2023 NeurIPS paper</a></li><li><a href="https://www.sciencedirect.com/science/article/pii/S2666352X22000309?via%3Dihub">2023 AECS paper</a></li><li><a href="https://openreview.net/forum?id=LxGTZM7L6qn">2022 ICML W. paper</a></li></ul></nav>
title url btn_class btn_label excerpt
Data
/datasets
btn--primary
Details
<nav><ul> <li><a href="/assets/bash/batch_download_v2.1.sh" title="Bash script; uses Kaggle API">Download all BLASTNet data</a></li> <li><a href="/assets/bash/mom_download.sh" title="Bash script; uses Kaggle API">Download Momentum128 3D SR Dataset</a></li> <li><a href="https://www.kaggle.com/datasets/waitongchung/blastnet-momentum-3d-sr-dataset">Kaggle repository for Momentum128 3D SR Dataset</a></li> </ul></nav>
title excerpt
Code and Tutorials
<nav><ul> <li><a href="https://www.kaggle.com/code/waitongchung/smallhit-browsedata" title="Kaggle notebook">Browse data and learn our formats</a></li> <li><a href="https://www.kaggle.com/code/waitongchung/smallhit-tfmultigpu" title="Kaggle notebook">Multi-GPU training with BLASTNet data</a></li> <li><a href="https://github.com/blastnet/kaggle_tutorials">GitHub repository for all Kaggle tutorials</a></li> <li><a href="https://colab.research.google.com/drive/1_dk1IPQsOK-Dxq9VYSci4lB_LOyVCb5c?usp=sharing" title="Colab notebook">Using BLASTNet outside Kaggle envs</a></li> <li><a href="https://blastnet.github.io/tutorial.html#kaggle-command-line-api">Kaggle CLI quick-start instructions</a></li> </ul></nav>

Mission

summary

BLASTNet aims to address gaps in open machine learning (ML) within the sciences, specifically fluid mechanics by providing researchers in reacting and non-reacting flow physics communities with (mostly) externally contributed open-source ML resources.

This data is useful for fluid flows in a wide range of ML applications tied to automotive, propulsion, energy, and the environment. Specifically, scientific engineering tasks related to these domains may include turbulent closure modeling, spatio-temporal modeling, and inverse modeling.

These contributions now include (i) 4.8 TB of high-fidelity simulation datasets that have been processed in a convenient format for ML applications, (ii) >13,000 lines of code that aid the training and evaluating of these models, (iii) >100 pre-trained weights in flow physics problems, and (iv) regular workshop events that disseminate ML for flow physics via seminars and competitions.

Distribution

Our ML resources are shared via GitHub and Kaggle. Specifically, code is shared via GitHub, while data and models are shared via Kaggle.

To circumvent Kaggle storage constraints, we partition our data into a network of <100 GB subsets, with each subset containing a separate simulation configuration. This partitioned data can then be uploaded as separate datasets on Kaggle. To download all cases via Kaggle API, download this bash script. Summaries of the data are available here!

Our network of datasets approach: approach

{% include feature_row %}