Skip to content
This repository has been archived by the owner on Jul 29, 2023. It is now read-only.

Commit

Permalink
updated data org
Browse files Browse the repository at this point in the history
  • Loading branch information
mattersoflight committed May 24, 2023
1 parent db96a52 commit 35763cb
Showing 1 changed file with 4 additions and 8 deletions.
12 changes: 4 additions & 8 deletions micro_dl/data_organization.md
Original file line number Diff line number Diff line change
@@ -1,10 +1,6 @@
# Data Organization for Virtual Staining

> This advisory only applies to data management on Biohub's compute infrastructure.
> It is not normative for external users.
Here we document the conventions for storing data, metadata, configs,
and models during the development of the virtual staining pipeline.
Here we document our conventions for storing data, metadata, configs, and models.

## Data flow in the pipeline

Expand Down Expand Up @@ -90,16 +86,16 @@ virtual_staining:
config.yaml
yyyymmdd-hhmmss:
...
# test select models
# Inference and/or Evaluation of selected models.
test:
# config for prediction with test dataset.
test_<suffix>.yml # follow the lightning CLI/config.
test_<suffix>.yml # config used for inference, optionally copies ground truth and input for evaluation. This config will follow the lightning CLI/config format.

# inference output on test dataset, may include copies of input and ground truth to facilitate visualization of model performance.
test_<suffix>.zarr # Not all test datasets need to have human curated ground truth.
...

# configs for evaluation: checkpoint path, test data path that have ground turth included, and choice of metrics.
# config for evaluation: checkpoint path, test data path that have ground turth included, and choice of metrics.
evaluation_<suffix>.yaml
...

Expand Down

0 comments on commit 35763cb

Please sign in to comment.