You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The concept of a dataset is starting to become an overloaded term. It could mean the following:
A BigQuery dataset which is a collection of tables. This is the original definition we based the datasets folder from.
A collection of datasets can also be called a dataset. e.g. the Vizgen dataset, which includes the Mouse Brain Map dataset.
The other way applies just as well: a subset of larger dataset/s can also be called a dataset. e.g. the Mouse Brain Map dataset which is part of the Vizgen dataset
Plus, in the future, we can expect pipelines that need to onboard multiple datasets in one go. Such a concept is difficult to align using the current hierarchy.
Proposed
The proposal here is to switch from using the datasets/DATASET/PIPELINE hierarchy into the pipelines/PIPELINE_GROUP/PIPELINE hierarchy.
Description
The concept of a dataset is starting to become an overloaded term. It could mean the following:
datasets
folder from.Plus, in the future, we can expect pipelines that need to onboard multiple datasets in one go. Such a concept is difficult to align using the current hierarchy.
Proposed
The proposal here is to switch from using the
datasets/DATASET/PIPELINE
hierarchy into thepipelines/PIPELINE_GROUP/PIPELINE
hierarchy.Checklist
The text was updated successfully, but these errors were encountered: