Purpose of the schema #4

Open
mslw opened this issue Nov 27, 2023 · 0 comments

On a high level, I think the main purpose of creating this schema is:

  • formalize the description of terms used to describe crc datasets
  • generate the documentation
  • centralize the key-to-IRI mapping in one place: documentation, discussion, and formal specification (currently the sfb docs provide the user-facing description, and the tabby-utils repo contains a slightly obscure IRI mapping, e.g. the tabby convention and some issues)

But on a practical level, what exactly should the schema describe, and what would it be used for in the catalog-generation workflow?

  • datalad-tabby
    • goal: describe the output of load_tabby, i.e. what comes out of a tabby spreadsheet after datalad-tabby adds a context?
    • usage: validate spreadsheets
  • tabby-utils
    • goal: describe the (desired) output of the further processing that is intended to go into the sfb catalog?
    • usage: validate catalog input

I think the two are not necessarily the same (there is some structure editing and enrichment from external lookups going on before passing the data to the catalog for a nice display).
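
To make the difference between the two targets concrete, here is a rough sketch in plain Python. Everything in it is an assumption for illustration: the key names, the `to_catalog_input` step, and the placeholder IRI are hypothetical and do not come from datalad-tabby, tabby-utils, or the catalog. It only shows that a record can satisfy one set of requirements and not the other.

```python
# Hypothetical required keys for each stage (illustration only,
# not taken from any existing schema).
LOADED_RECORD_KEYS = {"@context", "title", "authors"}
CATALOG_INPUT_KEYS = {"type", "name", "authors", "dataset_id"}


def missing_keys(record: dict, required: set) -> list:
    """Return the required keys that are absent from the record, sorted."""
    return sorted(required - set(record))


def to_catalog_input(record: dict) -> dict:
    """Stand-in for the restructuring/enrichment step (hypothetical)."""
    return {
        "type": "dataset",
        "name": record["title"],          # renamed key
        "authors": record["authors"],     # passed through unchanged
        "dataset_id": "example-id",       # would come from an external lookup
    }


# A record shaped like what might come out of loading a spreadsheet.
loaded = {
    "@context": {"title": "https://example.org/terms/title"},
    "title": "Example dataset",
    "authors": [{"name": "Jane Doe"}],
}

catalog_record = to_catalog_input(loaded)

print(missing_keys(loaded, LOADED_RECORD_KEYS))
print(missing_keys(loaded, CATALOG_INPUT_KEYS))
print(missing_keys(catalog_record, CATALOG_INPUT_KEYS))
```

If the two shapes really diverge like this, a single schema would either have to cover both stages explicitly or pick one of them as its target.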
