New exported format: JSON-LD metadata to explain the CSVs, using W3C Tabular Data (Basic implementation only) #36

Open
fititnt opened this issue Apr 29, 2022 · 0 comments
Labels
archiva-farmatis archīva fōrmātīs; /formats of files/@eng-Latn; About (new) data formats to package dictionaries

fititnt commented Apr 29, 2022


We already export CSVs, which can be described by more than one convention. The most popular one is frictionlessdata, mentioned in #35. The W3C approach started out based on frictionlessdata but diverged to be compatible with RDF, similar to what happened with JSON-LD (which can also be mapped directly to anything supported by RDF).

Basic implementation only

W3C Tabular Data is obviously more complex to implement than frictionlessdata, so in case things get complicated, this issue will focus on a basic implementation only. For example, if we share data in a way similar to what CLLD does (https://github.com/dictionaria/daakaka/blob/master/cldf/cldf-metadata.json), the "package" would require each group of dictionaries to also export the subset of other tables it uses (so that someone could import the full thing into a database).
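To make "basic implementation" more concrete, here is a minimal sketch of generating a W3C Tabular Data (CSVW) metadata document for a single CSV, using only the header row. The function name `csvw_metadata`, the file name `example.csv`, and the sample headers are assumptions for illustration, not part of the current exporter. Every column is typed as `string`, since inferring datatypes is beyond a basic implementation. CSVW column names are more restricted than arbitrary header text, so the sketch derives a sanitized `name` and keeps the original header in `titles`.

```python
import csv
import io
import json
import re


def csvw_metadata(csv_text: str, csv_url: str) -> dict:
    """Build a minimal CSVW metadata document for one CSV file.

    Only the header row is read; every column is declared as "string".
    """
    header = next(csv.reader(io.StringIO(csv_text)))
    columns = []
    for index, title in enumerate(header, start=1):
        # Assumption: replace anything that is not alphanumeric with "_"
        # and trim leading/trailing "_" to get a safe column name.
        name = re.sub(r"[^A-Za-z0-9]+", "_", title).strip("_") or f"col{index}"
        columns.append({"name": name, "titles": title, "datatype": "string"})
    return {
        "@context": "http://www.w3.org/ns/csvw",
        "url": csv_url,
        "tableSchema": {"columns": columns},
    }


if __name__ == "__main__":
    # Hypothetical sample data; real exports would be read from disk.
    sample = "#item+conceptum+codicem,#item+rem+i_eng+is_latn\n1,example\n"
    print(json.dumps(csvw_metadata(sample, "example.csv"), indent=2))
```

By CSVW convention the result would be saved next to the CSV as `example.csv-metadata.json`, so consumers can discover it from the CSV location.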

Note that it may already be possible to convert frictionlessdata to W3C Tabular Data, so we may implement at least one way to do it.
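As a rough illustration of what such a conversion could look like, the sketch below maps an already-parsed frictionlessdata `datapackage.json` (as a Python dict) to a CSVW table group. The function `datapackage_to_csvw` and the `TYPE_MAP` table are hypothetical and deliberately partial: only a handful of field types are mapped, anything else falls back to `string`, each resource `path` is assumed to be a single string, and foreign keys, dialects, and constraints are ignored.

```python
import json

# Partial, assumed mapping from Frictionless field types to CSVW datatypes.
# Types not listed here fall back to "string".
TYPE_MAP = {
    "string": "string",
    "integer": "integer",
    "number": "number",
    "boolean": "boolean",
    "date": "date",
    "datetime": "datetime",
    "time": "time",
}


def datapackage_to_csvw(package: dict) -> dict:
    """Convert a parsed datapackage.json dict to a minimal CSVW table group."""
    tables = []
    for resource in package.get("resources", []):
        schema = resource.get("schema", {})
        columns = [
            {
                "name": field["name"],
                "titles": field.get("title", field["name"]),
                "datatype": TYPE_MAP.get(field.get("type", "string"), "string"),
            }
            for field in schema.get("fields", [])
        ]
        table_schema = {"columns": columns}
        # Both formats accept a single column name or a list of names here.
        if "primaryKey" in schema:
            table_schema["primaryKey"] = schema["primaryKey"]
        tables.append({"url": resource["path"], "tableSchema": table_schema})
    return {"@context": "http://www.w3.org/ns/csvw", "tables": tables}


if __name__ == "__main__":
    # Hypothetical package; names and paths are placeholders.
    package = {
        "name": "example",
        "resources": [
            {
                "path": "example.csv",
                "schema": {
                    "fields": [
                        {"name": "id", "type": "integer"},
                        {"name": "label", "type": "string"},
                    ],
                    "primaryKey": "id",
                },
            }
        ],
    }
    print(json.dumps(datapackage_to_csvw(package), indent=2))
```

Field names are copied as-is, so headers containing characters that CSVW does not allow in a column `name` would still need to be sanitized separately.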

About a catalog of all datasets: use a different approach (out of scope for this issue)

This would be better moved to a different issue. While frictionlessdata has a way to organize more than one dataset (a catalog, https://specs.frictionlessdata.io/patterns/#describing-data-package-catalogs-using-the-data-package-format; I'm not sure whether this is a new feature), there is no direct equivalent that uses the same strategy as this issue.

The closest option would likely be to emulate what CKAN does. At https://github.com/ckan/ckanext-dcat there are some drafts about this. However, CKAN implies there is some way to at least paginate the global list (which is obviously not viable with static generation). In the best case, I think either pushing data to a CKAN instance or fetching a static version of all datasets could be one way to synchronize dictionaries.

@fititnt fititnt added the archiva-farmatis archīva fōrmātīs; /formats of files/@eng-Latn; About (new) data formats to package dictionaries label Apr 29, 2022