
Provide insights on the columns with stats collected #6

Open
MrPowers opened this issue Mar 8, 2023 · 3 comments

@MrPowers
Collaborator

MrPowers commented Mar 8, 2023

Some Delta implementations only collect stats on certain columns.

Here are some questions that new APIs could help users answer when debugging:

  • Which columns are stats being collected for?
  • For a given column, are stats being collected in all the files?
  • Which columns have stats collected for some files, but not for others?

The get_add_actions API should make this possible.
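
A rough sketch of what such a helper could look like, to make the idea concrete. It assumes the add actions have already been flattened into a list of dicts, one per file, with a `path` key and per-column stats under keys like `min.<column>`, `max.<column>`, and `null_count.<column>`. The function name `stats_coverage`, the `all_columns` parameter, and the shape of the returned report are all hypothetical, not an existing API.

```python
from collections import defaultdict

# Assumed key prefixes for per-column stats in flattened add actions.
STAT_PREFIXES = ("min.", "max.", "null_count.")


def stats_coverage(add_actions, all_columns=None):
    """Summarize, per column, how many files carry stats for it.

    add_actions: list of dicts, one per data file (hypothetical input shape).
    all_columns: optional list of table columns, so that columns with no
                 stats in any file still show up in the report.
    """
    files_with_stats = defaultdict(set)
    all_paths = set()

    for action in add_actions:
        path = action["path"]
        all_paths.add(path)
        for key, value in action.items():
            for prefix in STAT_PREFIXES:
                # A column counts as covered in a file if any stat is present.
                if key.startswith(prefix) and value is not None:
                    files_with_stats[key[len(prefix):]].add(path)

    columns = set(all_columns or ()) | set(files_with_stats)
    report = {}
    for column in sorted(columns):
        covered = files_with_stats.get(column, set())
        report[column] = {
            "files_with_stats": len(covered),
            "total_files": len(all_paths),
            "missing_in": sorted(all_paths - covered),
        }
    return report
```

A column with `files_with_stats == total_files` answers the second question, and a non-empty `missing_in` alongside a non-zero `files_with_stats` answers the third. Returning a plain dict keeps the output usable from other programs and still easy to print.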

@robertkossendey
Collaborator

Great idea. Do we want to return the result as an object, so the user can work with the output programmatically, or just print it?

@MrPowers
Collaborator Author

MrPowers commented Mar 8, 2023

@robertkossendey - I generally prefer outputting results programmatically, so users can use the functions in programs. There are some exceptions, but I think computer-readable output (that's also human friendly) is usually the best goal.

@robertkossendey
Collaborator

@MrPowers you can assign me here, I'd like to work on this.
