FeatureRequest: Automatically generated dataframe schemas to catch errors #384

Open
1 of 2 tasks
kenibrewer opened this issue Mar 27, 2024 · 0 comments · May be fixed by #383
Labels
enhancement New feature or request

Comments

@kenibrewer
Member

Feature type

  • Add new functionality

  • Change existing functionality

General description of the proposed functionality

Story: As a pycytominer user, I would like to receive more descriptive error messages about problems with my data. Pycytominer could automatically generate a DataframeSchema that checks that the column names I specified in arguments exist, and that there are no NaN or inf values in operations where those values cause errors. An error message that names the specific column and row containing the problematic values would make it much easier to work with large distributed datasets.
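
To make the request concrete, here is a minimal sketch of the kind of check described above. It uses plain pandas and NumPy, not a schema library and not pycytominer's actual API. The function names `find_bad_values` and `validate_features` and the column names in the usage note are hypothetical, chosen only for illustration.

```python
import numpy as np
import pandas as pd


def find_bad_values(df, features):
    """Return (row label, column name, value) for each NaN or inf in `features`.

    Hypothetical helper: raises KeyError if a requested column is absent.
    Assumes the requested columns are numeric.
    """
    missing = [col for col in features if col not in df.columns]
    if missing:
        raise KeyError(f"Columns not found in dataframe: {missing}")

    # Convert to a float array so NaN and +/-inf can be detected together.
    values = df[features].to_numpy(dtype=float)
    rows, cols = np.nonzero(~np.isfinite(values))
    return [(df.index[r], features[c], values[r, c]) for r, c in zip(rows, cols)]


def validate_features(df, features, max_reported=10):
    """Raise ValueError naming the rows and columns that hold NaN or inf."""
    problems = find_bad_values(df, features)
    if problems:
        details = "; ".join(
            f"row {row}, column {col!r}: {value}"
            for row, col, value in problems[:max_reported]
        )
        raise ValueError(f"{len(problems)} non-finite value(s) found: {details}")
```

Calling `validate_features(df, ["Cells_Area", "Nuclei_Area"])` on a dataframe with a NaN in `Cells_Area` would raise a `ValueError` that names the row label and the column, rather than failing later inside a numerical operation. A schema-based implementation could generate equivalent checks from the column arguments the user passes.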

Feature example

Coming

Alternative Solutions

No response

Additional information

No response

@kenibrewer kenibrewer added the enhancement New feature or request label Mar 27, 2024
@kenibrewer kenibrewer linked a pull request Mar 27, 2024 that will close this issue
13 tasks