Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE]: Analyze root cause of quality issues using LLMs #42

Open
mwojtyczka opened this issue Jul 23, 2024 · 0 comments
Open

[FEATURE]: Analyze root cause of quality issues using LLMs #42

mwojtyczka opened this issue Jul 23, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@mwojtyczka
Copy link
Contributor

mwojtyczka commented Jul 23, 2024

Problem statement

DQX can quarantine the data but cannot curate it.

Proposed Solution

Certain curation tasks would be possible with LLMs like adjusting timestamp values to the correct format, autofilling missing values based on the data statistics etc.

@mwojtyczka mwojtyczka added the enhancement New feature or request label Jul 23, 2024
@mwojtyczka mwojtyczka changed the title [FEATURE]: Research on analyzing the root cause of quality issues using Gene/LLMs [FEATURE]: Automatic data curation of quarantined data using LLMs Jan 27, 2025
@mwojtyczka mwojtyczka changed the title [FEATURE]: Automatic data curation of quarantined data using LLMs [FEATURE]: Automatic data curation of quarantined data using LLMs 2 Jan 27, 2025
@mwojtyczka mwojtyczka changed the title [FEATURE]: Automatic data curation of quarantined data using LLMs 2 [FEATURE]: Analyze root cause of quality issues using LLMs Jan 27, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant