-
Notifications
You must be signed in to change notification settings - Fork 1.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
move information_schema to datafusion-catalog #14364
base: main
Are you sure you want to change the base?
move information_schema to datafusion-catalog #14364
Conversation
I am checking this one out |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @logan-keede -- this is great 🙏
I also took the liberty of pushing some commits to remove a few unecessary explciit dependencies as I thought that would be better than trying to explain how to do it
If you are feeling like some more refactoring projects, any chance you are interested in working to split out the data sources (aka make datafusion-datasource-parquet
, datafusion-datasource-csv
, etc)?
This doesn't have a ticket yet (as we were still working to extract the catalog and physical optimizers, but now that those are done, I think we are ready to try to break out the listing table / data sources)
datafusion/substrait/Cargo.toml
Outdated
@@ -46,6 +46,7 @@ url = { workspace = true } | |||
|
|||
[dev-dependencies] | |||
datafusion = { workspace = true, features = ["nested_expressions"] } | |||
datafusion-catalog = { workspace = true } |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think since datafusion re-exports datafusion_catalog
we should be able to avoid this dependency:
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I knew there was some chance of removing this dependency, but not sure on how to go about it. So, thanks.
/// assert_eq!(ctes[0].to_string(), "my_cte"); | ||
/// ``` | ||
pub fn resolve_table_references( | ||
statement: &datafusion_sql::parser::Statement, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think since this function belongs in datafusion-sql
as it is walking over the sql parse tree
perhaps we can put it in https://github.com/apache/datafusion/tree/main/datafusion/sql/src/resolve.rs
or something and then remove the dependency of datafusion-catalog
on datafusion-sql
I think we could do this in a follow on PR as well
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
let's do that then since I might not be able to commit over the next few days due to some personal commitments.
Well, I guess I know what I might be doing when I get back. I will definitely look into it, please let me know if there have been some discussion(or any relevant resource) on it in the past if that's not too much trouble. |
Which issue does this PR close?
Rationale for this change
What changes are included in this PR?
Are these changes tested?
Are there any user-facing changes?