-
Notifications
You must be signed in to change notification settings - Fork 440
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[VL] offload table scan when filter need fallback #7261
Comments
Should be. Let me do a quick test once @zml1206 resolve conflicts |
Conflicts resolved. But I think it can not be resoved by #7215, it need rewrite datafilters of FileSourceScanExec. After this pr, I can add a Rewrite rule to rewrite datafilters to resolve it. @FelixYBW @zhztheplayer |
Can we determine whether expression supports offload? @zhztheplayer There are three layers of filters here. The first layer is pushedDownFilters of FileSourceScanExec, which is also used by vanilla spark scan. The second layer is the dataFilters of FileSourceScanExec, which is used to generate the pushedDownFilters of FileSourceScanExec, as well as the filters used after the current offload. The third layer is the condition of the filter. In theory, the filter condition contains dataFilters, and dataFilters contains pushedDownFilters. |
Description
currently when a filter has UDF and fallback, the table scan is also fallback. We should offload table scan since it can benefit from native and much more expensive than filter and new C2R.
In this query, parquet scan output int and string only.
The text was updated successfully, but these errors were encountered: