Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update Logical Types Branch #14241

Merged
merged 770 commits into from
Jan 23, 2025
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
770 commits
Select commit Hold shift + click to select a range
93b3d9c
Handle alias when parsing sql(parse_sql_expr) (#12939)
Eason0729 Dec 11, 2024
a52eb47
Improve documentation for TableProvider (#13724)
alamb Dec 11, 2024
95405a9
Reveal implementing type and return type in simple UDF implementation…
findepi Dec 11, 2024
28e4c64
minor: Extract tests for `EXTRACT` AND `date_part` to their own file …
alamb Dec 11, 2024
08119e6
Support unparsing `UNNEST` plan to `UNNEST` table factor SQL (#13660)
goldmedal Dec 11, 2024
de9e16b
Update to apache-avro 0.17, fix compatibility changes schema handling…
mdroogh Dec 12, 2024
44f4be2
Minor: Add doc example to RecordBatchStreamAdapter (#13725)
alamb Dec 12, 2024
36a1361
Implement GroupsAccumulator for corr(x,y) aggregate function (#13581)
2010YOUY01 Dec 12, 2024
1c73a23
fix union serialisation order in proto (#13709)
onursatici Dec 12, 2024
98c483e
Minor: make unsupported `nanosecond` part a real (not internal) error…
alamb Dec 12, 2024
aeddbd9
Add tests for date_part on columns + timestamps with / without timezo…
alamb Dec 12, 2024
320e4d6
Optimize performance of `initcap` function (~2x faster) (#13691)
tlm365 Dec 12, 2024
b69fa67
Minor: Add documentation explaining that initcap oly works for ASCII …
alamb Dec 13, 2024
4148729
Support sqllogictest --complete with postgres (#13746)
findepi Dec 13, 2024
6ac1999
doc-gen: migrate window functions documentation to attribute based (#…
zjregee Dec 13, 2024
c030bfe
Minor: Remove memory reservation in `JoinLeftData` used in HashJoin (…
jayzhan-synnada Dec 13, 2024
5af4333
Update to bigdecimal 0.4.7 (#13747)
findepi Dec 13, 2024
e8314ab
chore: clean up dependencies (#13728)
comphead Dec 13, 2024
03e39da
fix: Implicitly plan `UNNEST` as lateral (#13695)
rkrishn7 Dec 13, 2024
8b6daaf
Minor: improve the Deprecation / API health guidelines (#13701)
alamb Dec 13, 2024
68ead28
fix: specify roottype in substrait fieldreference (#13647)
mbwhite Dec 13, 2024
08d3b65
Simplify type signatures using `TypeSignatureClass` for mixed type fu…
jayzhan211 Dec 14, 2024
bfabd48
Minor: Add some more blog posts to the readings page (#13761)
alamb Dec 14, 2024
652b5a4
docs: update GroupsAccumulator instead of GroupAccumulator (#13787)
rluvaton Dec 15, 2024
e70319f
Improve Deprecation Guidelines more (#13776)
alamb Dec 15, 2024
bd2c975
fix: add `null_buffer` length check to `StringArrayBuilder`/`LargeStr…
jdockerty Dec 15, 2024
2176330
Revert the removal of reservation in HashJoin (#13792)
jayzhan-synnada Dec 16, 2024
5c29399
added count aggregate slt (#13790)
buraksenn Dec 16, 2024
668984e
Update documentation guidelines for contribution content (#13703)
alamb Dec 16, 2024
59410ea
Add Round trip tests for Array <--> ScalarValue (#13777)
alamb Dec 16, 2024
fe53eaf
fix: Limit together with pushdown_filters (#13788)
zhuqi-lucas Dec 16, 2024
174e08b
Minor: improve Analyzer docs (#13798)
alamb Dec 16, 2024
e665115
Minor: cargo update in datafusion-cli (#13801)
alamb Dec 17, 2024
57d1309
Fix `ScalarValue::to_array_of_size` for DenseUnion (#13797)
kylebarron Dec 17, 2024
452a8f4
fix: enable pruning by bloom filters for dictionary columns (#13768)
korowa Dec 17, 2024
5500b11
Handle empty rows for `array_distinct` (#13810)
cht42 Dec 17, 2024
7e0fc14
Fix get_type for higher-order array functions (#13756)
findepi Dec 18, 2024
63ce486
Chore: Do not return empty record batches from streams (#13794)
mertak-synnada Dec 18, 2024
82a40f3
Handle possible overflows in StringArrayBuilder / LargeStringArrayBui…
wiedld Dec 18, 2024
1fc7769
fix: Ignore empty files in ListingTable when listing files with or wi…
Blizzara Dec 18, 2024
01ffb64
Support Null regex override in csv parser options. (#13228)
dhegberg Dec 18, 2024
c2b6035
Minor: Extend ScalarValue::new_zero() (#13828)
berkaysynnada Dec 18, 2024
32a13d8
chore: temporarily disable windows flow (#13833)
comphead Dec 19, 2024
b7fb32a
feat: `parse_float_as_decimal` supports scientific notation and Decim…
jonahgao Dec 19, 2024
46101f3
Replace `BooleanArray::extend` with `append_n` (#13832)
comphead Dec 19, 2024
2439979
Rename `TypeSignature::NullAry` --> `TypeSignature::Nullary` and impr…
alamb Dec 19, 2024
3467011
[bugfix] ScalarFunctionExpr does not preserve the nullable flag on ro…
ccciudatu Dec 19, 2024
71f157f
Add example of interacting with a remote catalog (#13722)
alamb Dec 19, 2024
9f530dd
Update substrait requirement from 0.49 to 0.50 (#13808)
dependabot[bot] Dec 19, 2024
697894f
typo: remove extraneous "`" in doc comment, fix header (#13848)
akesling Dec 19, 2024
5d563d9
typo: remove extra "`" interfering with doc formatting (#13847)
akesling Dec 19, 2024
95d296c
Support n-ary monotonic functions in ordering equivalence (#13841)
gokselk Dec 20, 2024
f3b1141
Replace `execution_mode` with `emission_type` and `boundedness` (#13823)
jayzhan-synnada Dec 20, 2024
b0d7cd0
Preserve ordering equivalencies on `with_reorder` (#13770)
gokselk Dec 20, 2024
079f621
replace CASE expressions in predicate pruning with boolean algebra (#…
adriangb Dec 20, 2024
d7aeb1a
enable DF's nested_expressions feature by in datafusion-substrait tes…
Blizzara Dec 20, 2024
31acf45
Add configurable normalization for configuration options and preserve…
blaginin Dec 20, 2024
87b77bb
Improve`Signature` and `comparison_coercion` documentation (#13840)
alamb Dec 20, 2024
74480ac
feat: support normalized expr in CSE (#13315)
zhuliquan Dec 20, 2024
75202b5
Upgrade to sqlparser `0.53.0` (#13767)
alamb Dec 20, 2024
9b19d36
Minor: Use `resize` instead of `extend` for adding static values in S…
comphead Dec 20, 2024
667c77a
feat(function): add `least` function (#13786)
rluvaton Dec 20, 2024
4118c43
Improve SortPreservingMerge::enable_round_robin_repartition docs (#1…
alamb Dec 20, 2024
7089c64
Minor: Unify `downcast_arg` method (#13865)
comphead Dec 21, 2024
ade14e7
Implement `SHOW FUNCTIONS` (#13799)
goldmedal Dec 21, 2024
60024c5
Update bzip2 requirement from 0.4.3 to 0.5.0 (#13740)
dependabot[bot] Dec 21, 2024
ede665b
Fix build (#13869)
alamb Dec 21, 2024
b99400e
feat(substrait): modular substrait consumer (#13803)
vbarua Dec 21, 2024
a50ed34
Minor: fix: Include FetchRel when producing LogicalPlan from Sort (#1…
robtandy Dec 21, 2024
8c48a8c
Minor: improve error message when ARRAY literals can not be planned (…
alamb Dec 21, 2024
2639fe0
Add documentation for `SHOW FUNCTIONS` (#13868)
alamb Dec 21, 2024
a267784
Support unicode character for `initcap` function (#13752)
tlm365 Dec 22, 2024
e6f5cb6
[minor] make recursive package dependency optional (#13778)
buraksenn Dec 22, 2024
242f45f
Minor: remove unused async-compression `futures-io` feature (#13875)
jonahgao Dec 22, 2024
a7ff7a5
Consolidate Example: dataframe_output.rs into dataframe.rs (#13877)
zhuqi-lucas Dec 22, 2024
8fd792f
Restore `DocBuilder::new()` to avoid breaking API change (#13870)
alamb Dec 22, 2024
405b99c
Improve error messages for incorrect zero argument signatures (#13881)
alamb Dec 23, 2024
63ba5b6
Consolidate Example: simplify_udwf_expression.rs into advanced_udwf.r…
xarus01 Dec 23, 2024
e718c1a
minor: fix typos in comments / structure names (#13879)
zhuliquan Dec 23, 2024
b4b267a
Support 1 or 3 arg in generate_series() UDTF (#13856)
UBarney Dec 24, 2024
6cfd1cf
Support (order by / sort) for DataFrameWriteOptions (#13874)
zhuqi-lucas Dec 24, 2024
94f08ff
Update sort_merge_join.rs (#13894)
berkaysynnada Dec 24, 2024
901a094
Update join_selection.rs (#13893)
berkaysynnada Dec 24, 2024
e99e02b
Fix `recursive-protection` feature flag (#13887)
alamb Dec 24, 2024
3864b11
Fix visibility of swap_hash_join (#13899)
alamb Dec 24, 2024
7b4e559
Minor: Avoid emitting empty batches in partial sort (#13895)
berkaysynnada Dec 25, 2024
073a3b1
Prepare for 44.0.0 release: version and changelog (#13882)
alamb Dec 25, 2024
f52c56b
Support unparsing implicit lateral `UNNEST` plan to SQL text (#13824)
goldmedal Dec 25, 2024
a44c6e0
fix case_column_or_null with nullable when conditions (#13886)
richox Dec 25, 2024
9fbcf23
Fixed Issue #13896 (#13903)
Spaarsh Dec 25, 2024
482b489
Introduce `UserDefinedLogicalNodeUnparser` for User-defined Logical P…
goldmedal Dec 25, 2024
b9cef8c
Preserve constant values across union operations (#13805)
gokselk Dec 25, 2024
30660e0
chore(deps): update sqllogictest requirement from 0.23.0 to 0.24.0 (#…
xudong963 Dec 26, 2024
2d985b4
fix RecordBatch size in topK (#13906)
getChan Dec 26, 2024
5045bde
ci improvements, update protoc (#13876)
Omega359 Dec 26, 2024
cf8f2f8
Introduce LogicalPlan invariants, begin automatically checking them (…
wiedld Dec 26, 2024
9665e09
Correct return type for initcap scalar function with utf8view (#13909)
timsaucer Dec 27, 2024
933fec8
Consolidate example: simplify_udaf_expression.rs into advanced_udaf.r…
takaebato Dec 27, 2024
d25099e
Implement maintains_input_order for AggregateExec (#13897)
alihan-synnada Dec 27, 2024
a08dc0a
Move join type input swapping to pub methods on Joins (#13910)
alamb Dec 27, 2024
618767e
doc-gen: migrate scalar functions (string) documentation 3/4 (#13926)
Chen-Yuan-Lai Dec 28, 2024
0989649
Update sqllogictest requirement from 0.24.0 to 0.25.0 (#13917)
dependabot[bot] Dec 28, 2024
f3e0fa2
Consolidate Examples: memtable.rs and parquet_multiple_files.rs (#13913)
alamb Dec 28, 2024
fb5378d
doc-gen: migrate scalar functions (crypto) documentation (#13918)
Chen-Yuan-Lai Dec 29, 2024
a203c2b
doc-gen: migrate scalar functions (datetime) documentation 1/2 (#13920)
Chen-Yuan-Lai Dec 29, 2024
a47729c
fix RecordBatch size in hash join (#13916)
getChan Dec 29, 2024
6a92870
doc-gen: migrate scalar functions (array) documentation 1/3 (#13928)
Chen-Yuan-Lai Dec 29, 2024
4290f36
doc-gen: migrate scalar functions (math) documentation 1/2 (#13922)
Chen-Yuan-Lai Dec 29, 2024
fce3fb3
doc-gen: migrate scalar functions (math) documentation 2/2 (#13923)
Chen-Yuan-Lai Dec 29, 2024
2b4e200
doc-gen: migrate scalar functions (array) documentation 3/3 (#13930)
Chen-Yuan-Lai Dec 29, 2024
383f279
doc-gen: migrate scalar functions (array) documentation 2/3 (#13929)
Chen-Yuan-Lai Dec 29, 2024
ab69bb0
doc-gen: migrate scalar functions (string) documentation 4/4 (#13927)
Chen-Yuan-Lai Dec 29, 2024
4d07579
Support explain query when running dfbench with clickbench (#13942)
zhuqi-lucas Dec 30, 2024
0cd0bbe
Consolidate example to_date.rs into dateframe.rs (#13939)
alamb Dec 30, 2024
3ea1b6a
Revert "Update sqllogictest requirement from 0.24.0 to 0.25.0 (#1391…
alamb Dec 30, 2024
fb1d4bc
Implement predicate pruning for `like` expressions (prefix matching) …
adriangb Dec 30, 2024
9b5995f
doc-gen: migrate scalar functions (string) documentation 1/4 (#13924)
Chen-Yuan-Lai Dec 30, 2024
2b15ad1
consolidate dataframe_subquery.rs into dataframe.rs (#13950)
zjregee Dec 31, 2024
9d39358
migrate btrim to user_doc macro (#13952)
delamarch3 Dec 31, 2024
9eca7d1
doc-gen: migrate scalar functions (datetime) documentation 2/2 (#13921)
Chen-Yuan-Lai Dec 31, 2024
aafec07
Add sqlite test files, progress bar, and automatic postgres container…
Omega359 Jan 1, 2025
b54e648
Supporting writing schema metadata when writing Parquet in parallel (…
wiedld Jan 1, 2025
ab9ff56
chore: Create devcontainer.json (#13520)
rluvaton Jan 1, 2025
259443d
Minor: consolidate ConfigExtension example into API docs (#13954)
alamb Jan 2, 2025
846adf3
Parallelize pruning utf8 fuzz test (#13947)
alamb Jan 2, 2025
38ccb00
Add swap_inputs to SMJ (#13984)
ozankabak Jan 2, 2025
04f56bd
fix(datafusion-functions-nested): `arrow-distinct` now work with null…
rluvaton Jan 2, 2025
032d722
Update release instructions for 44.0.0 (#13959)
alamb Jan 2, 2025
f1af933
Add datafusion python 43.1.0 blog post to doc. (#13974)
Omega359 Jan 2, 2025
8fc26c2
Include license and notice files in more crates (#13985)
ankane Jan 2, 2025
858c020
Extract postgres container from sqllogictest, update datafusion-testi…
Omega359 Jan 2, 2025
63c4d1b
Update rstest requirement from 0.23.0 to 0.24.0 (#13977)
dependabot[bot] Jan 2, 2025
264f4c5
Move hash collision test to run only when merging to main. (#13973)
Omega359 Jan 2, 2025
63265fd
Update itertools requirement from 0.13 to 0.14 (#13965)
dependabot[bot] Jan 3, 2025
fe14e77
Change trigger, rename `hash_collision.yml` to `extended.yml` and add…
alamb Jan 3, 2025
e59045d
doc-gen: migrate scalar functions (string) documentation 2/4 (#13925)
Chen-Yuan-Lai Jan 3, 2025
66e4d00
Update substrait requirement from 0.50 to 0.51 (#13978)
dependabot[bot] Jan 3, 2025
33437f7
Update release README for datafusion-cli publishing (#13982)
alamb Jan 3, 2025
ab1de2c
Enhance LastValueAccumulator logic and add SQL logic tests for last_v…
jayzhan211 Jan 3, 2025
98cb948
Improve deserialize_to_struct example (#13958)
alamb Jan 4, 2025
39a69f5
Update docs (#14002)
matthewmturner Jan 4, 2025
0f4b8b1
Optimize CASE expression for "expr or expr" usage. (#13953)
aweltsch Jan 4, 2025
b8b0c55
feat(substrait): introduce consume_rel and consume_expression (#13963)
vbarua Jan 4, 2025
78ee13a
Consolidate csv_opener.rs and json_opener.rs into a single example (#…
cj-zhukov Jan 5, 2025
ecd9f36
FIX : Incorrect NULL handling in BETWEEN expression (#14007)
getChan Jan 5, 2025
4c898b4
feat(substrait): modular substrait producer (#13931)
vbarua Jan 5, 2025
42dce6e
fix: Avoid re-wrapping planning errors Err(DataFusionError::Plan) fo…
avkirilishin Jan 5, 2025
69687da
feat: support `RightAnti` for `SortMergeJoin` (#13680)
irenjj Jan 5, 2025
3f4297f
feat: preserve session id when using cxt.enable_url_table() (#14004)
milenkovicm Jan 6, 2025
487b952
Return error message during planning when inserting into a MemTable w…
tobixdev Jan 6, 2025
de6d511
Minor: Rewrite LogicalPlan::max_rows for Join and Union, made it easi…
maruschin Jan 6, 2025
2848aaf
Chore: update wasm-supported crates, add tests (#14005)
Lordworms Jan 6, 2025
c2e4747
Use workspace rust-version for all workspace crates (#14009)
Jefffrey Jan 6, 2025
cad39e0
[Minor] refactor: make ArraySort public for broader access (#14006)
dharanad Jan 6, 2025
7f1bad9
Update sqllogictest requirement from =0.24.0 to =0.26.0 (#14017)
dependabot[bot] Jan 6, 2025
775c9a5
`url` dependancy update (#14019)
vadimpiven Jan 6, 2025
b2f8e94
Minor: Improve zero partition check when inserting into `MemTable` (#…
jonahgao Jan 6, 2025
4e877a0
refactor: make structs public and implement Default trait (#14030)
dharanad Jan 6, 2025
f31ca5b
Minor: Remove redundant implementation of `StringArrayType` (#14023)
tlm365 Jan 7, 2025
b5fe4a9
Added references to IDE documentation for dev containers along with a…
Omega359 Jan 7, 2025
81b50c4
Use partial aggregation schema for spilling to avoid column mismatch …
kosiew Jan 8, 2025
3887087
Encapsulate fields of `EquivalenceProperties` (#14040)
alamb Jan 8, 2025
b0bd899
Encapsulate fields of `EquivalenceGroup` (#14039)
alamb Jan 8, 2025
05b6f93
Fix error on `array_distinct` when input is empty #13810 (#14034)
cht42 Jan 8, 2025
7af6aa9
Update petgraph requirement from 0.6.2 to 0.7.1 (#14045)
dependabot[bot] Jan 8, 2025
c7e8858
Encapsulate fields of `OrderingEquivalenceClass` (make field non pub)…
alamb Jan 8, 2025
da4208b
Fix: ensure that compression type is also taken into consideration du…
timvw Jan 8, 2025
ad5a04f
feat(optimizer): Enable filter pushdown on window functions (#14026)
nuno-faria Jan 8, 2025
5955860
Unparsing optimized (> 2 inputs) unions (#14031)
MohamedAbdeen21 Jan 9, 2025
61afb0d
Minor: Document output schema of LogicalPlan::Aggregate and LogicalPl…
alamb Jan 9, 2025
ac9584e
Simplify error handling in case.rs (#13990) (#14033)
cj-zhukov Jan 9, 2025
f379719
feat: add `AsyncCatalogProvider` helpers for asynchronous catalogs (#…
westonpace Jan 9, 2025
80c828b
Custom scalar to sql overrides support for DuckDB Unparser dialect (#…
sgrebnov Jan 9, 2025
f9d3133
Improve perfomance of `reverse` function (#14025)
tlm365 Jan 9, 2025
40f1ddd
docs(ci): use up-to-date protoc with docs.rs (#14048)
wackywendell Jan 9, 2025
e113ec1
fix (#14042)
cht42 Jan 9, 2025
bf28e9b
Re-export TypeSignatureClass from the datafusion-expr package (#14051)
niebayes Jan 9, 2025
300c2af
Fix clippy for Rust 1.84 (#14065)
jonahgao Jan 10, 2025
268df42
fix: incorrect error message of function_length_check (#14056)
niebayes Jan 10, 2025
295ffb4
test: Add plan execution during tests for bounded source (#14013)
avkirilishin Jan 10, 2025
918ac1b
Bump `ctor` to `0.2.9` (#14069)
mbrobbel Jan 10, 2025
ec173ab
Refactor into `LexOrdering::collapse`, `LexRequirement::collapse` avo…
alamb Jan 10, 2025
90c5b16
Bump `wasm-bindgen` and `wasm-bindgen-futures` (#14068)
mbrobbel Jan 10, 2025
334d6ec
update (#14070)
berkaysynnada Jan 10, 2025
4b7281e
fix: make get_valid_types handle TypeSignature::Numeric correctly (#1…
niebayes Jan 11, 2025
b813513
Minor: Make `group_schema` as `PhysicalGroupBy` method (#14064)
jayzhan211 Jan 11, 2025
f1671bb
Minor: Move `LimitPushdown` tests to be in the same file as the code …
alamb Jan 11, 2025
d91a7c0
Add comments to physical optimizer tests (#14075)
alamb Jan 11, 2025
17446ad
added "DEFAULT_CLI_FORMAT_OPTIONS" for cli and sqllogic test (#14052)
jatin510 Jan 11, 2025
226afc6
Add H2O.ai Database-like Ops benchmark to dfbench (groupby support) (…
zhuqi-lucas Jan 12, 2025
167c11e
Add telemetry.sh to list of use cases (#14090)
TheBuilderJR Jan 12, 2025
0a2c027
chore: deprecate `ValuesExec` in favour of `MemoryExec` (#14032)
jonathanc-n Jan 12, 2025
2e5c6b2
Improve performance of `find_in_set` function (#14020)
tlm365 Jan 12, 2025
54a5d3f
minor: Add link to example in catalog (#14062)
alamb Jan 12, 2025
41863dd
fix set (#14081)
jayzhan211 Jan 12, 2025
98e821c
Simplify the return type of `sql_select_to_rex()` (#14088)
jonahgao Jan 13, 2025
f9cc332
Minor: Add a link to RecordBatchStreamAdapter to `SendableRecordBatch…
alamb Jan 13, 2025
e8cac16
Update substrait requirement from 0.51 to 0.52 (#14107)
mbrobbel Jan 13, 2025
63b94c8
fix null date args in range (#14093)
cht42 Jan 13, 2025
722307f
feat: add support for `LogicalPlan::DML(...)` serde (#14079)
milenkovicm Jan 13, 2025
fda500a
Avoid Aliased Window Expr Enter Unreachable Code (#14109)
berkaysynnada Jan 14, 2025
888df6a
clarify logic in nth_value window function (#14104)
zjregee Jan 14, 2025
3f91a20
Move JoinSelection into datafusion-physical-optimizer crate (#14073) …
cj-zhukov Jan 14, 2025
53dab46
distinguish none and empty projection (#14116)
XiangpengHao Jan 14, 2025
0a2f09f
Add a hint about normalization in error message (#14089) (#14113)
cj-zhukov Jan 14, 2025
02c8247
fix: incorrect NATURAL/USING JOIN schema (#14102)
jonahgao Jan 14, 2025
c7c200a
Chore: refactor DataSink traits to avoid duplication (#14121)
mertak-synnada Jan 15, 2025
495b82b
Add sqlite sqllogictest run to extended.yml for running sqlite test s…
Omega359 Jan 15, 2025
74e6922
Fix duplicated SharedBitmapBuilder definitions (#14122)
lewiszlw Jan 15, 2025
89d6f0e
feat: add `alias()` method for DataFrame (#14127)
jonahgao Jan 15, 2025
80d3131
doc-gen: make user_doc to work with predefined consts (#14086)
ding-young Jan 15, 2025
0c229d7
Minor: Document the rationale for the lack of Cargo.lock (#14071)
alamb Jan 15, 2025
d42b994
Return err if wildcard is not expanded before type coercion (#14130)
xudong963 Jan 15, 2025
04b6d4d
Fix combine with session config (#14139)
XiangpengHao Jan 16, 2025
e58c27b
Minor: move resolve_overlap a method on OrderingEquivalenceClas (#14138)
alamb Jan 16, 2025
2159ba3
doc-gen: migrate scalar functions (encoding & regex) documentation (#…
Chen-Yuan-Lai Jan 16, 2025
7f70b6c
bugfix: create view with multi union may get wrong schema (#14132) (#…
Curricane Jan 16, 2025
dc22b3f
Deduplicate function `get_final_indices_from_shared_bitmap` (#14145)
lewiszlw Jan 16, 2025
68fd51f
Add tests for PR #14133 (view with multi unions) (#14152)
Curricane Jan 16, 2025
3906c04
Update datafusion-testing git hash (#14137)
Omega359 Jan 16, 2025
50c7977
Reuse `on` expressions values in HashJoinExec (#14131)
lewiszlw Jan 16, 2025
5fa8b3b
fix: encode should work with non-UTF-8 binaries (#14087)
mesejo Jan 16, 2025
e8ad5e5
chore: move `SanityChecker` into `physical-optimizer` crate (#14083)
mnpw Jan 16, 2025
3cd31af
Propagate table constraints through physical plans to optimize sort o…
gokselk Jan 16, 2025
05f4e5a
NestedLoopJoin Projection Pushdown (#14120)
jayzhan-synnada Jan 16, 2025
9403448
Fix: regularize order bys when consuming from substrait (#14125)
gabotechs Jan 16, 2025
5d9be90
Remove dependency on physical-optimizer on functions-aggregates (#14134)
alamb Jan 17, 2025
40d5158
doc-gen: migrate scalar functions (other, conditional, and struct) do…
Chen-Yuan-Lai Jan 17, 2025
3082ac6
chore: fix flaky tests (#14170)
xudong963 Jan 17, 2025
a407500
Upgrade arrow-rs, parquet to `54.0.0` and pyo3 to `0.23.3` (#14153)
Owen-CH-Leung Jan 17, 2025
0522272
Simplify Bloom Filter Check (#14165)
alamb Jan 17, 2025
0e22172
Make `LexOrdering::inner` non pub, add comments, update usages (#14155)
alamb Jan 17, 2025
57eb17f
Fix doctests in ScalarValue (#14164) (#14178)
cj-zhukov Jan 18, 2025
5d18648
Add `ScalarValue::try_as_str` to get str value from logical strings (…
alamb Jan 18, 2025
868fc35
fix: handle scalar predicates in CASE expressions to prevent internal…
avkirilishin Jan 18, 2025
4f58700
Minor: Consolidate dataframe tests into core_integration (#14169)
alamb Jan 18, 2025
e9a77e0
Add a hint about expected extension in error message in register_csv,…
cj-zhukov Jan 18, 2025
0283077
Test: Validate memory limit for sort queries to extended test (#14142)
2010YOUY01 Jan 19, 2025
4e88a1b
refactor: switch BooleanBufferBuilder to NullBufferBuilder in sort fu…
Chen-Yuan-Lai Jan 19, 2025
2ad8955
Add section to howtos.md (#14171)
xarus01 Jan 19, 2025
5340521
refactor: switch BooleanBufferBuilder to NullBufferBuilder in correla…
Chen-Yuan-Lai Jan 19, 2025
1b63e7b
Rename extended test job name (#14199)
alamb Jan 19, 2025
80edb3a
Added job board as a separate header in the documentation (#14191)
edmondop Jan 19, 2025
52fb1a0
Support spaceship operator (`<=>`) support (alias for `IS NOT DISTINC…
Spaarsh Jan 19, 2025
3a65be0
Add benchmark for planning sorted unions (#14157)
alamb Jan 19, 2025
12c4c86
feat: Use `SchemaRef` in `JoinFilter` (#14182)
irenjj Jan 20, 2025
98e8942
refactor: switch BooleanBufferBuilder to NullBufferBuilder in functio…
Chen-Yuan-Lai Jan 20, 2025
0a85bb4
Improve case expr constant handling, Add .slt test (#14159)
alamb Jan 20, 2025
f667a01
Minor add ticket references to deprecated code (#14174)
alamb Jan 20, 2025
a702e64
[logical-types] update to newest main
tobixdev Jan 20, 2025
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
13 changes: 13 additions & 0 deletions .devcontainer/Dockerfile
Original file line number Diff line number Diff line change
@@ -0,0 +1,13 @@
FROM rust:bookworm

RUN apt-get update && export DEBIAN_FRONTEND=noninteractive \
# Remove imagemagick due to https://security-tracker.debian.org/tracker/CVE-2019-10131
&& apt-get purge -y imagemagick imagemagick-6-common

# Add protoc
# https://datafusion.apache.org/contributor-guide/getting_started.html#protoc-installation
RUN curl -LO https://github.com/protocolbuffers/protobuf/releases/download/v25.1/protoc-25.1-linux-x86_64.zip \
&& unzip protoc-25.1-linux-x86_64.zip -d $HOME/.local \
&& rm protoc-25.1-linux-x86_64.zip

ENV PATH="$PATH:$HOME/.local/bin"
16 changes: 16 additions & 0 deletions .devcontainer/devcontainer.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,16 @@
{
"build": {
"dockerfile": "./Dockerfile",
"context": "."
},
"customizations": {
"vscode": {
"extensions": [
"rust-lang.rust-analyzer"
]
}
},
"features": {
"ghcr.io/devcontainers/features/rust:1": "latest"
}
}
12 changes: 7 additions & 5 deletions .github/actions/setup-builder/action.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -28,16 +28,18 @@ runs:
- name: Install Build Dependencies
shell: bash
run: |
apt-get update
apt-get install -y protobuf-compiler
RETRY=("ci/scripts/retry" timeout 120)
"${RETRY[@]}" apt-get update
"${RETRY[@]}" apt-get install -y protobuf-compiler
- name: Setup Rust toolchain
shell: bash
# rustfmt is needed for the substrait build script
run: |
RETRY=("ci/scripts/retry" timeout 120)
echo "Installing ${{ inputs.rust-version }}"
rustup toolchain install ${{ inputs.rust-version }}
rustup default ${{ inputs.rust-version }}
rustup component add rustfmt
"${RETRY[@]}" rustup toolchain install ${{ inputs.rust-version }}
"${RETRY[@]}" rustup default ${{ inputs.rust-version }}
"${RETRY[@]}" rustup component add rustfmt
- name: Configure rust runtime env
uses: ./.github/actions/setup-rust-runtime
- name: Fixup git permissions
Expand Down
6 changes: 4 additions & 2 deletions .github/actions/setup-macos-aarch64-builder/action.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -30,8 +30,8 @@ runs:
run: |
mkdir -p $HOME/d/protoc
cd $HOME/d/protoc
export PROTO_ZIP="protoc-21.4-osx-aarch_64.zip"
curl -LO https://github.com/protocolbuffers/protobuf/releases/download/v21.4/$PROTO_ZIP
export PROTO_ZIP="protoc-29.1-osx-aarch_64.zip"
curl -LO https://github.com/protocolbuffers/protobuf/releases/download/v29.1/$PROTO_ZIP
unzip $PROTO_ZIP
echo "$HOME/d/protoc/bin" >> $GITHUB_PATH
export PATH=$PATH:$HOME/d/protoc/bin
Expand All @@ -43,5 +43,7 @@ runs:
rustup toolchain install stable
rustup default stable
rustup component add rustfmt
- name: Setup rust cache
uses: Swatinem/rust-cache@v2
- name: Configure rust runtime env
uses: ./.github/actions/setup-rust-runtime
4 changes: 2 additions & 2 deletions .github/actions/setup-macos-builder/action.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -30,8 +30,8 @@ runs:
run: |
mkdir -p $HOME/d/protoc
cd $HOME/d/protoc
export PROTO_ZIP="protoc-21.4-osx-x86_64.zip"
curl -LO https://github.com/protocolbuffers/protobuf/releases/download/v21.4/$PROTO_ZIP
export PROTO_ZIP="protoc-29.1-osx-x86_64.zip"
curl -LO https://github.com/protocolbuffers/protobuf/releases/download/v29.1/$PROTO_ZIP
unzip $PROTO_ZIP
echo "$HOME/d/protoc/bin" >> $GITHUB_PATH
export PATH=$PATH:$HOME/d/protoc/bin
Expand Down
4 changes: 2 additions & 2 deletions .github/actions/setup-windows-builder/action.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -30,8 +30,8 @@ runs:
run: |
mkdir -p $HOME/d/protoc
cd $HOME/d/protoc
export PROTO_ZIP="protoc-21.4-win64.zip"
curl -LO https://github.com/protocolbuffers/protobuf/releases/download/v21.4/$PROTO_ZIP
export PROTO_ZIP="protoc-29.1-win64.zip"
curl -LO https://github.com/protocolbuffers/protobuf/releases/download/v29.1/$PROTO_ZIP
unzip $PROTO_ZIP
export PATH=$PATH:$HOME/d/protoc/bin
protoc.exe --version
Expand Down
1 change: 1 addition & 0 deletions .github/workflows/dependencies.yml
Original file line number Diff line number Diff line change
Expand Up @@ -42,6 +42,7 @@ jobs:
- uses: actions/checkout@v4
with:
submodules: true
fetch-depth: 1
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
Expand Down
14 changes: 4 additions & 10 deletions .github/workflows/dev.yml
Original file line number Diff line number Diff line change
Expand Up @@ -23,18 +23,12 @@ concurrency:
cancel-in-progress: true

jobs:
rat:
name: Release Audit Tool (RAT)
license-header-check:
runs-on: ubuntu-latest
name: Check License Header
steps:
- name: Checkout
uses: actions/checkout@v4
- name: Setup Python
uses: actions/setup-python@v5
with:
python-version: "3.10"
- name: Audit licenses
run: ./dev/release/run-rat.sh .
- uses: actions/checkout@v4
- uses: korandoru/hawkeye@v5

prettier:
name: Use prettier to check formatting of documents
Expand Down
4 changes: 2 additions & 2 deletions .github/workflows/docs.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -26,7 +26,7 @@ jobs:
- name: Setup Python
uses: actions/setup-python@v5
with:
python-version: "3.10"
python-version: "3.12"

- name: Install dependencies
run: |
Expand Down Expand Up @@ -61,4 +61,4 @@ jobs:
git add --all
git commit -m 'Publish built docs triggered by ${{ github.sha }}'
git push || git push --force
fi
fi
1 change: 1 addition & 0 deletions .github/workflows/docs_pr.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -43,6 +43,7 @@ jobs:
- uses: actions/checkout@v4
with:
submodules: true
fetch-depth: 1
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
Expand Down
107 changes: 107 additions & 0 deletions .github/workflows/extended.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,107 @@
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing,
# software distributed under the License is distributed on an
# "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
# KIND, either express or implied. See the License for the
# specific language governing permissions and limitations
# under the License.

name: Datafusion extended tests

concurrency:
group: ${{ github.repository }}-${{ github.head_ref || github.sha }}-${{ github.workflow }}
cancel-in-progress: true

# https://docs.github.com/en/actions/writing-workflows/choosing-when-your-workflow-runs/events-that-trigger-workflows#running-your-pull_request-workflow-when-a-pull-request-merges
#
# this job is intended to only run only on the main branch as it is time consuming
# and should not fail often. However, it is important coverage to ensure correctness
# in the (very rare) event of a hash failure or sqlite query failure.
on:
# Run on all commits to main
push:
branches:
- main

jobs:
# Check crate compiles and base cargo check passes
linux-build-lib:
name: linux build test
runs-on: ubuntu-latest
container:
image: amd64/rust
steps:
- uses: actions/checkout@v4
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
rust-version: stable
- name: Prepare cargo build
run: cargo check --profile ci --all-targets

# Run extended tests (with feature 'extended_tests')
linux-test-extended:
name: cargo test 'extended_tests' (amd64)
needs: linux-build-lib
runs-on: ubuntu-latest
container:
image: amd64/rust
steps:
- uses: actions/checkout@v4
with:
submodules: true
fetch-depth: 1
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
rust-version: stable
- name: Run tests (excluding doctests)
run: cargo test --profile ci --exclude datafusion-examples --exclude datafusion-benchmarks --workspace --lib --tests --bins --features avro,json,backtrace,extended_tests
- name: Verify Working Directory Clean
run: git diff --exit-code

# Check answers are correct when hash values collide
hash-collisions:
name: cargo test hash collisions (amd64)
runs-on: ubuntu-latest
container:
image: amd64/rust
steps:
- uses: actions/checkout@v4
with:
submodules: true
fetch-depth: 1
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
rust-version: stable
- name: Run tests
run: |
cd datafusion
cargo test --profile ci --exclude datafusion-examples --exclude datafusion-benchmarks --exclude datafusion-sqllogictest --workspace --lib --tests --features=force_hash_collisions,avro,extended_tests

sqllogictest-sqlite:
name: "Run sqllogictests with the sqlite test suite"
runs-on: ubuntu-latest
container:
image: amd64/rust
steps:
- uses: actions/checkout@v4
with:
submodules: true
fetch-depth: 1
- name: Setup Rust toolchain
uses: ./.github/actions/setup-builder
with:
rust-version: stable
- name: Run sqllogictest
run: cargo test --profile release-nonlto --test sqllogictests -- --include-sqlite
Loading
Loading