concat_batches errors with "schema mismatch" error when only metadata differs #4799
Comments
I will have a proposed PR for this shortly.
Same issue with version 51.0.0. From one big CSV cut into two, I created the first Parquet file with parquet-cpp-arrow
Then, using the same schema (
When I try to concat them, I get the same error. The only difference is the naming of the schema:
--- a.txt 2024-05-27 09:32:48.409232203 +0200
+++ b.txt 2024-05-27 09:32:55.073572763 +0200
@@ -1,6 +1,6 @@
 GroupType {
     basic_info: BasicTypeInfo {
-        name: \"schema\",
+        name: \"arrow_schema\",
         repetition: None,
         converted_type: NONE,
         logical_type: None,
Describe the bug
When concatenating multiple RecordBatches together, if the batches differ only in metadata, concat_batches raises an error.
To Reproduce
Run this test:
This fails with this error:
Expected behavior
I expect the test to pass
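To make the expected behavior concrete, here is a minimal sketch of the difference between a strict schema comparison and one that ignores metadata. It uses plain stand-in types, not the arrow crate's actual `Schema` and `Field`; the names `schemas_equal_strict`, `schemas_equal_ignoring_metadata`, and `sample_schema` are hypothetical and only illustrate the idea. It is not a claim about how `concat_batches` is or will be implemented.

```rust
use std::collections::HashMap;

// Hypothetical stand-ins for illustration only; these are NOT the arrow crate's types.
#[derive(Debug, Clone, PartialEq)]
struct Field {
    name: String,
    data_type: String,
    nullable: bool,
}

#[derive(Debug, Clone, PartialEq)]
struct Schema {
    fields: Vec<Field>,
    metadata: HashMap<String, String>,
}

/// Strict comparison: metadata takes part in equality, so two schemas
/// that differ only in metadata are reported as mismatched.
fn schemas_equal_strict(a: &Schema, b: &Schema) -> bool {
    a == b
}

/// Relaxed comparison: only the fields (name, type, nullability) are compared,
/// so schemas that differ only in metadata are treated as compatible.
fn schemas_equal_ignoring_metadata(a: &Schema, b: &Schema) -> bool {
    a.fields == b.fields
}

/// Builds a one-column schema with the given metadata key/value pairs.
fn sample_schema(metadata: &[(&str, &str)]) -> Schema {
    Schema {
        fields: vec![Field {
            name: "a".to_string(),
            data_type: "Int32".to_string(),
            nullable: false,
        }],
        metadata: metadata
            .iter()
            .map(|(k, v)| (k.to_string(), v.to_string()))
            .collect(),
    }
}
```

With these stand-ins, `sample_schema(&[])` and `sample_schema(&[("origin", "b")])` fail the strict check but pass the relaxed one, which is the behavior this issue asks for when concatenating batches.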
Additional context