Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: enhance kakfa windowing for streams and enable avro kafka message format #20

Merged

Conversation

gabb1er
Copy link
Collaborator

@gabb1er gabb1er commented Dec 19, 2023

Major fixes:

  • Make windowBy to be not just a column name but a spark sql expression. This allows for more advanced selection of custom columns for windowing
  • Dataframe utils to prepare window and event timestamp columns are updated accordingly.
  • Streamable virtual sources now allows to redefine windowBy column.
  • Enable Avro format for kafka key and value message columns.

Minor fixes:

  • Refactor dumpSize defaults for error collection: when dumpSize is missing in target configuration then application level dumpSize value is used.
  • Weaken configuration validation requirements for spark parameter strings (enable string with various separators including new lines).
  • Add debug logging for schemas (both during schema reading and during sources reading).
  • Fix merge strategy rules for assembling dependencies uber-jar
  • documentation updates to reflect aforementioned changes.

…ults for error collection

- windowBy is not just a column name but a spark sql expression. This allows for more advanced selection of custom columns for windowing
- Dataframe utils to prepare window and event timestamp columns are updated accordingly
- Streamable virtual sources now allows to redefine windowBy column
- Minor fixed related to error collection: when dumpSize is missing in target configuration then application level dumpSize value is used.
- enable Avro format for kafka key and value message columns
- weaken configuration validation requirements for spark parameter strings (enable string with various separators including new lines)
- add debug logging for schemas (both during schema reading and during sources reading)
- fix merge strategy rules for assembling dependencies uber-jar
- minor doc update reflecting avro support in kafka messages.
@gabb1er gabb1er merged commit a5e9827 into Raiffeisen-DGTL:main Dec 19, 2023
8 checks passed
cibaa-team-user pushed a commit that referenced this pull request Dec 20, 2023
## [1.1.1](v1.1.0...v1.1.1) (2023-12-20)

### Bug Fixes

* enhance kakfa windowing for streams and enable avro kafka message format ([#20](#20)) ([a5e9827](a5e9827))
@cibaa-team-user
Copy link
Collaborator

🎉 This PR is included in version 1.1.1 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants