
feat: refactor kafka source API and enhance virtual sources resolving logic #37

Merged

Conversation


@gabb1er gabb1er commented Apr 16, 2024

  • Added support for the binary format for the key and value. The Kafka message key or value is read as is, without casting to other types, so it is up to the user to use the virtual sources/streams functionality to cast the column to a type suitable for data quality checks.
  • Added a boolean flag that enables schema ID subtraction from the Kafka value: when a schema registry is used, the schema ID is embedded into the Kafka value (magic byte + 4 bytes of schema ID). Therefore, in order to parse the value, this prefix must first be stripped from it (see the sketch after this list).
  • Changed the Avro schema API by adding a boolean flag that enables or disables default value checks.
  • Refactored the logic for resolving virtual sources: they are now resolved not in declaration order but with respect to their parent dependencies (see the dependency-resolution sketch below).
  • Added tests to verify the virtual source resolving logic.
  • Updated the documentation with respect to the API changes.
  • Updated the configuration API version to 1.5.
  • Fixes:
    • Updated the SQLite dependency version: security patch [VDB-248999].
    • Fixed JoinVirtualSourceReader by adding aliases to the dataframes being joined, to avoid ambiguous column references in the resulting dataframe (see the aliasing sketch below).
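
For illustration only, here is a minimal Spark sketch of what the schema ID subtraction amounts to, assuming the value is read as a binary column in the Confluent wire format. The broker address, topic, and column names are placeholders; this is not the project's actual implementation.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.expr

// Confluent wire format: 1 magic byte + 4 bytes of schema ID, then the Avro payload.
// To parse the value, that 5-byte prefix has to be removed first.
val spark = SparkSession.builder()
  .appName("schema-id-strip-sketch")
  .master("local[*]")
  .getOrCreate()

val kafkaDf = spark.read
  .format("kafka")
  .option("kafka.bootstrap.servers", "localhost:9092") // illustrative broker address
  .option("subscribe", "some-topic")                    // illustrative topic name
  .load()

// `value` is a binary column; keep everything after the first 5 bytes.
val stripped = kafkaDf.withColumn(
  "avro_payload",
  expr("substring(value, 6, length(value) - 5)")
)
```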

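The dependency-aware resolution order can be pictured as a simple iterative topological sort: a virtual source is resolved only once all of its parents are available. The sketch below is an illustrative reimplementation of that idea; the VirtualSourceDef case class and all names are invented for the example and are not the project's API.

```scala
// Hypothetical model of a virtual source definition: an id plus the ids of its parent sources.
case class VirtualSourceDef(id: String, parents: Seq[String])

def resolveOrder(defs: Seq[VirtualSourceDef], baseSources: Set[String]): Seq[VirtualSourceDef] = {
  val remaining = scala.collection.mutable.ListBuffer(defs: _*)
  val resolved  = scala.collection.mutable.LinkedHashSet(baseSources.toSeq: _*)
  val ordered   = scala.collection.mutable.ListBuffer.empty[VirtualSourceDef]

  while (remaining.nonEmpty) {
    // A virtual source is ready once all of its parents are already resolved.
    val ready = remaining.filter(_.parents.forall(resolved.contains))
    if (ready.isEmpty)
      throw new IllegalStateException(
        s"Cannot resolve virtual sources: ${remaining.map(_.id).mkString(", ")} (missing or cyclic parents)")
    ready.foreach { vs =>
      ordered   += vs
      resolved  += vs.id
      remaining -= vs
    }
  }
  ordered.toList
}
```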
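The JoinVirtualSourceReader fix boils down to aliasing each side of the join so that columns shared by both inputs can be referenced without ambiguity. Below is a minimal Spark illustration with hypothetical dataframes, not the reader's actual code.

```scala
import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.col

val spark = SparkSession.builder()
  .appName("join-alias-sketch")
  .master("local[*]")
  .getOrCreate()
import spark.implicits._

// Two hypothetical inputs that share a column name ("id").
val ordersDf    = Seq((1, "book"), (2, "pen")).toDF("id", "item")
val customersDf = Seq((1, "Alice"), (2, "Bob")).toDF("id", "name")

// Without aliases, referencing "id" after the join would be ambiguous.
// Aliasing each side makes every column addressable unambiguously.
val joined = ordersDf.as("left")
  .join(customersDf.as("right"), col("left.id") === col("right.id"), "inner")

joined.select(col("left.id"), col("right.name"), col("left.item")).show()
```
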
@gabb1er gabb1er merged commit 8ea9c23 into Raiffeisen-DGTL:main Apr 16, 2024
8 checks passed
cibaa-team-user pushed a commit that referenced this pull request Apr 16, 2024
# [1.6.0](v1.5.0...v1.6.0) (2024-04-16)

### Features

* refactor kafka source API and enhance virtual sources resolving logic ([#37](#37)) ([8ea9c23](8ea9c23))
@cibaa-team-user
Collaborator

🎉 This PR is included in version 1.6.0 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀
