-
Notifications
You must be signed in to change notification settings - Fork 196
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Time Partitioned Incremental loading #1094
Comments
@timhiebenthal I think you can use it instead or with primary key to replace partitions of data (ie. days). I think our |
for the sake of cross-referencing, this seems to be the same usecase as the report here: #971 (comment) |
@karakanb my learning from linked issue is that we should disable deduplication if merge key is present and primary key is set... which is IMO expected behavior as now the "deduplication" should happen via merge key upstream. that should fix the the issue you describe |
moved to #1131 |
Feature description
Currently you need to specify a primary key to load data incrementally through a merge Disposition.
It's not uncommon - especially with "reporting API's" - that you don't have any specific unique key and also don't need one.
Currency you would need to build a surrogate/primary key to do incremental loads.
How about replacing the data not through the primary_key but through a time period?
Example:
Adding also a end_date makes sense but I wanted to keep the example simple
Are you a dlt user?
Yes, I run dlt in production.
Use case
Sometimes building the primary key to load incrementally is cumbersome to build (e.g. because in nested dictionaries)
So you need to do more transformations before loading than needed.
Proposed solution
Specify
"Key" would work as currently.
"Time" would
Related issues
No response
The text was updated successfully, but these errors were encountered: