pull_requests table contains many duplicates since upgrade to v2 #194

Open
Nicoowr opened this issue Jun 5, 2023 · 4 comments


@Nicoowr

Nicoowr commented Jun 5, 2023

Hi, we upgraded our GitHub integration to v2 on Stitch one week ago. Since then, the pull_requests table seems to contain many duplicates of some pull requests. This is weird, since the primary key is indeed id.
[Screenshot: SCR-20230605-qjda]
[Screenshot: SCR-20230605-qjmu]

Could you please help us with this?

@dmosorast
Contributor

Hello 👋 This is interesting. As far as I can tell, the tap is doing everything it needs to in order to enable deduplication downstream via ID. You can see that here when it writes the SCHEMA message per stream with the key_properties for that stream and here where it specifies the key_properties for the PullRequests stream object. Nevertheless, when I just tested it, I also got duplicates for the most recent record (from the inclusive query on the incremental extraction).
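To make the mechanism above concrete, here is a minimal sketch of what "deduplication downstream via ID" relies on. It is not the tap's or Stitch's actual code. The message shape follows the Singer SCHEMA message format (`type`, `stream`, `schema`, `key_properties`), but the schema fields, the sample records, and the `dedupe_by_key` helper are assumptions made up for illustration.

```python
import json

# Hypothetical SCHEMA message in the Singer format. The real tap's schema
# for pull_requests has many more fields; only "id" matters here.
schema_message = {
    "type": "SCHEMA",
    "stream": "pull_requests",
    "schema": {
        "type": "object",
        "properties": {
            "id": {"type": "string"},
            "updated_at": {"type": "string"},
        },
    },
    "key_properties": ["id"],
}


def dedupe_by_key(records, key_properties):
    """Keep the last record seen for each primary-key tuple (upsert-like).

    Hypothetical helper: a target that honors key_properties would do
    something equivalent when loading records.
    """
    latest = {}
    for record in records:
        key = tuple(record[k] for k in key_properties)
        latest[key] = record  # later records overwrite earlier ones
    return list(latest.values())


# Made-up records. The last one is emitted twice, which is what an
# inclusive bookmark comparison would produce on an incremental run.
records = [
    {"id": "PR_1", "updated_at": "2023-06-01T00:00:00Z"},
    {"id": "PR_2", "updated_at": "2023-06-02T00:00:00Z"},
    {"id": "PR_2", "updated_at": "2023-06-02T00:00:00Z"},
]

deduped = dedupe_by_key(records, schema_message["key_properties"])

print(json.dumps(schema_message))
print(len(records), "records extracted,", len(deduped), "after dedup on id")
```

If the loader appends rows instead of upserting on `key_properties`, the re-emitted record stays in the table as a duplicate, which would match the symptom reported here.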

This might be something better served through Stitch's support channels (this tap is marked as supported by Stitch in the docs) rather than leaving it up to a community contributor to pick up.

@BenPeddie

@Nicoowr did you get anywhere with this? Also running into the same issue.

@Nicoowr
Author

Nicoowr commented Aug 29, 2023

@BenPeddie Our team is in contact with Stitch support, but there is nothing new yet.
We will keep this thread updated.

@nkolster

nkolster commented Jan 2, 2024

Still no update? Why are you in contact with Stitch?


4 participants