Investigate exposing to the jobs the possibility of querying if the item with a given key already exists #49

Open
ivan-brko opened this issue Jan 26, 2021 · 0 comments

@ivan-brko (Owner)

Use case:
If we are writing a parser for a news site and don't want to store duplicates, we currently still send requests for the duplicate articles, only to have them discarded later when they are stored to the DB.
Instead, investigate adding a function like the following to the job API, which would let the jobs/scripts themselves avoid querying duplicates:

def get_nonduplicate_list(list_containing_all_keys: list) -> list:
    """Return only the keys that are not already stored (the non-dupes)."""
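
A minimal sketch of how such a function might behave, purely as an illustration. The `stored_keys` parameter is an assumption that stands in for whatever lookup the real storage layer would provide; it is not part of the proposed signature, and the example URLs are made up.

```python
from typing import Hashable, Iterable, List, Set


def get_nonduplicate_list(
    list_containing_all_keys: Iterable[Hashable],
    stored_keys: Set[Hashable],  # hypothetical: keys already present in storage
) -> List[Hashable]:
    """Return the keys that are not yet stored, keeping their original order."""
    seen = set()  # also drop keys repeated within the input itself
    non_dupes = []
    for key in list_containing_all_keys:
        if key in stored_keys or key in seen:
            continue
        seen.add(key)
        non_dupes.append(key)
    return non_dupes


# Hypothetical usage inside a job: only request articles that are not stored yet.
stored = {"https://example.com/a", "https://example.com/b"}
candidates = [
    "https://example.com/a",
    "https://example.com/c",
    "https://example.com/c",
    "https://example.com/d",
]
to_fetch = get_nonduplicate_list(candidates, stored)
```

With a filter like this, a job could skip the requests for articles whose keys already exist instead of fetching them and having them discarded at storage time.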

ivan-brko added this to the v0.3 milestone Jan 28, 2021