Fetch non-food product data from Open Product Facts & Open Beauty Facts #114

Closed
raphodn opened this issue Jan 1, 2024 · 7 comments · Fixed by #264

Comments

@raphodn
Member

raphodn commented Jan 1, 2024

Story

Currently we assume that any new barcode belongs to a food product, and we fetch its data from Open Food Facts.

What if a user scans a book? Or a board game? Or a deodorant?

Solution

Sync with the OBF & OPF databases.
How should we handle duplicates? Or products that move from one database to another? Or products that are deleted?
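
A minimal sketch of how the duplicate, moved and deleted cases could be detected during a sync. Everything here is hypothetical: `SOURCE_PRIORITY`, `merge_sources` and `plan_sync` are illustrative names, not part of Open Prices or openfoodfacts-python, and the priority order (food first) is only an assumption based on the current default.

```python
from typing import Dict, List, Optional

# Assumed priority: when a barcode exists in several databases,
# the first source listed here wins.
SOURCE_PRIORITY = ("OFF", "OBF", "OPFF", "OPF")

Record = Dict[str, str]


def merge_sources(dumps: Dict[str, Dict[str, Record]]) -> Dict[str, Record]:
    """Merge per-database {barcode: record} mappings into a single mapping.

    Sources are applied from lowest to highest priority, so a record from a
    higher-priority source overwrites a duplicate from a lower-priority one.
    """
    merged: Dict[str, Record] = {}
    for source in reversed(SOURCE_PRIORITY):
        for code, record in dumps.get(source, {}).items():
            merged[code] = {**record, "source": source}
    return merged


def plan_sync(
    existing: Dict[str, Optional[str]], merged: Dict[str, Record]
) -> Dict[str, List[str]]:
    """Compare stored products ({barcode: source}) with the merged dumps.

    Returns barcodes to create, barcodes whose source changed ("moved"),
    and barcodes no longer present in any dump ("missing").
    """
    plan: Dict[str, List[str]] = {"create": [], "moved": [], "missing": []}
    for code, record in merged.items():
        if code not in existing:
            plan["create"].append(code)
        elif existing[code] != record["source"]:
            plan["moved"].append(code)
    plan["missing"] = [code for code in existing if code not in merged]
    return {key: sorted(codes) for key, codes in plan.items()}
```

What to do with "missing" products (delete them, or keep them and only clear the source) is a separate decision that this sketch leaves open.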

Extra info

| Database | Link | Number of products | CSV size |
| --- | --- | --- | --- |
| OFF | https://world.openfoodfacts.org | 3M+ | 855 MB |
| OBF | https://world.openbeautyfacts.org | 38k | 78 MB |
| OPFF | https://world.openpetfoodfacts.org | 10k | 22 MB |
| OPF | https://world.openproductsfacts.org | 17k | 39 MB |
| Obsolete products (OFF) | see issue #281 | 24k | |
@teolemon teolemon moved this to Backlog in 💸 Open Prices Jan 25, 2024
@raphodn raphodn added the 🙅‍♀️ Wontfix This will not be worked on label Jan 25, 2024
@raphodn
Member Author

raphodn commented Jan 25, 2024

Status: not a priority. We decided to concentrate our efforts on food products while the OPF & OBF databases are revamped in 2024.

Nevertheless, users are regularly adding non-food prices. This could be a good starting point to nudge users to add new products to the corresponding database 😉

@raphodn
Member Author

raphodn commented Mar 28, 2024

Opened an issue in the Python package to figure out how to easily fetch the dumps of the other databases:

openfoodfacts/openfoodfacts-python#220

@raphodn raphodn removed the 🙅‍♀️ Wontfix This will not be worked on label Mar 28, 2024
@raphodn
Member Author

raphodn commented Mar 29, 2024

@raphael0202 is it possible to get dumps from OBF and the other databases?

@teolemon
Member

https://world.openbeautyfacts.org/data
https://world.openpetfoodfacts.org/data
https://world.openproductsfacts.org/data

They should have similar URLs, but we probably don't have fancy new stuff like JSONL.
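
As a rough sketch of what "similar URLs" could mean in practice, the helper below builds the data-page URL listed above for each database, plus a guessed CSV dump URL. The dump pattern is an assumption copied from the Open Food Facts CSV export URL and has not been checked against the other databases; `FLAVOR_DOMAINS`, `data_page_url` and `csv_dump_url` are hypothetical names.

```python
# Domains taken from the links in this issue.
FLAVOR_DOMAINS = {
    "off": "openfoodfacts.org",
    "obf": "openbeautyfacts.org",
    "opff": "openpetfoodfacts.org",
    "opf": "openproductsfacts.org",
}


def data_page_url(flavor: str) -> str:
    """Return the human-readable data page for a database."""
    return f"https://world.{FLAVOR_DOMAINS[flavor]}/data"


def csv_dump_url(flavor: str) -> str:
    """Return a guessed CSV dump URL for a database.

    Assumption: every database mirrors the Open Food Facts naming scheme.
    Check the data page of each database before relying on this.
    """
    domain = FLAVOR_DOMAINS[flavor]
    return f"https://static.{domain}/data/en.{domain}.products.csv.gz"


if __name__ == "__main__":
    for flavor in FLAVOR_DOMAINS:
        print(flavor, data_page_url(flavor), csv_dump_url(flavor))
```

Keeping the domains in one mapping means a new database only needs one extra entry, and a wrong guess about the dump pattern only has to be fixed in one place.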

@raphodn
Member Author

raphodn commented Mar 29, 2024

OK, cool!

Maybe I'll try to improve the openfoodfacts-python package first? It seems limited to fetching only the OFF data dumps.

I'll see if I manage to get something working.

Issue opened: openfoodfacts/openfoodfacts-python#220

@raphodn
Member Author

raphodn commented Apr 1, 2024

> OK, cool! Maybe I'll try to improve the openfoodfacts-python package first? It seems limited to fetching only the OFF data dumps.

PR opened here: openfoodfacts/openfoodfacts-python#223

Projects
Archived in project
Status: Done
2 participants