Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Make TPC-H data publicly available #1517

Open
jaychia opened this issue May 28, 2024 · 2 comments
Open

Make TPC-H data publicly available #1517

jaychia opened this issue May 28, 2024 · 2 comments

Comments

@jaychia
Copy link

jaychia commented May 28, 2024

Hi, I am trying to access the TPC-H benchmarking data but running into some permissioning issues:

image

Am I accessing the right data, or could the data be made public please?

@mrocklin
Copy link
Member

Ah, I thought that this was in coiled-datasets-rp, the requester pays bucket.

@fjetter who I think manages a lot of this is on PTO this week. @hendrikmakait or @phofl thoughts? (also ok to wait until Florian gets back)

@fjetter
Copy link
Member

fjetter commented Jun 3, 2024

I triggered a copy of the data to our requester pays bucket. The data will be available under

coiled-datasets-rp/tpc-h/snappy/scale-*/

with the scales 1, 10, 100, 1k, 10k

At the time of writing 1k and 10k is still in progress but should be available shortly.

I will amend our readme once this is through

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants