Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cp --from-fs --to-bucket #31

Open
asilvas opened this issue Apr 26, 2021 · 4 comments
Open

cp --from-fs --to-bucket #31

asilvas opened this issue Apr 26, 2021 · 4 comments
Labels
enhancement New feature or request

Comments

@asilvas
Copy link

asilvas commented Apr 26, 2021

Would be super handy for transferring data into S3.

@shanebdavis
Copy link
Member

Yes! This is definitely on my list. You'd think it would be easy... but the way S3P does it's massively parallel bucket-scanning doesn't translate very well to the local file system. I was able to make a workable solution from-bucket-to-fs, but it's harder the other way around.

Still, it can and should be done!

@shanebdavis shanebdavis added the enhancement New feature or request label Jul 14, 2021
@mbolek
Copy link

mbolek commented Aug 12, 2021

I was able to make a workable solution from-bucket-to-fs, but it's harder the other way around.

Could it be made available? :) @shanebdavis

@shanebdavis
Copy link
Member

shanebdavis commented Aug 19, 2021

There is a 'quick and dirty' way to make this happen, but it has scaling limitations. Basically, I could load the list of all the source files to copy in memory and then simulate the S3 listBucket API against that list.

Would it be helpful if this was available, but it might blow up if you have too many local files? I'm guessing under 100,000 would be no problem. Possibly even under a million files would work. More than a million files might start to get out of control.

@mbolek @asilvas What do you think?

@mbolek
Copy link

mbolek commented Aug 26, 2021

@shanebdavis sorry, I might've hijacked the thread a bit, I was thinking about the S3 to local FS download specifically, which you've mentioned as I was able to make a workable solution from-bucket-to-fs :)
Or is this already the part you've just mentioned? Then I guess, millions of files might not be enough since buckets grow way to easily... 🙄 ;)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants