You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hello @amitani,
Thank you for your interest in Amazon S3 Connector for PyTorch.
You're correct that PyTorch Lightning's current design and architecture limit the use of the CheckpointIO interface implementation, which our library provides, to only read/write/delete checkpoint operations. However, PyTorch Lightning is using its built-in fsspec library for other types of requests to S3, such as listing directories, which bypass our library's implementation and lead to the authentication error you encountered.
Given that this is an architectural limitation within PyTorch Lightning itself, and the CheckpointIO interface is not designed to handle all types of requests to S3, there is indeed little we can do from our end to address this issue directly.
In light of this information, I appreciate you taking the time to explain the root cause and the constraints we face due to PyTorch Lightning's current architecture. While we cannot resolve this issue within our library, your feedback provides valuable insights that can help us better understand the limitations.
I'm closing this issue for the time being. Please feel free to reopen it if you have any additional questions or concerns regarding this topic.
s3torchconnector version
s3torchconnector-1.2.3
s3torchconnectorclient version
s3torchconnectorclient-1.2.3
AWS Region
No response
Describe the running environment
Running locally on M1 Mac.
What happened?
This is related to experimental feature in 1.2.3 for specifying endpoint URL.
Specifying S3 URL as dirpath in Trainer triggered checking if the dir is empty using fsspec, instead of plugin.
This part tries to access S3 directly from Lightning without the plugin, leading to authentication error.
https://github.com/Lightning-AI/pytorch-lightning/blob/37e04d075a5532c69b8ac7457795b4345cca30cc/src/lightning/pytorch/callbacks/model_checkpoint.py#L274
Instead, all access to S3 when using the connector should go through the plugin.
Plugin I tried to use.
Relevant log output
No response
Code of Conduct
The text was updated successfully, but these errors were encountered: