Add LocalTmpStorage for all services #17588
base: master
Needs tests for coverage. Add those before merging please.
@LazySingleton
public LocalTmpStorage getLocalTmpStorage()
{
  File tmpDir = new File(taskDirPath, "tmp");
@cheddar: should we create the temporary storage per attempt, or share it across attempts so that temp data can be reused?
Let's share it. The old default for cases where this would've been used is just java.io.tmpdir, which was shared as well, so it shouldn't cause problems...
import java.io.File;

public interface LocalTmpStorage
Pretty similar to TempDirProducer; to service this use case, maybe that could be enhanced with a lazy init and a getRoot?
TempDirProducer appears to have semantics around cleaning up with close(). That class is different from this one: this class is basically just a Config. It could be called a Config, but Rohan doesn't like the word config, so it got this name instead.
LocalTmpStorage is just delivering a location, nothing more, nothing less. It provides something that can be injected to get at a system-configured tmp storage location and is not intended to do anything beyond that. It probably deserves javadoc stating that this class shouldn't do anything other than deliver configuration, as it's just there to be an injectable configuration object.
It could absolutely make sense to have TempDirProducer depend on a LocalTmpStorage in order to get the tmp dir location that it's supposed to use.
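To make the split of responsibilities concrete, here is a minimal sketch in plain Java. The `LocalTmpStorage` interface mirrors the one in this PR (a single `getTmpDir()` method); `LazyScratchDir` is a hypothetical consumer invented for illustration, not Druid's actual `TempDirProducer`, and its lazy creation and `close()` cleanup are assumptions about how such a class might behave.

```java
import java.io.File;
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Comparator;
import java.util.stream.Stream;

// Mirrors the interface under review: it only reports a location.
interface LocalTmpStorage
{
  File getTmpDir();
}

// Hypothetical consumer (not Druid's TempDirProducer): it owns directory
// creation and cleanup, and asks LocalTmpStorage only for the root location.
class LazyScratchDir implements AutoCloseable
{
  private final LocalTmpStorage storage;
  private final String prefix;
  private File dir; // created on first call to get()

  LazyScratchDir(LocalTmpStorage storage, String prefix)
  {
    this.storage = storage;
    this.prefix = prefix;
  }

  // Lazily creates a uniquely named directory under the configured root.
  synchronized File get() throws IOException
  {
    if (dir == null) {
      Path root = storage.getTmpDir().toPath();
      Files.createDirectories(root);
      dir = Files.createTempDirectory(root, prefix).toFile();
    }
    return dir;
  }

  // Deletes the directory and everything in it, children before parents.
  @Override
  public synchronized void close() throws IOException
  {
    if (dir != null) {
      try (Stream<Path> paths = Files.walk(dir.toPath())) {
        paths.sorted(Comparator.reverseOrder()).forEach(p -> p.toFile().delete());
      }
      dir = null;
    }
  }
}
```

In this sketch the configuration object never touches the disk; only the consumer does, so cleanup semantics stay with the class that created the files.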
*/
File getTmpDir();

class DefaultLocalTmpStorageProvider implements Provider<LocalTmpStorage>
It doesn't seem like the system is prepared to clean up these files. Wouldn't that fill up the disk or create garbage?
The tmpdir provided from this is just a location. Whether things are cleaned up or not is a question of the implementation. In general, most of the code that deals with tmp files cleans them up after itself. If it's not cleaning up after itself, that should either be intentional and have a design reason, or it is a bug in the code that's dealing with the file, not a bug in the code that's delivering the location of the directory.
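As an illustration of "cleans up after itself", the following sketch shows a caller that takes only the location from `LocalTmpStorage` and deletes its own scratch file in a `finally` block. `SpillExample` and `spillAndMeasure` are hypothetical names made up for this example; only `getTmpDir()` comes from the PR.

```java
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;

// Mirrors the interface under review: it only reports a location.
interface LocalTmpStorage
{
  File getTmpDir();
}

class SpillExample
{
  // Hypothetical caller: writes a payload to a scratch file under the
  // configured tmp dir, reads back its size, and always removes the file.
  static long spillAndMeasure(LocalTmpStorage storage, String payload) throws IOException
  {
    File root = storage.getTmpDir();
    Files.createDirectories(root.toPath());
    File scratch = File.createTempFile("spill-", ".tmp", root);
    try {
      Files.write(scratch.toPath(), payload.getBytes(StandardCharsets.UTF_8));
      return scratch.length();
    }
    finally {
      // Cleanup is the responsibility of the code that created the file.
      Files.deleteIfExists(scratch.toPath());
    }
  }
}
```

If a caller like this skipped the `finally` block, the leak would be a bug in the caller; the object that hands out the directory would be unchanged either way.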
Suggested pull request title: feat: introduce LocalTmpStorage interface for unified temporary storage management
I appreciate the initiative to consolidate and standardize the temporary storage management across all services in Apache Druid. The code is well-structured, and the interface design is clean and straightforward. The implementation across different services is consistent, which is excellent.
However, there are a few areas that could be improved to make this change even better:
1. In LocalTmpStorage.java:
   - Consider adding documentation about the lifecycle management of the temporary directory
   - Add a method for cleanup of temporary files (like
This comment was added by our PR Review Assistant Bot. Please kindly acknowledge that while we're doing our best to keep these comments up to very high standards, they may occasionally be incorrect. Suggestions offered by the Bot are only intended as points for consideration and no statements by this bot alone can be considered grounds for merging of any pull request. Remember to seek a review from a human co-worker.
Suggested pull request title: I appreciate the effort to create a unified interface for managing temporary storage across all Druid services. The code is well-structured and follows good Java practices with clear interface definition and implementation. However, there are a few areas that could be improved to make this change more robust and maintainable:
Please address these points and add appropriate tests before merging. The overall direction of the change is good, but it needs additional safeguards and documentation to ensure reliable operation.
Please disregard the bot comments. I used this PR as an example for testing a bot reviewer, and these comments were never meant to be published.
Provides an interface for getting local temporary storage in all the services. In all future changes, users can directly inject a LocalTmpStorage object and use its temporary directory as a scratch pad. Along with the consolidation and the possible ability to track all temp data in the future, this also provides a common interface for all work involving local temporary storage.