[FEA] Limit Host Memory Usage #8874
Labels
epic
Issue that encompasses a significant feature or body of work
reliability
Features to improve reliability or bugs that severly impact the reliability of the plugin
Is your feature request related to a problem? Please describe.
Currently the Spark RAPIDs plugin uses as much host memory as it feels it needs to. We have put some pools in place with the idea that a user wouldn’t really need to go beyond that, but in practice it can happen and then YARN, Kubernetes, docker, etc kills the executors because they tried to use more memory than the container is configured to use.
This epic is to put in place all of the changes needed so we effectively have a hard limit on the amount of host memory that the plugin uses and that amount is easy to configure.
Framework Changes:
Operator Changes:
Debugability:
The text was updated successfully, but these errors were encountered: