Replies: 8 comments
-
Yes, that's definitely on the list. In principle, you should already be able to use everything that you can connect to via SSH and has multicore set up. However, I have never tested anything like that. For multiple remote machines, this will require some changes in how |
Beta Was this translation helpful? Give feedback.
-
Have you looked at Docker and Kubernetes to do parallel processing in the cloud? A kubernetes cluster is a lot easier to set up on AWS or Azure than a conventional cluster would be, plus you get scaling thrown in. RStudio Server Pro has just added this feature interestingly enough. I'm looking at makeClusterFunctions in batchtools and makeClusterPSOCK in future but I think Kubernetes might be better. Thanks for the great packages. |
Beta Was this translation helpful? Give feedback.
-
While we currently building up a HPC, we have several standalone machines. It would be great if we could use the SSH connector to distribute jobs across all machines. This would perfectly work together with |
Beta Was this translation helpful? Give feedback.
-
Thank you for the hints re kubernetes, @chapmandu2. @pat-s Is there a reason why you don't set up a scheduler on your HPC? That would not only support |
Beta Was this translation helpful? Give feedback.
-
As said, we're already building a HPC with warewulf and slurm. Until then, we have several standalone servers that are used for production and cannot be turned off until there is a production ready replacement 🙂 our main goal is to combine all of them but until then, the multiple ssh approach would be a nice thing to have. |
Beta Was this translation helpful? Give feedback.
-
I dropped words (the "while") while reading again, you did say. Sorry. I'm afraid I won't have multiple SSH hosts set up in the next couple of weeks. |
Beta Was this translation helpful? Give feedback.
-
What about AWS Batch? Metaflow uses it. |
Beta Was this translation helpful? Give feedback.
-
Looks like the |
Beta Was this translation helpful? Give feedback.
-
Can
clustermq
use workers on AWS, Digital Ocean, arbitrary remote Docker containers, etc.? It seems straightforward, for example, to use the ssh scheduler to deploy to workers on the same AWS instance. But what about a single pool of workers spread over multiple instances?I was at an R conference last week, and there seems to be uncertainty and debate about the long-term future of traditional HPC systems. cc @dpastoor
Beta Was this translation helpful? Give feedback.
All reactions