-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
minor changes for reprocessing in STHML #45
base: master
Are you sure you want to change the base?
Conversation
I don't think you want 40 cores. @JelleAalbers can you comment? I thought pax got unstable past 20 and you'll also hit I/O issues. @XeBoris have you done the test where you see that 40 is better than 20 or 10? |
The question for 40 cores is arbitrary here. I just put in a high number because a single login node has up to 48 cores here in Stockholm. I just checked some log files and couldn't find anything unexpected. I didn't know that pax becomes unstable beyond 20 cores. |
I thought we saw crashes at some point. It may not be the case anymore. It's just worth checking. |
Pax shouldn't be unstable anymore with high number of cores, this was the case when we didn't have proper file lock protection of the root class compilation. Reading and writing are not parallelized, so once either of these becomes the bottleneck there's no point in parallelizing further. At what number of cores that occurs depends on your system hardware and the data you're processing; you can find out by processing a hundred events or so with one core so you get a timing report. For example, on an event builder machine processing a recent dataset (160923_0018), I get:
Since reading (first plugin) and writing (last plugin) each take less than 1% of the time, we could easily go to as much cores as the machine has (40). Of course there are other overheads as well that we do not quantify. The only way to know if you're still getting improvements when you raise the number of cores is to measure and compare the performance. |
@JelleAalbers Thank you for clarifying this point. |
Just minor changes for Stockholm processing. (More CPUs and another Anaconda locational)