mycpp: add 128B pool #1958
base: soil-staging
Conversation
This improves performance on memory-bound workloads.
```
@@ -265,10 +265,12 @@ class MarkSweepHeap {
#ifndef NO_POOL_ALLOC
  // 16,384 / 24 bytes = 682 cells (rounded), 16,368 bytes
  // 16,384 / 48 bytes = 341 cells (rounded), 16,368 bytes
  // 16,384 / 96 bytes = 171 cells (rounded), 16,368 bytes
```
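For reference, a minimal sketch of the sizing arithmetic behind these comments, assuming one 16,384-byte block per pool (the names are illustrative, not the actual mycpp Pool code):

```cpp
#include <cstdio>

// Hypothetical constants mirroring the comments above; not the real
// mycpp Pool implementation.
constexpr int kBlockBytes = 16384;

constexpr int NumCells(int cell_size) {
  return kBlockBytes / cell_size;  // truncating division
}

static_assert(NumCells(24) == 682, "682 * 24 = 16,368 bytes used");
static_assert(NumCells(48) == 341, "341 * 48 = 16,368 bytes used");
static_assert(NumCells(128) == 128, "128 * 128 = 16,384 bytes, no slack");

int main() {
  printf("24B cells:  %d per block\n", NumCells(24));
  printf("48B cells:  %d per block\n", NumCells(48));
  printf("128B cells: %d per block\n", NumCells(128));
}
```

One incidental property of 128-byte cells is that they divide the 16 KiB block exactly, leaving no slack within the block.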
Hm this still says 96
I kinda want to do some evaluation, to make sure it's not over-tuned for Linux / glibc / 64-bit
And does this waste memory on any workloads?
Also, I think we had the theory again that malloc() requests need some slack, i.e. should be strictly less than 16,384 bytes, to allow for allocator headers. But I guess we never measured that effect.
We already run on many different libcs with different malloc() implementations, so yeah, I think we can at least survey Alpine Linux
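That effect could be measured directly with malloc_usable_size(), which glibc and musl (Alpine's libc) both provide. A quick probe might look like this, where the slack values are guesses rather than measured constants:

```cpp
#include <malloc.h>  // malloc_usable_size() on glibc / musl
#include <cstdio>
#include <cstdlib>

int main() {
  // If the allocator needs header space, a request of exactly 16,384
  // bytes may get rounded up past the 16 KiB size class, while a
  // slightly smaller request fits within it.
  size_t requests[] = {16384, 16376, 16368};
  for (size_t r : requests) {
    void* p = malloc(r);
    printf("request %zu -> usable %zu\n", r, malloc_usable_size(p));
    free(p);
  }
}
```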
Oh whoops. That's a typo leftover from a previous version. Will fix.
Happy to leave this open until we have a better idea of the effect with different allocators. I suspect it will be a win in most cases, though, since the write to the allocation header (whether it's below the buffer or elsewhere) is going to cause page faults regardless, and reducing those was the main win here.
OK, if leaving this open isn't blocking anything, it will be a good opportunity to test out my new performance harness. It bugged me that the CI is not good enough to really measure performance; we need other hardware and other systems. I think a lot of projects actually have real hardware labs for this, e.g. the Go language has a bunch of machines all over. But we can probably do pretty well on a small budget :)
BTW I still want to merge this, but I want to use it as an example for some perf evaluation on different machines with different libc allocators.
Oh, this was accidentally closed because I deleted the
OK, it merged cleanly, so I pushed it to CI again.

I was thinking about the pools again, but I didn't remember why at first. Oh yeah, it was because Aidan and I were investigating a word splitting bug, and I was thinking about how to do fewer allocs in word splitting, which is hot. It felt like we needed more 32-byte Slices, like our 32-byte Tokens. So I was wondering if we should make the pools 24/32/48, or 24/48/128, etc. That is, there is still a "hot" part of the code that isn't "done" yet, because there are IFS bugs.

I also felt like this depends a lot on the specific machine, e.g. a 128-byte pool could be more of a de-optimization for a 32-bit machine, although those are getting rarer and rarer.
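As a sketch of the trade-off between those configurations, hypothetical size-class routing for the 24/48/128 option might look like this (toy code, not the actual mycpp allocator):

```cpp
#include <cstdio>
#include <cstdlib>

// Toy stand-in for a fixed-cell-size pool; allocation just forwards to
// malloc() so the example runs, unlike a real free-list pool.
template <int CELL_SIZE>
struct Pool {
  void* Allocate() { return malloc(CELL_SIZE); }
};

Pool<24> pool24;
Pool<48> pool48;
Pool<128> pool128;

// With 24/48/128, a 32-byte Slice or Token lands in the 48-byte pool
// and wastes 16 bytes per object; with 24/32/48 it would fit exactly,
// but anything from 49 to 128 bytes would fall through to malloc().
void* Allocate(size_t n) {
  if (n <= 24) return pool24.Allocate();
  if (n <= 48) return pool48.Allocate();
  if (n <= 128) return pool128.Allocate();
  return malloc(n);
}

int main() {
  void* p = Allocate(32);  // routed to the 48-byte size class
  printf("32-byte request served from the 48B pool: %p\n", p);
  free(p);
}
```

The design question is which costs less overall: internal fragmentation inside a larger cell, or falling through to malloc() for mid-sized objects.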
On the cachegrind benchmarks, this does help a little, on both the parsing and fibonacci workloads:

http://op.oilshell.org/uuu/github-jobs/8247/benchmarks2.wwz/_tmp/gc-cachegrind/index.html
http://op.oilshell.org/uuu/github-jobs/8249/benchmarks2.wwz/_tmp/gc-cachegrind/index.html
I guess one reason I've been hesitant is that we could be using more memory and not notice it on the benchmarks. It is common to trade speed for memory, and then the memory doesn't get measured.

I hope we can do the IFS fixes, and then see where word eval is at. Actually, I checked in some word eval benchmarks the other day: https://oilshell.zulipchat.com/#narrow/channel/121539-oil-dev/topic/word.20split.20benchmarks. We're actually faster than bash! But slower than other shells. Word eval is known to be hot, and speeding it up even sped up CPython configure, as far as I remember.

So I hope we can get that in, and then compare say 24/32/48 vs. 24/48/128. Although we will still have the memory issue then, and it's still true that we could be over-fitting for certain hardware.
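One cheap way to make the memory side visible is to record peak RSS with getrusage(); a minimal sketch, assuming Linux, where ru_maxrss is reported in kilobytes:

```cpp
#include <sys/resource.h>
#include <cstdio>

// Report peak resident set size at exit, so a speed-for-memory trade
// shows up in benchmark output instead of going unnoticed.
int main() {
  struct rusage ru;
  if (getrusage(RUSAGE_SELF, &ru) == 0) {
    printf("max RSS: %ld KiB\n", ru.ru_maxrss);  // kilobytes on Linux
  }
  return 0;
}
```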
Also, I could be missing something in terms of the benchmarks, since there are so many dimensions and workloads. I'm sure there are other ways to look at this.
This improves performance on memory-bound workloads and helps reduce page faults in workloads like CPython configure.