
Allocator<T>::uncompress is a performance nightmare #867

Closed
dirkwhoffmann opened this issue Jan 22, 2025 · 3 comments
Comments

@dirkwhoffmann (Owner)

Uncompressing a snapshot can take multiple seconds if it contains a hard drive. The reason is that the implementation uses std::vector, which is awfully slow, at least in debug builds.

TODO: Rewrite the code using good old C arrays.
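The actual snapshot format and the Allocator<T> internals are not shown in this issue, so the following is only a minimal sketch of the "good old C arrays" approach under an assumed, simplified RLE layout (pairs of value and repeat count). The point it illustrates is manual buffer management with geometric growth, replacing std::vector's per-element bookkeeping; the function name rleDecode and the format are hypothetical:

```cpp
#include <cassert>
#include <cstdint>
#include <cstdlib>
#include <cstring>

// Hypothetical RLE layout: a flat sequence of (value, count) byte pairs.
// Decodes into a malloc'd buffer that is grown geometrically by hand,
// the way the rewritten C-array code would have to manage it.
static uint8_t *rleDecode(const uint8_t *src, size_t srcLen, size_t *outLen)
{
    size_t cap = 1024, len = 0;
    uint8_t *buf = (uint8_t *)malloc(cap);

    for (size_t i = 0; i + 1 < srcLen; i += 2) {

        uint8_t value = src[i];
        size_t count = src[i + 1];

        // Double the capacity until the run fits (amortizes realloc cost)
        while (len + count > cap) {
            cap *= 2;
            buf = (uint8_t *)realloc(buf, cap);
        }
        memset(buf + len, value, count);
        len += count;
    }
    *outLen = len;
    return buf; // caller frees
}
```

In a release build, std::vector compiles down to essentially the same machine code; the win here is in debug builds, where the vector's per-element checks are not optimized away.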

@dirkwhoffmann (Owner, Author)

Comparison between old and new code. Test case is a 138 MB snapshot shrinking down to 9.3 MB after compression:

  • Debug build

    • Old code
      Uncompressing 9790563 bytes (hash: ff80197f)... 2.48 sec
      Compressing 145308450 bytes (hash: f9ea0396)... 0.39 sec
    • New code
      Uncompressing 9790563 bytes (hash: ff80197f)... 0.17 sec
      Compressing 145308450 bytes (hash: f9ea0396)... 0.40 sec
  • Release build

    • Old code
      Uncompressing 9790563 bytes (hash: ff80197f)... 0.17 sec
      Compressing 145308450 bytes (hash: cf0b5b96)... 0.11 sec
    • New code
      Uncompressing 9790563 bytes (hash: ff80197f)... 0.07 sec
      Compressing 145308450 bytes (hash: cf0b5b96)... 0.14 sec

Pros:

  • No more beachballing in debug builds.

Cons:

  • The new code is more complicated because I have to deal with dynamic buffer resizing myself.
  • In release builds, the 2.5x speed boost does not really matter as the 0.17 sec consumed by the old code is still acceptable.
  • The old vector-based compression code outperforms my C-array-style code. This is interesting by itself as I have no idea how this is possible.

Overall, the cons outweigh the pros. Therefore, I'll revert to the old code.

dirkwhoffmann added a commit that referenced this issue Jan 23, 2025
mras0 commented Jan 23, 2025

Might be faster to just use this rather than the decode lambda:

vec.insert(vec.end(), isize(ptr[++i]), prev);

@dirkwhoffmann (Owner, Author)

Might be faster to just use this rather than the decode lambda:

Indeed. This makes a huge difference. Very cool!

Debug build:

  • Old: Uncompressing 9790563 bytes... 2.45 sec
  • New: Uncompressing 9790563 bytes... 1.50 sec

Release build:

  • Old: Uncompressing 9790563 bytes... 0.15 sec
  • New: Uncompressing 9790563 bytes... 0.09 sec
