ReadStream copies values into memory too soon #17

jonboulle · 2014-12-11T02:51:52Z

As currently implemented, Diskv.ReadStream is not a purely streaming reader, because it buffers an entire copy of the value (in the siphon) before attempting to put the value into the cache. Unfortunately with large values this is a recipe for memory exhaustion.

Ideally we would stream the value directly into the cache as the io.ReadCloser that ReadStream returns is consumed, checking the cache size as we go. I started to go down this path, but it creates another race condition because then writing into the cache is not atomic: we cannot know when the ReadCloser will be finished consuming the entry, and it's very possible for others to begin Reading the same key-value pair as we're still writing it. So the next step down that road is for readers to actually take a lock per cache entry (which would then get released once the caller Closes the ReadCloser). This quickly became a web of mutexes and synchronisation hacks which felt very unidiomatic golang.

Various simple "solutions" exist (e.g. prechecking the size of the file against the cache size before starting to siphon), but they are all inherently racy and could still lead to memory exhaustion under stressful conditions. (We could also just take a global write lock during reads, but that wouldn't be very nice to other readers, would it?)

@peterbourgon @philips thoughts?

The text was updated successfully, but these errors were encountered:

peterbourgon · 2014-12-11T07:25:39Z

Definitely can't take a global write lock. Maybe there's a way to change the cache structure (significantly) to allow per-key synchronization. I bet a smarter play would be to siphon into a buffer off-cache, making regular checkpoints to ensure we don't exceed the max, and then atomically moving the siphoned data into the cache on completion. Something along these lines...

I think it's feasible. Give me a moment to think on it.

jonboulle · 2014-12-11T08:18:24Z

Yeah, that's kind of the direction I was going. Main thing I don't like about that is then potentially you have multiple readers trying to generate redundant cache entries alongside each other. But maybe I'm overoptimising.

In the meantime, I verified that the existing behaviour is already kinda racy :-/
https://gist.github.com/jonboulle/ab3f77bfb8c85fb4c022

peterbourgon · 2014-12-11T19:31:04Z

Previous write by goroutine 26:

Haha, whoops! :person_frowning:

@jonboulle

Partially addresses #17. Big thanks to @jonboulle for spotting that head-slapper.

peterbourgon · 2014-12-16T12:24:28Z

@jonboulle just to be clear, this is already fixed for your specific use-case with #21, correct?

jonboulle · 2014-12-16T19:00:56Z

@peterbourgon well technically I would say that for our specific use case we are just sidestepping the problem :-). I would still love to see this fixed and am kind of mulling on it in the back of my brain - if you wouldn't mind leaving it open a bit longer maybe I can come up with a patch...

peterbourgon · 2014-12-16T19:10:47Z

Understood. Yeah, I'm mulling it too, just not finding quite as much mull-time as I'd like :)

mkilpatrick · 2020-06-17T17:34:31Z

Is there any plan to address this issue? We've found that setting a CacheSizeMax results in using a lot more heap memory and then ultimately still doesn't limit disk cache through restarts.

jonboulle mentioned this issue Dec 11, 2014

stage0: oom while extracting images rkt/rkt#267

Closed

jonboulle mentioned this issue Dec 11, 2014

diskv: add Move function #16

Closed

peterbourgon added a commit that referenced this issue Dec 11, 2014

Ensure ReadStream with direct=true isn't racy

129c9e4

Partially addresses #17. Big thanks to @jonboulle for spotting that head-slapper.

peterbourgon self-assigned this Dec 12, 2014

jonboulle mentioned this issue Dec 12, 2014

cas: never cache values in memory rkt/rkt#279

Merged

peterbourgon added bug helpwanted labels Jun 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ReadStream copies values into memory too soon #17

ReadStream copies values into memory too soon #17

jonboulle commented Dec 11, 2014

peterbourgon commented Dec 11, 2014

jonboulle commented Dec 11, 2014

peterbourgon commented Dec 11, 2014

peterbourgon commented Dec 16, 2014

jonboulle commented Dec 16, 2014

peterbourgon commented Dec 16, 2014

mkilpatrick commented Jun 17, 2020

ReadStream copies values into memory too soon #17

ReadStream copies values into memory too soon #17

Comments

jonboulle commented Dec 11, 2014

peterbourgon commented Dec 11, 2014

jonboulle commented Dec 11, 2014

peterbourgon commented Dec 11, 2014

peterbourgon commented Dec 16, 2014

jonboulle commented Dec 16, 2014

peterbourgon commented Dec 16, 2014

mkilpatrick commented Jun 17, 2020