Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Request: cache system with precious memory usage limitation #153

Open
zhangjinpeng87 opened this issue Jun 15, 2022 · 3 comments
Open
Labels
enhancement New feature or request

Comments

@zhangjinpeng87
Copy link
Member

Currently every sst is opened in mmap mode, we can't control the memory usage of agatedb with this mode, it will use as much as it can. This may cause OOM when agatedb embedded in other system (like in mongo's early days', its mmap engine caused many issues).

Implement a cache system that can control its total memory usage is very important for a storage engine, especially on the cloud environment where the resource is strictly limited. RocksDB's block-cache is good reference, it can limit the total memory usage and provide impressive access performance.

@zhangjinpeng87 zhangjinpeng87 added the enhancement New feature or request label Jun 15, 2022
@skyzh
Copy link
Member

skyzh commented Jun 15, 2022

Need to consider how to cache vlog. Currently vlog is not organized as "blocks".

@zhangjinpeng87 zhangjinpeng87 changed the title [Feature Request] cache system with precious memory usage limitation Feature Request: cache system with precious memory usage limitation Jun 27, 2022
@zhangjinpeng87
Copy link
Member Author

Need to consider how to cache vlog. Currently vlog is not organized as "blocks".

Yes, currently Titan( TiKV's key value separation engine ) also doesn't cache blob files. But it is necessary to consider how to cache vlog, maybe just using system's page cache is feasible?

@wangnengjie
Copy link
Contributor

Suppose value is large (default 1KiB), just okay to store it as one item in cache per value? And the ValuePtr can be the key in cache.

Maybe one cache for block and one cache for vlog value or just use one cache (use the cache_id concept in rocksdb).

But through this way, value cache can only work for user query (which get vptr first) while the backgroud gc that iter on vlog cannot visit the cache, still issue I/O on disk. (well...seems no iter impl for vlog, I need to have a look to its format)
(BTW, it seems that agatedb haven't implement vlog gc and I'm not familiar with badger...)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants