Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding a sorted map container? #105

Open
nyanpasu64 opened this issue Oct 3, 2019 · 22 comments
Open

Adding a sorted map container? #105

nyanpasu64 opened this issue Oct 3, 2019 · 22 comments

Comments

@nyanpasu64
Copy link

For a music editing program, I'm storing a series of timestamped events (note data) in an Immer container. I would like to use the timestamps as keys in a sorted map, but Immer lacks sorted maps. I'm trying to emulate it using a wrapper around a sorted vector of struct{time, event} (usually under 100 events so copying isn't awful), but having to reimplement all operations is extra work.

Should I just stuff an immer::box<std::map> or something in an Immer container?

@arximboldi
Copy link
Owner

Depending on how much data you have, an immer::box<std::map> may suffice. Also, depending on your usage pattern, keeping a sorted immer::flex_vector may suffice (doing a binary-search + insert at point) may work. In the future, I would like add an ordered_map and ordered_set based on persistent red-black trees (or maybe b-trees).

@arximboldi
Copy link
Owner

Ohh, I see you have <100 events. You may consider just an immer::box<std::vector> or immer::array<>. Again, it depends on usage patterns (how often you insert or remove events, and how you iterate over them)

@arximboldi
Copy link
Owner

You may also wanna take a look at this: https://en.cppreference.com/w/cpp/algorithm/make_heap

@nyanpasu64
Copy link
Author

nyanpasu64 commented Oct 3, 2019

Do I need to use immer::box<std::map> (cheap copies) and not std::map (expensive copies)?


All event lists will be iterated over, in sorted order (so not a heap) every time the screen is redrawn. It's only edited in response to user input, so 1-20 times a second.

I will be inserting and deleting elements at random positions in response to user input, sometimes at the end, sometimes (>50%) in the middle.

The audio thread needs to be able to pull a document tree out of an immer::atom, and read events (via indexing and binary search) without taking a lock.

And if the audio thread is holding onto a document revision after the main thread deletes it, the audio thread needs to be able to delete the entire tree in bounded time. According to http://www.rossbencina.com/code/real-time-audio-programming-101-time-waits-for-nothing , both malloc and free can take unbounded time and cause audio stuttering.

Is deleting immer structures (lots of hierarchy, but pool-allocated) bounded-time? If the audio thread deletes an immer::box<std::map>, does the audio thread delete a std::map which uses C++ heap allocations, which is not bounded time?

Honestly this concern is mostly moot, since it's quite unlikely that the audio thread will be holding onto the last reference to something. That would require the user to undo (pushing the current state to redo stack), then perform an action (clearing the redo stack), all within 1-50 milliseconds (depends on how audio is configured).


I'm not sure that a categorical "don't hold locks in the audio thread" is even justified. Someone claims that it only matters if your audio thread is running with realtime scheduler priority...

Also I love the flying spaghetti monster at https://sinusoid.es/talks/immer-cppcon17/#/3

@arximboldi
Copy link
Owner

arximboldi commented Oct 3, 2019 via email

@nyanpasu64
Copy link
Author

Thanks for the help! This is not a commercial project. I actually came up with the single-atom idea talking to user bowbahdoe (don't want to ping, idk?) experienced in Clojure.

It seems you're reading and replying via email, so you missed my edit where I said "I'm not sure that a categorical "don't hold locks in the audio thread" is even justified. Someone claims that it only matters if your audio thread is running with realtime scheduler priority..."

@voxoid0
Copy link

voxoid0 commented Dec 10, 2020

Right now I'm working on an immutable ordered map for one of my own projects that I'd be happy to contribute to immer. Inserts/removes optimally reuse subtrees instead of making deep copies, leaving inserts and removes at O(log n), and searches of course are O(log n). Like std::map, it's implemented via a red-black tree.

@arximboldi
Copy link
Owner

Thanks a lot for @voxoid0! I think that would be nice!

However, one can get better performance (due to cache utiliziation) using B-trees, that's what these libraries do:

immutable.js-sorted: https://github.com/applitopia/immutable-sorted
im-rs: https://github.com/bodil/im-rs

However, node-base trees can also be interesting in a functional language, like red-black trees, and others. If you are interested these are the ones that Data.Map on Haskell implements: http://groups.csail.mit.edu/mac/users/adams/BB/

@voxoid0
Copy link

voxoid0 commented Dec 11, 2020

Ah interesting. I searched for something that discusses the causes and trade-offs in B-tree performance for immutable ordered maps, but couldn't find it; do you know of something that goes into detail? Perhaps you can tell me if my guess is correct: a B-tree, by being able to use more than 2 branches per node, finds a tradeoff between the cache hits enabled by storing multiple child pointers in a contiguous array in memory, versus the additional cost O(c) of inserting or deleting on a node's child pointer list, where c is the avg number of children per node. (Both vector and list children would have O(c) bounded performance but for different reasons.) Insertion or deletion on the tree (and subsequent copying of a node's ancestry) would incur an additional cost in memory and perhaps performance, since when an ancestral line of nodes is rewritten after a write, a wider subtree is affected since each node branches more than 2. I wonder of the performance would depend on frequency of write vs read (or subtree iteration).

@arximboldi
Copy link
Owner

That is a correct general assumption. Reads are expected to be much faster because the tree is way shallower and more of the data is in contiguous blocks, less traversal and more cache efficiency. Ideally you would use block sizes similar to what we use for vectors (have you seen my talks on the topic?)

Inserting performance may be worse. I am not a 100% sure, because it depends on the amount of rebalancing you need on the red-black-tree, and on implementation details. I would in general be happy trade off a bit of write performance for faster reading.

@voxoid0
Copy link

voxoid0 commented Dec 14, 2020

Nice paper and talk; I looked over them the past couple of days. It will be interesting to see how your findings translate to an unordered map.

I realized last night that I need to work top-down in my project a bit more first to confirm the appropriate data structures, before moving on with the immutable ordered (multi) map impl. Will give an update afterwards. It happens to be a music project as well :) (algorithmic composition).

@arximboldi
Copy link
Owner

Oh nice! I am actually actually working in that field now with these guys: https://bronze.ai/
:)

And yeah, it sounds good to try not over-engineer with data-structures first. For now for example there is a place where I am needing one and I am simply using a immer::flex_vector + std::lower_bound on insertion and lookup to keep things sorted. Not optimal but I don't have time now to implement B-trees :p

@jeffplaisance
Copy link

i've been working on an immutable b+ tree that would cover this use case as well as #158. by default the internal nodes just store pointers and there are 3 mixins that can decorate them with child counts, max keys, and/or weights which means it can implement either an indexed or an ordered interface (or both although this doesn't seem particularly useful in practice) as well as allowing the implementation of various ordered statistics datastructures and other things requiring fast prefix sum. on a dataset of 10 million 32 bit ints, the random lookup performace is around 2x slower than flex vector but the random insert performance is around 4x faster, and the design made it easy to support random insert on the transient version which is around 100x faster than random insert on a persistent flex vector. the code is still a work in progress and neither the interfaces nor the style are particularly consistent with immer right now, but if this is something you'd be interested in incorporating into immer i can share it and we can go from there.

@jeffplaisance
Copy link

well it's almost 3 years later, but if anyone is still interested in an immutable sorted map container, i finally finished my immutable b+ tree library https://github.com/jeffplaisance/BppTree

@arximboldi
Copy link
Owner

arximboldi commented Apr 25, 2023

That is trully amazing @jeffplaisance. Great work! A quick glance is telling me that this is a high quality library...

The API and coding style is a bit different than the Immer one, but I would love adapting it and integrating it into Immer...

@jeffplaisance
Copy link

Thanks! It would be awesome for this to be part of immer. I changed all the camelCase to snake_case (all the types are still PascalCase) which may make things slightly easier. Some of the API differences are probably due to the requirement to have unique names for every method across all the mixins so they don't hide each other when combined, which is not ideal, but seemed like the least bad option. There might be a way around it by putting all the API methods in the BppTreeDetail::Shared/Transient/Persistent classes and leveraging SFINAE to disable the ones that aren't applicable, but I was worried that might make IDE autocomplete even less effective than it currently is.

@arximboldi
Copy link
Owner

Yes... I think another thing to think about is how memory is managed in the two libraries. But it would be super interesting to have this in... Hopefully I find some time to go through this in the future, or someone else does ;)

@jeaye
Copy link

jeaye commented Jul 6, 2024

Following up on this thread to say that jank (native Clojure dialect on LLVM) is currently using immer for maps/vectors/sets and I'm now finding myself in need of sorted maps and sorted sets. Having these in immer would be superb, but I'll take a look at BppTree in the meantime.

Thanks for all of the work here, folks! ❤️

@arximboldi
Copy link
Owner

@jeaye can you ping back here when you come up with a solution for sorted map/sets for Jank to see if we can backport it into Immer?

@jeaye
Copy link

jeaye commented Nov 7, 2024

@jeaye can you ping back here when you come up with a solution for sorted map/sets for Jank to see if we can backport it into Immer?

Hey @arximboldi! jank is using https://github.com/jeffplaisance/BppTree to provide sorted maps/sets (persistent and transient), which works well alongside immer. I had to make a small PR to get things fitting nicely into jank, but otherwise it was smooth.

The only other thing missing is the ability to customize the allocator or memory policy. immer makes this a breeze and jank just uses:

  using memory_policy = immer::memory_policy<immer::heap_policy<immer::gc_heap>,
                                             immer::no_refcount_policy,
                                             immer::default_lock_policy,
                                             immer::gc_transience_policy,
                                             false>;

For BppTree, sorted maps and sets aren't going to be GC-allocated for now, so they're going to leak. That means merging BppTree into immer would actually be a big win for jank, since we'd be able to address that problem. Otherwise, I'll be ultimately reaching out to @jeffplaisance to see what can be done, once memory leaks are my biggest concern. 😄

@jeaye
Copy link

jeaye commented Nov 9, 2024

Ah, one other thing I just found I had a TODO for. Clojure supports a sorted-set-by fn, which builds a sorted set given a runtime comparator function. This would require the comparator to be created at runtime alongside the sorted set instance. As far as I know, BppTree doesn't support that. So it boils down to two things currently missing, for full Clojure/jank support:

  1. Custom allocator/memory policy
  2. Runtime comparator objects

@jeaye
Copy link

jeaye commented Dec 14, 2024

Following up after a couple more months of using BppTree to provide an updated summary with a new concern.

  1. Compile times are slow! Using Clang's -ftime-trace, BppTree is one of the highest ranking template sets to instantiate for me, taking around 40s of my 950s build. For jank, with JIT compilation, slow compile times also mean slow startup times. Note: immer doesn't show up at all in that time tracing
  2. Since it lacks custom allocator support, and jank is using Boehm, I'm currently leaking any sorted maps/sets
  3. Since it lacks runtime comparators, I can't implement clojure.core/sorted-set-by

Sooo, although it was easy to add in, it's not really something I can stick with when jank goes to production in 2025. If there's no good solution, I may just strip out sorted sets/maps from jank until there is. If immer were to support these, especially while addressing the three concerns above, I'd likely be your first adopter.

I'd be able to comp a couple grand $$ for this to be addressed, one way or another, but as I'm quitting my job to focus on jank in January, that's all I can muster. 🙇

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants