Replies: 6 comments 8 replies
-
It looks reasonable to me, but that's not worth much. I have no qualifications for juggling pull requests. Is putting the GGML Rust bindings in a separate repo/project something that's on the roadmap? (I've been looking at trying to use it with my RWKV stuff for example, but that's going to require adding some ops and such.) |
Beta Was this translation helpful? Give feedback.
-
I'm fine with this |
Beta Was this translation helpful? Give feedback.
-
By the way, #114 already works with a 1 line code change (and the nicer way to do it is obvious). Unless iacore doesn't like the idea of the non-mmap approach that one really just needs code cleanups to be suitable for merging. |
Beta Was this translation helpful? Give feedback.
-
Which PRs are considered "done"? Those should be merged first. I don't care about the order. I want to fix the converting back and forth of |
Beta Was this translation helpful? Give feedback.
-
It sounds good to me |
Beta Was this translation helpful? Give feedback.
-
Quick update on this: #84 is dependent on #125 in order to support all three model types (it currently only supports GGMJ), so we can't merge that yet. #85 is now very out of sync (sorry, @hhamud!) but hopefully the changes that have been introduced since will make it easier to reconstruct and to share as much code as possible. |
Beta Was this translation helpful? Give feedback.
-
I've been merging PRs fairly haphazardly, but they're all starting to conflict with each other. I propose the following merge order and then we ship 0.1.0:
quantize.cpp
#84 - blocked on fixing the logic, but means users won't need upstreamllama.cpp
at allquantize.cpp
#84, and needs to be updated to match Sync to llama.cpp + GGML version as of 20230407 09:57 AM UTC. #119 and Reserve more eval memory and use ggml scratch buffers #116quantize.cpp
#84, which introduces loading helper functions that could be used by Add GGJT loader #114 and WIP: Bloom Inference #85)No other PRs that significantly touch logic will be merged outside of this order unless they're critical to getting things working.
Please confirm you're OK with this, or suggest an alternate plan: @setzer22 @KerfuffleV2 @iacore @FloppyDisck @hhamud
Beta Was this translation helpful? Give feedback.
All reactions