-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
+install windows #15
base: main_v2_1
Are you sure you want to change the base?
+install windows #15
Conversation
also added support for poe.com -- which works, but the async io is tricky, i used a temporary workaround with waiting/logging (could certainly be improved, but am short of time atm) |
Thanks for your work on this. I'd like to get windows support to a point where it all "just works". Maybe the transformers dependency issues can be solved by just having 3.10 as a requirement on windows. I'm in the middle of attempting to replace pexpect with subprocess, because I think it's causing the issue with ooba hanging after ~30 queries. So that might help with windows compatibility. Re: adding poe.com support -- I'm a bit wary of adding too many inferencing engines / api support beyond the most common ones because each one incurs maintenance debt. I'm not really familiar with poe.com. Are there models there that you can't get elsewhere? do a lot of people use it? |
i quite understand this, yes. it was mostly for experimenting. |
Oh that would be handy to have api access to claude at least. Ok I'm convinced. |
for Claude-2 you have to pay though, but atm i have this. |
btw llama.cpp also uses subprocess here https://github.com/ggerganov/llama.cpp/blob/67be2ce1015d070b3b2cd488bcb041eefb61de72/examples/server/tests/features/steps/steps.py#L967 |
openai api (llamafile, ollama, etc): works well (only using that would need much less python libraries though)
ooba: pexpect/threading issue in ooba.py (there would be several ways to solve this, maybe with popen_spawn.PopenSpawn?)
transformers: several issues atm, esp. FlashAttentionV2, triton only supported for python 3.10 in windows