Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

+install windows #15

Open
wants to merge 9 commits into
base: main_v2_1
Choose a base branch
from
Open

Conversation

CrispStrobe
Copy link
Contributor

openai api (llamafile, ollama, etc): works well (only using that would need much less python libraries though)

ooba: pexpect/threading issue in ooba.py (there would be several ways to solve this, maybe with popen_spawn.PopenSpawn?)

transformers: several issues atm, esp. FlashAttentionV2, triton only supported for python 3.10 in windows

  • maybe atm easier to just add eg ctransformers[cuda]

@CrispStrobe
Copy link
Contributor Author

also added support for poe.com -- which works, but the async io is tricky, i used a temporary workaround with waiting/logging (could certainly be improved, but am short of time atm)

@sam-paech
Copy link
Contributor

Thanks for your work on this. I'd like to get windows support to a point where it all "just works". Maybe the transformers dependency issues can be solved by just having 3.10 as a requirement on windows.

I'm in the middle of attempting to replace pexpect with subprocess, because I think it's causing the issue with ooba hanging after ~30 queries. So that might help with windows compatibility.

Re: adding poe.com support -- I'm a bit wary of adding too many inferencing engines / api support beyond the most common ones because each one incurs maintenance debt. I'm not really familiar with poe.com. Are there models there that you can't get elsewhere? do a lot of people use it?

@CrispStrobe
Copy link
Contributor Author

i quite understand this, yes. it was mostly for experimenting.
nice thing about poe.com, besides ease of use, is the cost factor ;)
& poe offers Claude-2 100k, Mixtral via Groq, Mistral-large, among others

@sam-paech
Copy link
Contributor

Oh that would be handy to have api access to claude at least. Ok I'm convinced.

@CrispStrobe
Copy link
Contributor Author

for Claude-2 you have to pay though, but atm i have this.

@CrispStrobe
Copy link
Contributor Author

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants