Skip to content

Issues: turboderp/exllama

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

Speculative decoding?
#218 opened Aug 2, 2023 by bryanhpchiang
Streaming API
#37 opened Jun 6, 2023 by bkutasi
Very poor output quality
#47 opened Jun 12, 2023 by calebmor460
Lora support
#55 opened Jun 15, 2023 by alain40
Exllama tutorials?
#192 opened Jul 25, 2023 by NickDatLe
Weird issue with context length
#220 opened Aug 3, 2023 by zzzacwork
Support for NF4?
#230 opened Aug 7, 2023 by hoagy-davis-digges
Slower tokens/s than expecting
#231 opened Aug 7, 2023 by teknium1
Question about example_flask.py
#235 opened Aug 8, 2023 by ZeroYuJie
Continuous Batching support
#237 opened Aug 9, 2023 by FireMasterK
KV caching?
#238 opened Aug 9, 2023 by bryanhpchiang
Run on CPU without AVX2
#315 opened Apr 14, 2024 by ZanMax
ProTip! Mix and match filters to narrow down what you’re looking for.