Skip to content

Latest commit

 

History

History
14 lines (10 loc) · 624 Bytes

README.md

File metadata and controls

14 lines (10 loc) · 624 Bytes

axs2kiss

Automated KRAI X workflows for dedicated inference engines on selected backends: vLLM and SGLang on CUDA and ROCm, NIM on CUDA, using the OpenAI API compatible LoadGen client.

To import this repository and its dependencies into your work_collection, run:

axs byquery git_repo,collection,repo_name=axs2kiss

License

Unless explicitly stated otherwise, the software in this repository is provided under the permissive MIT license.

Contact

Please contact [email protected] for any queries.