The following video demonstrates the below steps:
MaKLlama
- 4 followers
- China
Popular repositories Loading
-
-
containerd
containerd PublicForked from containerd/containerd
An open and reliable container runtime
Go 1
-
-
ktransformers
ktransformers PublicForked from kvcache-ai/ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Python 1
-
Repositories
- ktransformers Public Forked from kvcache-ai/ktransformers
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
- ollama Public Forked from ollama/ollama
Get up and running with Llama 3, Mistral, Gemma, and other large language models.
- llama-box Public Forked from gpustack/llama-box
LLM inference server implementation based on llama.cpp.
- stable-diffusion.cpp Public Forked from leejet/stable-diffusion.cpp
Stable Diffusion and Flux in pure C/C++
- fastfetch Public Forked from gpustack/fastfetch
Like neofetch, but much faster because written mostly in C.
- vllm Public Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs