- 🌱 I’m currently learning CUDA && LLM Inference && CG
- 📫 How to reach me: [email protected]
🎯
Focusing
Pinned Loading
-
-
InternLM/lmdeploy
InternLM/lmdeploy PublicLMDeploy is a toolkit for compressing, deploying, and serving LLMs.
-
PaddlePaddle/Paddle
PaddlePaddle/Paddle PublicPArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice (『飞桨』核心框架,深度学习&机器学习高性能单机、分布式训练和跨平台部署)
-
-
sglang
sglang PublicForked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
Python
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.