DefTruth

Follow

🎯

#pragma unroll

DefTruth DefTruth

🎯

#pragma unroll

Follow

@xlite-dev, @vipshop, LeetCUDA.

1.8k followers · 156 following

@xlite-dev, @vipshop
Guangzhou, China
19:32 (UTC +08:00)
https://github.com/xlite-dev

Achievements

Achievements

Organizations

DefTruth/README.md

🏢 Group: Owner. @xlite-dev | @vipshop | Prev. @PaddlePaddle 🏰

🛠 Creator: lite.ai.toolkit | Awesome-LLM-Inference | LeetCUDA | ffpa-attn 🎧

🖥 HGEMM | 🤗cache-dit | Awesome-DiT-Inference | torchlm 🖱

🎉 Contributor: FastDeploy | vLLM | SGLang | Many Others ⚙️

✉️ Contact: [email protected] | GitHub: DefTruth | 知乎: DefTruth 🤖

♥️ I love open source, bro, and I think you do too. ♥️

Pinned Loading

xlite-dev/LeetCUDA xlite-dev/LeetCUDA Public

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA.🎉

Cuda 5.3k 564
xlite-dev/lite.ai.toolkit xlite-dev/lite.ai.toolkit Public

🛠 A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉

C++ 4.1k 746
xlite-dev/Awesome-LLM-Inference xlite-dev/Awesome-LLM-Inference Public

📚A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.🎉

Python 4.2k 290
vllm-project/vllm vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 51.4k 8.5k
PaddlePaddle/FastDeploy PaddlePaddle/FastDeploy Public

High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle

C++ 3.4k 536
vipshop/cache-dit vipshop/cache-dit Public

🤗A Training-free and Easy-to-use Cache Acceleration Toolbox for DiTs: DBCache, DBPrune, TaylorSeer, FBCache, Cache CFG, etc🔥

Python 85 3