Skip to content
Change the repository type filter

All

    Repositories list

    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      Apache License 2.0
      5.5k36k1.2k482Updated Feb 4, 2025Feb 4, 2025
    • Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM
      Python
      Apache License 2.0
      759241436Updated Feb 4, 2025Feb 4, 2025
    • Python
      Apache License 2.0
      17131114Updated Feb 4, 2025Feb 4, 2025
    • HCL
      17803Updated Feb 3, 2025Feb 3, 2025
    • HTML
      MIT License
      7601Updated Jan 30, 2025Jan 30, 2025
    • SCSS
      MIT License
      7400Updated Jan 30, 2025Jan 30, 2025
    • Community maintained hardware plugin for vLLM on Spyre
      Apache License 2.0
      0200Updated Jan 29, 2025Jan 29, 2025
    • Community maintained hardware plugin for vLLM on Ascend
      Apache License 2.0
      31211Updated Jan 29, 2025Jan 29, 2025
    • Fast and memory-efficient exact attention
      C++
      BSD 3-Clause "New" or "Revised" License
      1.4k4308Updated Jan 26, 2025Jan 26, 2025
    • An adaptor to allow Python allocator for PyTorch pluggable allocator
      C++
      Apache License 2.0
      1200Updated Jan 5, 2025Jan 5, 2025
    • media-kit

      Public
      vLLM Logo Assets
      0000Updated Dec 12, 2024Dec 12, 2024
    • vllm-nccl

      Public archive
      Manages vllm-nccl dependency
      Python
      Apache License 2.0
      31620Updated Jun 3, 2024Jun 3, 2024
    • dashboard

      Public
      vLLM performance dashboard
      Python
      Apache License 2.0
      42000Updated Apr 26, 2024Apr 26, 2024