email: [email protected]
- AWS Amplify Customizable Auth Components
- Event Search and Recommendation App
- Sorting Algorithms Visualizer
Ray is an AI compute engine: a core distributed runtime plus a set of AI libraries for accelerating ML workloads.
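A minimal sketch of Ray's core task API, assuming a local `ray.init()` runtime; the `square` function and the range of inputs are just illustrative:

```python
import ray

ray.init()  # start a local Ray runtime (connects to a cluster if one is configured)

@ray.remote
def square(x):
    # Runs as a Ray task, scheduled across the available workers.
    return x * x

# Launch tasks in parallel and gather the results.
futures = [square.remote(i) for i in range(4)]
print(ray.get(futures))  # [0, 1, 4, 9]
```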
vLLM: a high-throughput and memory-efficient inference and serving engine for LLMs.
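A minimal offline-inference sketch with vLLM's `LLM` API; the model name, prompts, and sampling settings below are illustrative placeholders, not part of the original listing:

```python
from vllm import LLM, SamplingParams

# Illustrative checkpoint; any Hugging Face causal LM supported by vLLM would do.
llm = LLM(model="facebook/opt-125m")
sampling_params = SamplingParams(temperature=0.8, top_p=0.95, max_tokens=64)

prompts = ["The capital of France is", "Large language models are"]
outputs = llm.generate(prompts, sampling_params)

for output in outputs:
    print(output.prompt, "->", output.outputs[0].text)
```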
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
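A hedged sketch of querying a running Triton server over HTTP with the `tritonclient` package; the server address, model name (`my_model`), and tensor names (`INPUT0`/`OUTPUT0`) are placeholders that must match the deployed model's `config.pbtxt`:

```python
import numpy as np
import tritonclient.http as httpclient

# Assumes a Triton Inference Server is already running on localhost:8000.
client = httpclient.InferenceServerClient(url="localhost:8000")

# Placeholder input shape and tensor names for illustration only.
data = np.random.rand(1, 16).astype(np.float32)
infer_input = httpclient.InferInput("INPUT0", list(data.shape), "FP32")
infer_input.set_data_from_numpy(data)

result = client.infer(
    model_name="my_model",
    inputs=[infer_input],
    outputs=[httpclient.InferRequestedOutput("OUTPUT0")],
)
print(result.as_numpy("OUTPUT0"))
```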
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
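A minimal sketch of the high-level `LLM` API available in newer TensorRT-LLM releases (older releases use an explicit build-then-run workflow instead); the checkpoint name is an illustrative placeholder:

```python
from tensorrt_llm import LLM, SamplingParams

# Illustrative checkpoint; the TensorRT engine is built on first use.
llm = LLM(model="TinyLlama/TinyLlama-1.1B-Chat-v1.0")
sampling_params = SamplingParams(temperature=0.8, top_p=0.95)

outputs = llm.generate(["Hello, my name is"], sampling_params)
for output in outputs:
    print(output.prompt, "->", output.outputs[0].text)
```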
Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.
The official repository of Qwen2-Audio, the chat and pretrained large audio-language model proposed by Alibaba Cloud.
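A minimal loading sketch via the Hugging Face Transformers integration; the `Qwen/Qwen2-Audio-7B-Instruct` checkpoint name is assumed from the public model hub, and the full audio-chat flow is left to the repo's usage examples:

```python
from transformers import AutoProcessor, Qwen2AudioForConditionalGeneration

model_id = "Qwen/Qwen2-Audio-7B-Instruct"  # assumed public checkpoint name
processor = AutoProcessor.from_pretrained(model_id)
model = Qwen2AudioForConditionalGeneration.from_pretrained(model_id, device_map="auto")

# Audio-conditioned chat then goes through the processor's chat template and
# model.generate(...), as shown in the repo's usage examples.
```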