Simulate how LLM serving engines such as vLLM use Python's asyncio.Queue to achieve dynamic (continuous) batching: batch generation at the iteration level rather than at the request level.

hitpoint6/llm-continuous-batching-simulator
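
The core mechanism, as a minimal sketch (not the repository's actual code; the names Request, engine_loop, and client are illustrative): clients put requests onto a shared asyncio.Queue, and a single engine loop drains newly arrived requests into the active batch at the start of every decode iteration, runs one token-generation step for the whole batch, and retires finished requests so their slots are freed immediately instead of waiting for the entire batch to complete.

```python
import asyncio
import random


class Request:
    """One generation request: a prompt plus a future the client awaits."""

    def __init__(self, prompt: str, max_tokens: int):
        self.prompt = prompt
        self.max_tokens = max_tokens
        self.tokens: list[str] = []
        self.done: asyncio.Future = asyncio.get_running_loop().create_future()


async def engine_loop(queue: asyncio.Queue) -> None:
    """Iteration-level (continuous) batching: each loop iteration admits any
    newly queued requests, runs ONE decode step for the whole batch, and
    retires requests that finished, so freed slots are reused immediately."""
    batch: list[Request] = []
    while True:
        # Admit new requests without blocking the requests already running.
        while not queue.empty():
            batch.append(queue.get_nowait())
        if not batch:
            # Nothing active: block until the next request arrives.
            batch.append(await queue.get())

        # One decode iteration for every active request (a real engine would
        # run a single batched forward pass of the model here).
        await asyncio.sleep(0.01)  # stand-in for the batched forward pass
        for req in batch:
            req.tokens.append(f"tok{len(req.tokens)}")

        # Retire finished requests at iteration granularity.
        still_running = []
        for req in batch:
            if len(req.tokens) >= req.max_tokens:
                req.done.set_result(" ".join(req.tokens))
            else:
                still_running.append(req)
        batch = still_running


async def client(queue: asyncio.Queue, name: str) -> None:
    """A caller: enqueue a request, then await its completed generation."""
    req = Request(prompt=f"prompt from {name}", max_tokens=random.randint(2, 6))
    await queue.put(req)
    result = await req.done
    print(f"{name} ({req.max_tokens} tokens): {result}")


async def main() -> None:
    queue: asyncio.Queue = asyncio.Queue()
    engine_task = asyncio.create_task(engine_loop(queue))
    await asyncio.gather(*(client(queue, f"req{i}") for i in range(4)))
    engine_task.cancel()


if __name__ == "__main__":
    asyncio.run(main())
```

Running the sketch prints each request as soon as it reaches its own max_tokens, which is the property iteration-level batching provides: short requests finish and leave the batch without waiting for longer ones.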

