LMeterX is a professional performance testing platform for large language models. It can be applied to model inference services built on mainstream inference frameworks (such as LiteLLM, vLLM, TensorRT-LLM, LMDeploy, and others), and it also supports performance testing for cloud services such as Azure OpenAI, AWS Bedrock, and Google Vertex AI. Through an intuitive Web interface, users can easily create and manage test tasks, monitor testing in real time, and obtain detailed performance analysis reports, providing reliable data support for model deployment and performance optimization.
- Universal Framework Support - Compatible with mainstream inference frameworks (vLLM, LiteLLM, TensorRT-LLM) and cloud services (Azure, AWS, Google Cloud)
- Full Model Compatibility - Supports mainstream LLMs like GPT, Claude, and Llama with one-click stress testing
- High-Load Stress Testing - Simulates high-concurrency requests to accurately detect model performance limits
- Multi-Scenario Coverage - Supports streaming/non-streaming requests and text/multimodal/custom datasets
- Professional Metrics - Core performance metrics including first token latency, throughput (RPS, TPS), and success rate
- AI Smart Reports - AI-powered performance analysis, multi-dimensional model comparison, and visualization
- Web Console - One-stop management for task creation, stopping, status tracking, and full-chain log monitoring
- Enterprise-level Deployment - Docker containerization with elastic scaling and distributed deployment support
Dimension | LMeterX | EvalScope | llmperf |
---|---|---|---|
Usage | Web UI for full-lifecycle task creation, monitoring & stop (load-test) | CLI for ModelScope ecosystem (eval & load-test) | CLI, Ray-based (load-test) |
Concurrency & Stress | Multi-process / multi-task, enterprise-scale load testing | Command-line concurrency (--parallel, --rate) | Command-line concurrency |
Test Report | Multi-model / multi-version comparison, AI analysis, visual dashboard | Basic report + visual charts (requires gradio, plotly, etc.) | Simple report |
Model & Data Support | OpenAI-compatible, custom data & model interfaces | OpenAI-compatible by default; extending APIs needs custom code | OpenAI-compatible |
Deployment & Scaling | Docker / K8s ready, easy horizontal scaling | pip install or source code | Source code only |
LMeterX adopts a microservices architecture design, consisting of four core components:
- Backend Service: FastAPI-based REST API service responsible for task management and result storage
- Load Testing Engine: Locust-based load testing engine that executes actual performance testing tasks
- Frontend Interface: Modern Web interface based on React + TypeScript + Ant Design
- MySQL Database: Stores test tasks, result data, and configuration information
- Docker 20.10.0+
- Docker Compose 2.0.0+
- At least 4GB available memory
- At least 5GB available disk space
Complete Deployment Guide: See Complete Deployment Guide for detailed instructions on all deployment methods
Use pre-built Docker images to start all services with one click:
# Download and run one-click deployment script
curl -fsSL https://raw.githubusercontent.com/MigoXLab/LMeterX/main/quick-start.sh | bash
# Download the deployment file docker-compose.yml
curl -fsSL -o docker-compose.yml https://raw.githubusercontent.com/MigoXLab/LMeterX/main/docker-compose.yml
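# Start all services with the default single-instance configuration
# (standard Docker Compose usage; services are defined in the file downloaded above)
docker compose up -d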
# Start multiple instances using the --scale option
# Start 2 backends + 2 engines (the number can be adjusted as needed)
docker compose up -d --scale backend=2 --scale engine=2
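After the containers start, you can verify the deployment with standard Docker Compose commands (the service names backend and engine below follow the --scale example above; adjust them if your compose file differs):
# List LMeterX containers and their status
docker compose ps
# Follow the logs of a single service, e.g. the backend API
docker compose logs -f backend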
- Access Web Interface: Open http://localhost:8080
- Create Test Task: Navigate to Test Tasks → Create Task, configure LLM API request information, test data, and request-response field mapping
- 2.1 Basic Information: For the /chat/completions API, you only need to configure the API path, model, and response mode; you can also supplement the complete payload in the request parameters.
- 2.2 Data Payload: Select built-in text or multimodal datasets as needed, or upload custom JSONL data files.
- 2.3 Field Mapping: Configure the prompt field path in the payload, as well as the response data paths for model output content, reasoning_content fields, usage fields, etc. This field mapping is crucial for updating request parameters with dataset prompts and for correctly parsing streaming/non-streaming responses.
- API Testing: In Test Tasks → Create Task, click the "Test" button in the Basic Information panel to quickly check API connectivity (see the example after this list). Note: for a quick API response, it is recommended to use simple prompts when testing connectivity.
- Real-time Monitoring: Navigate to Test Tasks → Logs/Monitoring Center to view full-chain test logs and troubleshoot exceptions
- Result Analysis: Navigate to Test Tasks → Results to view detailed performance results and export reports
- Result Comparison: Navigate to Model Arena to select multiple models or versions for multi-dimensional performance comparison
- AI Analysis: In Test Tasks → Results/Model Arena, after configuring the AI analysis service, you can run intelligent performance evaluation on single or multiple tasks
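As a reference for the API Testing step above, here is a minimal sketch of a connectivity check against an OpenAI-compatible /chat/completions endpoint; the base URL, API key, and model name are placeholders you must replace with your own values:
# A simple prompt keeps the response short, so connectivity is verified quickly
curl -s http://your-llm-host:8000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -d '{"model": "your-model-name", "messages": [{"role": "user", "content": "Hi"}], "stream": false}'
# In a typical OpenAI-compatible response, the model output is at choices[0].message.content and
# token counts are under the usage field — these are the kinds of paths configured in Field Mapping (2.3)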
# ================= Database Configuration =================
DB_HOST=mysql # Database host (container name or IP)
DB_PORT=3306 # Database port
DB_USER=lmeterx # Database username
DB_PASSWORD=lmeterx_password # Database password (use secrets management in production)
DB_NAME=lmeterx # Database name
# ================= Frontend Configuration =================
VITE_API_BASE_URL=/api # Base API URL for frontend requests (supports reverse proxy)
# ================= High-Concurrency Load Testing Deployment Requirements =================
# When concurrent users exceed this threshold, the system will automatically enable multi-process mode (requires multi-core CPU support)
MULTIPROCESS_THRESHOLD=1000
# Minimum number of concurrent users each child process should handle (prevents excessive processes and resource waste)
MIN_USERS_PER_PROCESS=500
# ⚠️ IMPORTANT NOTES:
# - When concurrency ≥ 1000, enabling multi-process mode is strongly recommended for performance.
# - Multi-process mode requires multi-core CPU resources — ensure your deployment environment meets these requirements.
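# Illustrative example (an assumption based on the two settings above, not an exact formula):
# with MULTIPROCESS_THRESHOLD=1000 and MIN_USERS_PER_PROCESS=500, a task with 2000 concurrent
# users triggers multi-process mode and would be split across roughly 2000 / 500 = 4 worker
# processes, each handling about 500 users.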
# ================= Deployment Resource Limits =================
deploy:
resources:
limits:
cpus: '2.0' # Recommended minimum: 2 CPU cores (4+ cores recommended for high-concurrency scenarios)
memory: 2G # Memory limit — adjust based on actual load (minimum recommended: 2G)
We welcome all forms of contributions! Please read our Contributing Guide for details.
LMeterX adopts a modern technology stack to ensure system reliability and maintainability:
- Backend Service: Python + FastAPI + SQLAlchemy + MySQL
- Load Testing Engine: Python + Locust + Custom Extensions
- Frontend Interface: React + TypeScript + Ant Design + Vite
- Deployment & Operations: Docker + Docker Compose + Nginx
LMeterX/
├── backend/ # Backend service
├── st_engine/ # Load testing engine service
├── frontend/ # Frontend service
├── docs/ # Documentation directory
├── docker-compose.yml # Docker Compose configuration
├── Makefile # Run complete code checks
├── README.md # English README
- Fork the Project to your GitHub account
- Clone Your Fork and create a development branch
- Follow Code Standards, use clear commit messages (follow conventional commit standards)
- Run Code Checks: Before submitting a PR, ensure code checks, formatting, and tests all pass; you can run make all (see the example commands after this list)
- Write Clear Documentation: Write corresponding documentation for new features or changes
- Actively Participate in Review: Actively respond to feedback during the review process
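The steps above map to a workflow roughly like the following (the fork URL and branch name are placeholders):
# Clone your fork and create a development branch
git clone https://github.com/<your-username>/LMeterX.git
cd LMeterX
git checkout -b feat/your-feature
# Run code checks, formatting, and tests before opening a PR
make all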
- Support for client resource monitoring
- CLI command-line tool
- Support for /v1/embedding and /v1/rerank API stress testing
- Deployment Guide - Detailed deployment instructions and configuration guide
- Contributing Guide - How to participate in project development and contribute code
Thanks to all developers who have contributed to the LMeterX project:
- @LuckyYC - Project maintainer & Core developer
- @del-zhenwu - Core developer
This project is licensed under the Apache 2.0 License.