-
Notifications
You must be signed in to change notification settings - Fork 234
feat: add tracing to docker compose #434
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Signed-off-by: JaredforReal <[email protected]>
✅ Deploy Preview for vllm-semantic-router ready!
To edit notification comments on pull requests, go to your Netlify project configuration. |
👥 vLLM Semantic Team NotificationThe following members have been identified for the changed files in this PR and have been automatically assigned: 📁
|
…re and down-llm-katan targets Signed-off-by: JaredforReal <[email protected]>
Signed-off-by: JaredforReal <[email protected]>
@JaredforReal can you also take a look at #329 and see if you can add such app -> router tracing? |
Signed-off-by: JaredforReal <[email protected]>
Signed-off-by: JaredforReal <[email protected]>
Signed-off-by: JaredforReal <[email protected]>
Signed-off-by: JaredforReal <[email protected]>
@rootfs can we review this pr for now? I will try with a follow-up pr to make it e2e-enable. |
@JaredforReal can you add otel and jaeger to https://github.com/vllm-project/semantic-router/blob/main/website/src/components/AcknowledgementsSection/data.json? |
@rootfs sure! Maybe Grafana, Prometheus, and openwebui can be added here as well. |
sounds good! |
Add comprehensive distributed tracing support for OpenShift demos: - Add Jaeger all-in-one deployment manifest - Add deployment script with configuration instructions - Add toggle script to enable/disable tracing easily - Update DEMO-README.md with tracing documentation - Add DEMO-URLS.md quick reference for all demo URLs The tracing integration uses OpenTelemetry with OTLP exporters and provides end-to-end visibility into request flows including: - Request ingress through Envoy - ExtProc classification pipeline - Security checks (jailbreak, PII) - Category classification - Model routing decisions - Cache hits/misses - Per-span latency breakdown This leverages the tracing capabilities added in PR vllm-project#434. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <[email protected]> Signed-off-by: Yossi Ovadia <[email protected]>
What this PR does / why we need it:
config.tracing.yaml
inconfig/
as an example.