Latency in linkerd multicluster Communication #12343

ben-yun · 2024-03-25T06:18:19Z

ben-yun
Mar 25, 2024

We are operating in an Amazon EKS environment where communication between different clusters was previously facilitated through an NLB, utilizing api-gateway. We have transitioned to using linkerd multicluster Communication for pod-to-pod communication.

However, we are experiencing intermittent latency spikes, occurring approximately every 30 seconds, with response times exceeding 100ms when attempting pod-to-pod communication, compared to the previous average response time of around 10ms for the same client application using the previous route.

Upon investigation with APM (DataDog), it's evident that there are untracked spans consuming significant time, likely indicating a bottleneck occurring within the linkerd communication process.

Considering the circumstances, it appears there may be something within the communication facilitated by linkerd causing periodic delays. How should we proceed? What additional information would be beneficial to provide?

linkerd version: stable-2.14.2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Latency in linkerd multicluster Communication #12343

{{title}}

Replies: 0 comments

Select a reply

Latency in linkerd multicluster Communication #12343

ben-yun Mar 25, 2024

Replies: 0 comments

ben-yun
Mar 25, 2024