Cluster with 400 applications and 4,000 pods generating 3 billion segments per day. Looking for advice. #12235

liuxinagxiang · 2024-05-17T15:33:41Z

My project generates about 3 billion segments a day. The current architecture is skywalking-agent --> kafka --> skywalking-L1-L2 --> ES.

The scale is as follows: 400 microservices and nearly 4,000 pods, served by 3 Kafka nodes, 6 skywalking-L1 nodes (Xmx8G Xms8G Xmn3G), 6 skywalking-L2 nodes (Xmx8G Xms8G Xmn3G), and 6 elasticsearch nodes (Xmx16G Xms16G Xmn6G).
Versions: Kafka 3.7.0, SkyWalking 9.2, Elasticsearch 7.17.18
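
To put the stated volume in per-second terms, here is a rough back-of-the-envelope sketch. It assumes traffic is spread evenly over the day and evenly across the 6 L1 nodes, which is an assumption and not something stated above; real peaks will be higher than this average. The function name `average_rate` is made up for illustration.

```python
SECONDS_PER_DAY = 24 * 60 * 60  # 86,400


def average_rate(segments_per_day, nodes):
    """Return (cluster-wide, per-node) average segments per second.

    Assumes a perfectly even load over the day and across nodes.
    """
    per_second = segments_per_day / SECONDS_PER_DAY
    return per_second, per_second / nodes


# Figures from the post: 3 billion segments/day, 6 skywalking-L1 nodes.
total, per_l1 = average_rate(3_000_000_000, 6)
print(f"cluster: {total:,.0f} segments/s, per L1 node: {per_l1:,.0f} segments/s")
# prints "cluster: 34,722 segments/s, per L1 node: 5,787 segments/s"
```

So each L1 node has to sustain several thousand segments per second on average before any peak factor is applied.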

The current problem is that the Kafka cluster and the Elasticsearch cluster look normal, but the SkyWalking L1 and L2 clusters keep reporting errors and the SkyWalking OAP service consumes very slowly. After each restart, Kafka data is consumed normally for about ten minutes, and then various timeout errors are reported. So I have moved L2 to a containerized deployment in the k8s cluster to avoid manually restarting OAP every time 😅

I want to know how to plan the cluster according to the size of my project. Which configurations can be optimized? Does anyone have any suggestions?

I was referring to this article recently: https://skywalking.apache.org/zh/2022-08-30-pingan-jiankang/

wu-sheng · 2024-05-17T15:59:16Z

Collaborator

> I want to know how to plan the cluster according to the size of my project. Which configurations can be optimized? Does anyone have any suggestions?

A healthy Elasticsearch doesn't mean it is powerful enough. Check the self-observability data, especially the OAP flush metrics. I believe flushing is too slow, so everything eventually gets blocked.
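
One possible way to look at the flush metrics mentioned above is to read the OAP's Prometheus telemetry output and compute the mean latency of a histogram from its `_sum` and `_count` samples. This is only a sketch: it assumes Prometheus telemetry is enabled on the OAP, and the metric name `persistence_timer_bulk_execute_latency` is an assumption that should be checked against what your OAP actually exposes. The helper `mean_latency` is hypothetical, and its parser is simplified (it does not handle `}` inside label values).

```python
import re

# Matches Prometheus text-format sample lines such as: name{label="x"} 12.5
SAMPLE = re.compile(r'^([A-Za-z_:][A-Za-z0-9_:]*)(?:\{[^}]*\})?\s+(\S+)')


def mean_latency(metrics_text, metric):
    """Return sum/count for the histogram `metric`, or None if it is absent.

    Samples with different label sets are added together.
    """
    total = 0.0
    count = 0.0
    for line in metrics_text.splitlines():
        if line.startswith("#"):  # skip HELP/TYPE comment lines
            continue
        m = SAMPLE.match(line)
        if not m:
            continue
        name, value = m.group(1), float(m.group(2))
        if name == metric + "_sum":
            total += value
        elif name == metric + "_count":
            count += value
    return total / count if count else None


# Hand-written sample input for illustration, not real OAP output.
sample = """# TYPE persistence_timer_bulk_execute_latency histogram
persistence_timer_bulk_execute_latency_sum 30.0
persistence_timer_bulk_execute_latency_count 10
"""
print(mean_latency(sample, "persistence_timer_bulk_execute_latency"))  # prints 3.0
```

In practice the text would come from the OAP telemetry endpoint (for example fetched with `urllib.request.urlopen`) instead of a literal string. A mean that keeps growing between scrapes would be consistent with the storage flush being the bottleneck.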

linshenfanying · 2024-06-11T08:42:36Z

Is your SkyWalking deployed in Kubernetes? If so, could you share the relevant documentation? I have also been making related changes recently.
