Log the sizes allocated through the SimpleMemoryPool for analysis. #380
base: 3.0-li
Conversation
public class MemoryPoolStatsStore {
  private static final Logger log = LoggerFactory.getLogger(MemoryPoolStatsStore.class);

  private final AtomicInteger[] histogram;
Is there a reason not to use the metrics object rather than implementing this by hand?
Yes. Kafka metrics support publishing histograms only as percentiles, which is not useful for determining the range of the most frequent memory allocations. For example, suppose around 60% of requests are of size 1 KB. If we observe from metrics that the p90 request size is 400 KB and the p99 size is 1500 KB, we cannot infer that most requests fall within the 1 KB range.
Hence, we are publishing the full histogram as logs.
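To make that concrete, a full-histogram store along these lines can be sketched as follows. This is a minimal illustration with made-up names (`SimpleHistogram`, `countAt`), not the PR's actual code; it assumes linear fixed-size segments of `segmentSizeBytes` up to `maxSizeBytes`:

```java
import java.util.concurrent.atomic.AtomicInteger;

// Sketch of a full-histogram store: one atomic counter per linear segment.
// Unlike percentile metrics, the raw per-bucket counts expose the full
// distribution, e.g. a large spike of small (~1 KB) allocations.
public class SimpleHistogram {
    private final AtomicInteger[] buckets;
    private final int segmentSizeBytes;
    private final int maxSizeBytes;

    public SimpleHistogram(int segmentSizeBytes, int maxSizeBytes) {
        this.segmentSizeBytes = segmentSizeBytes;
        this.maxSizeBytes = maxSizeBytes;
        // one bucket per segment, rounding up
        int numBuckets = (maxSizeBytes + segmentSizeBytes - 1) / segmentSizeBytes;
        this.buckets = new AtomicInteger[numBuckets];
        for (int i = 0; i < buckets.length; i++) {
            buckets[i] = new AtomicInteger();
        }
    }

    // Returns the bucket index, or -1 if the size exceeds the recorded range.
    public int bucketFor(int bytes) {
        if (bytes > maxSizeBytes) {
            return -1;
        }
        return (bytes - 1) / segmentSizeBytes;
    }

    public void record(int bytes) {
        int idx = bucketFor(bytes);
        if (idx >= 0) {
            buckets[idx].incrementAndGet();
        }
    }

    public int countAt(int idx) {
        return buckets[idx].get();
    }
}
```

A periodic task can then log all bucket counts and reset them, which is what publishing the histogram as logs amounts to.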
@@ -114,6 +119,7 @@ public boolean isOutOfMemory() {
  //allows subclasses to do their own bookkeeping (and validation) _before_ memory is returned to client code.
  protected void bufferToBeReturned(ByteBuffer justAllocated) {
    this.allocateSensor.record(justAllocated.capacity());
Are the metrics from this allocateSensor not good enough?
If so, can you please explain why?
This sensor relies on the Stat class, and it appears that it can record statistics like max, sum, and avg, but not the entire histogram.
    memoryPoolStatsStore.clear()
  }

  val histogramPublisher = new KafkaScheduler(threads = 1, "histogram-publisher-")
Can we reuse the same scheduler (var kafkaScheduler: KafkaScheduler = null), since the startup and shutdown of that scheduler are already taken care of?
Yeah, we can do that. I was not sure myself, because we use this kafkaScheduler in some places inside KafkaServer, while in other places we allocate a new KafkaScheduler for other purposes, so I was confused as to which way to go.
I have fixed it.
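Either way, the periodic publish-and-clear boils down to scheduling one recurring task. A plain-Java sketch with ScheduledExecutorService shows the shape (names and parameters here are illustrative; inside the broker, KafkaScheduler provides the equivalent machinery):

```java
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;

// Sketch only: run a "log histogram and clear counters" task at a
// fixed, configurable interval on a single dedicated thread.
public class HistogramPublisher {
    private final ScheduledExecutorService scheduler =
        Executors.newSingleThreadScheduledExecutor();

    public void start(Runnable logAndClear, long interval, TimeUnit unit) {
        // first run after one interval, then repeat every interval
        scheduler.scheduleAtFixedRate(logAndClear, interval, interval, unit);
    }

    public void shutdown() {
        scheduler.shutdownNow();
    }
}
```

Reusing an existing scheduler avoids creating an extra thread pool whose lifecycle must be managed separately, which is the point of the suggestion above.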
class MemoryPoolStatsLogger extends Logging {
  override lazy val logger = MemoryPoolStatsLogger.logger

  def logStats(memoryPoolStatsStore: MemoryPoolStatsStore): Unit = {
It seems everything in this class can be moved into the object MemoryPoolStatsLogger.
Thanks for the good suggestion.
      log.debug("Requested bytes {} for allocation exceeds maximum recorded value {}", bytes, maxSizeBytes);
      return -1;
    } else {
      return (bytes - 1) / segmentSizeBytes;
Would it be better to record the bytes on a log scale instead of a linear scale?
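For comparison, a log-scale scheme (a possible alternative, not what this change implements) would derive the bucket index from floor(log2(bytes)), so each bucket covers a power-of-two range and the same size range needs only about log2(maxSizeBytes) buckets instead of maxSizeBytes / segmentSizeBytes:

```java
// Illustrative log2 bucketing: bucket i covers sizes in [2^i, 2^(i+1)).
public class LogBuckets {
    // Returns floor(log2(bytes)) for positive sizes, or -1 otherwise.
    public static int logBucketFor(int bytes) {
        if (bytes <= 0) {
            return -1;
        }
        // 31 - numberOfLeadingZeros(x) is floor(log2(x)) for positive int x
        return 31 - Integer.numberOfLeadingZeros(bytes);
    }
}
```

The trade-off is resolution: linear segments distinguish, say, 1 KB from 2 KB allocations precisely, while log buckets lump each power-of-two range together but stay compact over a wide range of sizes.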
[LI-HOTFIX]
TICKET = LIKAFKA-43625
LI_DESCRIPTION = We want to re-use memory buffers for request deserialization. To do this, we need to gather statistics on the most frequent allocation sizes on current clusters: histograms of the number of requests at every allocation size. This change lets us specify Kafka configuration to log this histogram at a configurable interval of a few minutes.
EXIT_CRITERIA = This PR will be reverted once we determine the right range of sizes to use for the memory pool as part of the work on the aforementioned ticket.