Implement batching of metrics so that, when we collect metrics at the record level, the driver isn't slowed down by the volume of metric updates being sent to it.
My initial idea is a configurable batching interval of x seconds. The user can tune this interval to match how often an external monitoring service polls these Spark metrics, so that even time-sensitive metrics remain relatively accurate.
With Spark 2.0, we should also explore using the newer accumulator APIs to see if we can accomplish this.
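To make the batching idea concrete, here is a minimal sketch of the interval-based approach described above. The `MetricBatcher` class and its `sink` callback are hypothetical names for illustration, not part of Spark's API: each record-level update is aggregated in a local buffer, and the buffer is only shipped to the sink (in Spark's case, the driver) once the configured interval has elapsed.

```python
import time


class MetricBatcher:
    """Hypothetical sketch: aggregate per-record metric updates locally and
    flush them to a sink at most once per batching interval."""

    def __init__(self, sink, interval_s=5.0, clock=time.monotonic):
        self.sink = sink              # callable that receives a dict of counts
        self.interval_s = interval_s  # user-configurable batching interval
        self.clock = clock            # injectable clock, eases testing
        self._buffer = {}
        self._last_flush = clock()

    def record(self, name, value=1):
        # Cheap local aggregation; no communication cost per record.
        self._buffer[name] = self._buffer.get(name, 0) + value
        if self.clock() - self._last_flush >= self.interval_s:
            self.flush()

    def flush(self):
        # Ship the accumulated counts in one message, then reset.
        if self._buffer:
            self.sink(dict(self._buffer))
            self._buffer.clear()
        self._last_flush = self.clock()
```

With this shape, a thousand record-level updates inside one interval cost a thousand dictionary increments but only a single send, which is the property we want for the driver. On Spark 2.0 specifically, a custom `AccumulatorV2` subclass could hold the local buffer on each executor, since merging already happens per task rather than per record.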