Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Out of order data produced by aggregator #398

Open
replay opened this issue Feb 28, 2020 · 3 comments
Open

Out of order data produced by aggregator #398

replay opened this issue Feb 28, 2020 · 3 comments

Comments

@replay
Copy link
Contributor

replay commented Feb 28, 2020

A user has shared detailed information with us which shows that a crng aggregator has produced out of order data for them. The issue seems to be hard to reproduce and only occurs from time to time (sounds like race condition to me).

This is the aggregator config they used:

[[aggregation]]
function = 'sum'
regex = '^<regex with 3 capture groups>$'
format = '$1.$2.$3'
interval = 60
wait = 65
dropRaw = true

Then the user added a carbon route which looks like this:

[[route]]
key = 'carbon-default'
type = 'sendAllMatch'
destinations = [
  '127.0.0.1:12003 pickle=false'
]

And recorded the generated data by doing nc -l -p 12003 > <file>. Additionally to that carbon route they also had an active grafanaNet route, all the traffic was written to both routes.

The recorded nc output clearly shows that some of the generated data is not in order:

<aggregated metric> 7.0 1582838400
<aggregated metric> 5.0 1582838340
<aggregated metric> 1.0 1582838520
<aggregated metric> 5.0 1582838460

The user also recorded the data which was sent to carbon-relay-ng via tcpdump, this recording shows that the metrics going into the aggregator were in order.

@robert-milan
Copy link
Contributor

Problem appears to be solved currently. Leaving this ticket open to re-investigate if it comes up again.

@robert-milan robert-milan removed this from the sprint-8 milestone Mar 16, 2020
@replay replay added this to the sprint-9 milestone Mar 16, 2020
@robert-milan robert-milan assigned fkaleo, replay and robert-milan and unassigned fkaleo Mar 16, 2020
@replay
Copy link
Contributor Author

replay commented Mar 16, 2020

Re-opening this issue, as it has already appeared again

@fkaleo
Copy link

fkaleo commented Apr 6, 2020

Could not figure out the cause yet.

@fkaleo fkaleo modified the milestones: sprint-9, sprint-10 Apr 6, 2020
@Dieterbe Dieterbe removed this from the sprint-10 milestone Apr 27, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

4 participants