-
Notifications
You must be signed in to change notification settings - Fork 99
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ERROR Error encoding stats err="json: unsupported value: NaN" #280
Comments
Please add What Docker version and device / OS are you using? |
I have enabled debug logging, and I am waiting for the error to reappear. I am running docker 24.0.6, debian 11 on a rpi3b+. The agent is version 0.7.2. Apologies for not including this before. |
|
Thanks. Is it really logging everything twice or was that just a timing coincidence when you started it up? Can you please run I'd recommend upgrading Docker if possible. There's a known bug in 24 that I had to specifically add a workaround for. I don't know if related but it's a good idea to update either way. |
I have now updated to docker 27 - i must have forgotten this rpi when i updated my cluster. I have this in my The logs are showing up twice, both in my syslog server - and when i inspect the logs manually using the docker command. It is happening consistently, not just close to a startup of the agent. |
The issue may be that multiple instances of the hub are requesting metrics from the agent at the same time. There's no lock coordination between multiple instances atm. Try to make sure only one total instance is running. |
I think this is the same issue as #216 actually. You can also reference this thread about swarm: #17. It seems to be possible to get working properly, but I haven't set up swarm yet to figure it out. With swarm, it would be better for the agents to use an internal timer and send stats to the hub, rather than how it's set up currently where the hub requests from the agent. I want to add that option but it's not going to be in the very near future. I think what's going wrong is that the hub connections are not being handled by the correct nodes, and sometimes one node is being assigned two connections. Re: failed to get active systems error - that's the first time I've seen someone hit that. Are you using the arm7 image for the hub by any chance? There's an open issue about that image having a problem with the DB (#133) but I haven't been able to troubleshoot it because I don't have an arm7 device. It may go away if you can figure out a working configuration with the swarm nodes, otherwise you may want to delete the |
Sorry for the slow reply time. Some more details of my system. I have 1 swarm, with 4 nodes. The original problem i posted about takes places on a separate pi, with details as described. So aside from the agent running on the rpi3b+, the hub and other agents are running on arm. Based on this information, I do not think it is likely that the problem is caused by swarm. But if you wish to test it your self, you can easily make a swarm "cluster" with just one node. Perhaps you can trigger the issues there. |
Hi,
One of my agents , running in docker is reporting some errors.
I am not sure how to go about debugging them it, or even fixing it.
The text was updated successfully, but these errors were encountered: