-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Bug] chia says it fails to start daemon, but the process is running. chia can't connect to it. #11390
Comments
Oh!!! I just noticed something else different about this set of hosts. They are older hardware, but more importantly, don't have IPv4 internet connectivity (or a local IPv4 address, except for 127.0.0.1). I think this is likely the core issue. I can see the daemon listening on 127.0.0.1:55400, but the fact that that should listen IPv6 is a separate defect I suspect. And, I can get to 127.0.0.1 just fine on these systems.
|
By default the daemon will listen on "localhost", however, it will try to do a DNS lookup on the word localhost which should return 127.0.0.1 - since the daemon seems to be listening this would appear to be ok. Note, you could try running the daemon in the foreground with |
Actually, with prefer_ipv6 set to True, which is it for me, it might resolve to ::1. But I tried it with prefer_ipv6 set to False as well, and it still fails in the same way. Running the daemon in the foreground has the same problem. The daemon is running fine, but chia still thinks it's not. I did find something that will work. I looked through the code to see how the daemon connection is made to test that it's up, and can see it uses
and with no other changes it all works now. So, something is limited in (a) the daemon code (which should listen on ::1 IMO, but that's likely a different bug) or the (b) daemon client code, which should try to connect to 127.0.0.1 when self_hostname is "localhost" at least when prefer_ipv4 is false. But, I don't know if code in that area using the resolver stuff that I implemented last fall, which I think would have that effect, but something clearly isn't. |
As expected since it's in aiohttp, the setting of the chia |
Also, I suggest that the Exception occurring in |
Looks like |
I do think there are some bugs listening on ::1 - and it's possible |
Yup. (as is likely obvious) I cloned the tree onto one of these machines, so I can help probe more if you have suggestions/questions. For now I have the config set to self_hostname: 127.0.0.1 so I'm working, but happy to change that around to test things. |
You are correct that daemon doesn't use |
Stranger yet is that later I realized that on this Ubuntu system, the hosts file lists ::1 as |
This issue has not been updated in 14 days and is now flagged as stale. If this issue is still affecting you and in need of further review, please comment on it with an update to keep it from auto closing in 7 days. |
This should remain open. This issue should track that |
What happened?
After upgrading a number of my linux (Ubuntu 20.04.4) systems to 1.3.4, I have a few harvesters that are failing to start up. When I try to start the harvester:
This reports this error in only a couple of seconds. I've noted that if I check the process table after this I see
chia_daemon
, and netstat shows a listener on port 55400. Maybe something isn't waiting long enough?These same systems were working with 1.3.3, and other systems are working with 1.3.4, so I don't know what to look at.
The debug.log shows:
but the above are from an hour or two ago, and new invocations aren't producing more of the same.
The systems displaying this behavior are older systems, and all (3-5) of these like systems are displaying this same problem. I rebooted one completely and it's still misbehaving in this way.
Let me know what other diagnostics or information I can gather.
Version
1.3.4
What platform are you using?
Linux
What ui mode are you using?
CLI
Relevant log output
The text was updated successfully, but these errors were encountered: