Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

After the node disconnected by libcluster, it went back again to the Topology! #202

Open
wawashra opened this issue Aug 4, 2024 · 0 comments

Comments

@wawashra
Copy link

wawashra commented Aug 4, 2024

Steps to reproduce

  • Configure your deployment to use Cluster.Strategy.Kubernetes.DNS, within K8s service discovery.
  • Scale up the deployment to have more than one Pod (Let us consider we have node1, node2, and node3 connected to the topology).
  • Down-scale the deployment (Let us consider the K8S decide to terminate the node2).
  • Now, the other nodes (node1, node3) will disconnected from node2
  • While node2 is doing the Graceful shutdown, and it is now out of the service discovery, it will notice the other nodes(node1, node3), while polling them from the service discovery and it will re-connected to them !!!, and by that the orher node re-connected to the node2 again until the graceful shutdown done!

Description of issue

When I scaled down the deployment, I supposed the libclutser should disconnect the node when the k8s pod starts to terminate, but after the node was disconnected, it went back again to the cluster! and be visible and I can see it by Node.list() until the be pod is fully terminated!.

What are the expected results?

When the Node is not a part of the Topology(For example K8S service discovery), it should not be reconnected to the other nodes

Suggested Solution & Discussion .

To address that, I cloned the original Strategy (Cluster.Strategy.Kubernetes.DNS), And I patched it by adding a validation during the connection process, by validating the source node needed to connect to the other nodes must be a member of the topology(In my case it is must be one of the nodes back from the nslookup discovery), is that a correct solution? can we collaborate to make a general fix to be fit with the libcluster core?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant