DMT: dmt command hangs when migration cluster fails to connect to source or target [CLC-432] #427

Closed
sumnerib opened this issue Nov 6, 2023 · 3 comments
Labels
bug Something isn't working source:internal to-jira Open a tracking issue on Jira

sumnerib commented Nov 6, 2023

Given a running migration cluster, if a migration is started and the migration cluster cannot connect to the source or target cluster, the migration cluster eventually times out and logs that the migration failed. However, the dmt command keeps running, with output like the following:

Hazelcast Data Migration Tool v5.3.0
(c) 2023 Hazelcast, Inc.

Selected data structures in the source cluster will be migrated to the target cluster.


    OK [1/2] Connected to the migration cluster.
    OK [2/2] Started the migration with ID: 078b5f14-aa73-4704-902d-a2c939397107.

Expected behavior: the dmt command should exit with an error message stating that the migration cluster could not connect to the source or target cluster.
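
To make the expected behavior concrete, here is a minimal sketch of a client-side wait loop that stops as soon as a failure is reported or a deadline passes. This is not the actual dmt implementation: `Status`, `MigrationFailed`, `wait_for_migration`, and the `fetch_status` callback are all hypothetical names, and how dmt really learns the migration status from the migration cluster is not described in this issue.

```python
import time
from enum import Enum
from typing import Callable, Tuple


class Status(Enum):
    """Hypothetical migration states; not taken from the DMT code base."""
    IN_PROGRESS = "IN_PROGRESS"
    COMPLETED = "COMPLETED"
    FAILED = "FAILED"


class MigrationFailed(Exception):
    """Raised when the (assumed) status source reports a failure."""


def wait_for_migration(
    fetch_status: Callable[[], Tuple[Status, str]],
    timeout_s: float = 60.0,
    poll_interval_s: float = 0.5,
) -> None:
    """Poll until the migration completes, fails, or the deadline passes.

    fetch_status is an assumed callback returning (status, detail); the
    real tool may receive status updates in a different way.
    """
    deadline = time.monotonic() + timeout_s
    while True:
        status, detail = fetch_status()
        if status is Status.COMPLETED:
            return
        if status is Status.FAILED:
            # Surface the failure to the caller instead of waiting forever.
            raise MigrationFailed(detail)
        if time.monotonic() >= deadline:
            raise TimeoutError(f"no final status after {timeout_s} seconds")
        time.sleep(poll_interval_s)
```

With a loop like this, the command-line layer could catch `MigrationFailed` or `TimeoutError`, print the detail (for example "Cannot connect to target cluster"), and exit with a non-zero code.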

To reproduce:

  1. From the DMT package directory, start the migration cluster with a command like: HZ_NETWORK_PORT_PORT=5702 HZ_CLUSTERNAME=migration ./bin/hz start (do not start any of the other clusters)
  2. Start the migration: ./bin/dmt --config migration.yaml start migration_config --yes
  3. Eventually a warning like the following will be logged by the migration cluster: 2023-11-06 13:42:03,357 [ WARN] [data-migration-command-queue-handler] [c.h.d.i.MigrationRunner]: [127.0.0.1]:5702 [migration] [5.3.5-DM-SNAPSHOT] Migration failed unexpectedly com.hazelcast.core.HazelcastException: Cannot connect to target cluster:Unable to connect to any cluster.
  4. The dmt command will continue running, and no failure will be indicated to the user.
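
The hang in step 4 can be checked automatically with a small helper that runs a command under a time limit. This is only a sketch: `exits_within` is a hypothetical helper name, and the 120-second limit in the commented usage line is an arbitrary assumption, not a value from this issue.

```python
import subprocess
from typing import Optional, Sequence


def exits_within(cmd: Sequence[str], timeout_s: float) -> Optional[int]:
    """Run cmd and return its exit code, or None if it is still running
    after timeout_s seconds (the child process is killed in that case)."""
    try:
        completed = subprocess.run(
            cmd,
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
            timeout=timeout_s,
        )
    except subprocess.TimeoutExpired:
        return None
    return completed.returncode


# Assumed usage, run from the DMT package directory with only the
# migration cluster started (steps 1 and 2 above):
#   code = exits_within(
#       ["./bin/dmt", "--config", "migration.yaml",
#        "start", "migration_config", "--yes"],
#       timeout_s=120,
#   )
#   A result of None means the command was still running at the limit.
```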
@sumnerib sumnerib added bug Something isn't working to-jira Open a tracking issue on Jira source:internal labels Nov 6, 2023
@sumnerib sumnerib added this to the Data Migration Tool [Phase 1] milestone Nov 6, 2023
@github-actions github-actions bot changed the title DMT: dmt command hangs when migration cluster fails to connect to source or target DMT: dmt command hangs when migration cluster fails to connect to source or target [CLC-432] Nov 6, 2023

github-actions bot commented Nov 6, 2023

Internal Jira issue: CLC-432

sumnerib commented Nov 7, 2023

Similar behavior with the estimate command:

bin/dmt estimate --config migration.yaml migration_config

Hazelcast Data Migration Tool v5.3.0
(c) 2023 Hazelcast, Inc.


Estimation usually ends within 15 seconds.
    OK [1/2] Connected to the migration cluster.
       

yuce commented Nov 20, 2023

Fixed by: #428

@yuce yuce closed this as completed Nov 20, 2023