title | summary | category |
---|---|---|
Data Migration Troubleshooting |
Learn how to diagnose and resolve issues when you use Data Migration. |
tools |
This document summarizes some commonly encountered issues when you use Data Migration, and provides the solutions.
If you encounter errors while running Data Migration, try the following solution:
-
Check the log content related to the error you encountered. The log files are on the DM-master and DM-worker deployment nodes. You can then view common errors to find the corresponding solution.
-
If the error you encountered is not involved yet, and you cannot solve the problem yourself by checking the log or monitoring metrics, you can contact the corresponding sales support staff.
-
After the error is solved, restart the task using dmctl.
resume-task ${task name}
However, you need to reset the data synchronization task in some cases. For details about when to reset and how to reset, see Reset the data synchronization task.
Access denied for user 'root'@'172.31.43.27' (using password: YES)
shows when you query the task or check the log
For database related passwords in all the DM configuration files, use the passwords encrypted by dmctl
. If a database password is empty, it is unnecessary to encrypt it. For how to encrypt the plaintext password, see Encrypt the upstream MySQL user password using dmctl.
In addition, the user of the upstream and downstream databases must have the corresponding read and write privileges. Data Migration also prechecks the corresponding privileges automatically while starting the data synchronization task.
When you encounter the following error, you need to manually handle it using dmctl (skipping the DDL statement or replacing the DDL statement with a specified DDL statement). For details, see Skip or replace abnormal SQL statements.
encountered incompatible DDL in TiDB: %s
please confirm your DDL statement is correct and needed.
for TiDB compatible DDL, please see the docs:
English version: https://github.com/pingcap/docs/blob/master/sql/ddl.md
Chinese version: https://github.com/pingcap/docs-cn/blob/master/sql/ddl.md
if the DDL is not needed, you can use dm-ctl to skip it, otherwise u also can use dm-ctl to replace it.
Note: Currently, TiDB is not compatible with all the DDL statements that MySQL supports. See the DDL statements supported by TiDB.
You need to reset the entire data synchronization task in the following cases:
RESET MASTER
is artificially executed in the upstream database, which causes an error in the relay log synchronization.- The relay log or the upstream binlog is corrupted or lost.
Generally, at this time, the relay unit exits with an error and cannot be automatically restored gracefully. You need to manually restore the data synchronization and the steps are as follows:
-
Use the
stop-task
command to stop all the synchronization tasks that are currently running. -
Use Ansible to stop the entire DM cluster.
-
Manually clean up the relay log directory of the DM-worker corresponding to the MySQL master whose binlog is reset.
- If the cluster is deployed using DM-Ansible, the relay log is in the
<deploy_dir>/relay_log
directory. - If the cluster is manually deployed using the binary, the relay log is in the directory set in the
relay-dir
parameter.
- If the cluster is deployed using DM-Ansible, the relay log is in the
-
Clean up downstream synchronized data.
-
Use Ansible to start the entire DM cluster.
-
Restart data synchronization with the new task name, or set
remove-meta
totrue
andtask-mode
toall
.
If the sync unit encounters an error while executing a SQL (DDL/DML) statement, DM supports manually skipping the SQL statement using dmctl or replacing this execution with another user-specified SQL statement.
When you manually handle the SQL statement that has an error, the frequently used commands include query-status
, sql-skip
, and sql-replace
.
-
Use
query-status
to query the current running status of the task.- Whether an error caused the
Paused
status of a task in a DM-worker - Whether the cause of the task error is an error in executing the SQL statement
- Whether an error caused the
-
Record the returned binlog pos (
SyncerBinlog
) that Syncer has synchronized when executingquery-status
. -
According to the error condition, application scenario and so on, decide whether to skip or replace the current SQL statement that has an error.
-
Skip or replace the current error SQL statement that has an error:
- To skip the current SQL statement that has an error, use
sql-skip
to specify the DM-worker, task name and binlog pos that need to perform SQL skip operations and perform the skip operations. - To replace the current SQL statement that has an error, use
sql-replace
to specify the DM-worker, task name, binlog pos, and the new SQL statement(s) used to replace the original SQL statement. (You can specify multiple statements by separating them using;
.)
- To skip the current SQL statement that has an error, use
-
Use
resume-task
and specify the DM-worker and task name to restore the task on the DM-worker that was paused due to an error. -
Use
query-status
to check whether the SQL statement skip or replacement is successful.
In dm-worker.log
, find current pos
corresponding to the SQL statement has an error.
worker
: flag parameter, string,--worker
, required; specifies the DM-worker where the SQL statement that needs to perform the skip operation is locatedtask-name
: non-flag parameter, string, required; specifies the task where the SQL statement that needs to perform the skip operation is locatedbinlog-pos
: non-flag parameter, string, required; specifies the binlog pos where the SQL statement that needs to perform the skip operation is located; the format ismysql-bin.000002:123
(:
separates the binlog name and pos)
worker
: flag parameter, string,--worker
, required; specifies the DM-worker where the SQL statement that needs to perform the replacement operation is locatedtask-name
: non-flag parameter, string, required; specifies the task where the SQL statement that needs to perform the replacement operation is locatedbinlog-pos
: non-flag parameter, string, required; specifies the binlog pos where the SQL statement that needs to perform the replacement operation is located; the format ismysql-bin.000002:123
(:
separates the binlog name and pos)sqls
: non-flag parameter, string, required; specifies new SQL statements that are used to replace the original SQL statement (You can specify multiple statements by separating them using;
)