Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

lightning: add a note for using separator #18630

Open
wants to merge 1 commit into
base: master
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
6 changes: 5 additions & 1 deletion tidb-lightning/tidb-lightning-data-source.md
Original file line number Diff line number Diff line change
Expand Up @@ -136,6 +136,10 @@ trim-last-separator = false

另外,设置 `separator = '\n'` 表示使用两个字符 `\` + `n` 作为字符串定界符,而不是转义后的换行符 `\n`。

> **注意:**
>
> 如果 CSV 格式配置错误,TiDB Lightning 错误退出并提示 `encode kv error in file db.table.serialno.csv:0 at offset 2529671`。用 vim 打开 csv 文件,命令模式下 `:goto 2529671` 可以定位到编码错误的位置。
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
> 如果 CSV 格式配置错误,TiDB Lightning 错误退出并提示 `encode kv error in file db.table.serialno.csv:0 at offset 2529671` vim 打开 csv 文件,命令模式下 `:goto 2529671` 可以定位到编码错误的位置
> 如果 CSV 格式配置错误,TiDB Lightning 将退出并显示错误消息 `encode kv error in file db.table.serialno.csv:0 at offset 2529671`要定位错误,你可以使用 vim 打开 CSV 文件,并在命令模式下输入 `:goto 2529671`。更多信息,请参考[错误报告](/tidb-lightning/tidb-lightning-error-resolution.md#错误报告)


更多详细的内容请参考 [TOML v1.0.0 标准](https://toml.io/cn/v1.0.0#%E5%AD%97%E7%AC%A6%E4%B8%B2)。

#### `separator`
Expand All @@ -146,7 +150,7 @@ trim-last-separator = false

* CSV 用 `','`
* TSV 用 `"\t"`
* "\u0001" 表示使用 ASCII 字符 0x01
* "\u0001" 表示使用 ASCII 字符 0x01,即 SOH 控制字符,常用于 HIVE 表的导出时的分隔符。
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
* "\u0001" 表示使用 ASCII 字符 0x01,即 SOH 控制字符,常用于 HIVE 表的导出时的分隔符。
* "\u0001" 表示使用 ASCII 字符 0x01,即 SOH 控制字符,常用于导出 Hive 数据。


- 对应 LOAD DATA 语句中的 `FIELDS TERMINATED BY` 项。

Expand Down