1. Is ZEN trained from an existing base BERT (e.g., the Google release) or from scratch? If from scratch, I guess the n-gram embeddings are randomly initialized; if from a base BERT, are the n-gram embeddings perhaps the average of the embeddings of the characters they contain? (See the sketch of that averaging idea below.)
2. The paper says "We use the same parameter setting for the n-gram encoder as in BERT". Does this mean the n-gram encoder's parameters are shared with the BERT tower (maybe its bottom six layers), or are they initialized and trained independently?
Thank you!
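To make question 1 concrete, here is a minimal sketch of the averaging idea it describes, assuming the Hugging Face `transformers` API and a toy n-gram vocabulary; the names `ngram_vocab` and `ngram_emb` are placeholders, not ZEN's actual code or API.

```python
# Hypothetical sketch: initialize each n-gram embedding as the mean of the
# BERT embeddings of the characters it contains. Not ZEN's actual init code.
import torch
from transformers import BertModel, BertTokenizer

tokenizer = BertTokenizer.from_pretrained("bert-base-chinese")
bert = BertModel.from_pretrained("bert-base-chinese")
char_emb = bert.embeddings.word_embeddings.weight  # [vocab_size, hidden]

ngram_vocab = ["北京", "自然语言"]  # toy n-gram lexicon for illustration
ngram_emb = torch.nn.Embedding(len(ngram_vocab), char_emb.size(1))

with torch.no_grad():
    for i, ngram in enumerate(ngram_vocab):
        ids = tokenizer.convert_tokens_to_ids(list(ngram))
        ngram_emb.weight[i] = char_emb[ids].mean(dim=0)
```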
There are two models in our paper: (R), with randomly initialized parameters, and (P), initialized from a pre-trained model, namely the Google-released Chinese BERT base model.
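For illustration, the difference between the (R) and (P) settings could look like the following, using the Hugging Face `transformers` API as a stand-in; this is a sketch of the two initialization schemes, not the ZEN training code.

```python
# Sketch of the (R) vs. (P) settings described above, not the actual ZEN code.
from transformers import BertConfig, BertModel

# (R): randomly initialized parameters.
config = BertConfig.from_pretrained("bert-base-chinese")
model_r = BertModel(config)  # fresh random weights

# (P): start from the Google-released Chinese BERT base weights.
model_p = BertModel.from_pretrained("bert-base-chinese")
```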
Sorry, I don't quite get your question; could you elaborate on it? Thanks.