Does the PAD token affect fine-tuning? #146

Answered by lostella
dafmdev asked this question in Q&A

@dafmdev from a purely technical standpoint, training or fine-tuning with shorter series should not be an issue: PAD tokens will be used where needed, and the model will learn to ignore them.
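
For intuition, here is a minimal generic sketch of that mechanism (not Chronos internals; `left_pad_batch` is a hypothetical helper): shorter series in a batch are left-padded to a common length, and a boolean mask records which positions hold real observations so the model can ignore the padded ones.

```python
import torch

def left_pad_batch(series_list, pad_value=0.0):
    """Left-pad 1-D tensors to a common length; return values and a mask."""
    max_len = max(s.shape[0] for s in series_list)
    values = torch.full((len(series_list), max_len), pad_value)
    mask = torch.zeros(len(series_list), max_len, dtype=torch.bool)
    for i, s in enumerate(series_list):
        values[i, max_len - s.shape[0]:] = s
        mask[i, max_len - s.shape[0]:] = True  # True marks real observations
    return values, mask

batch, attention_mask = left_pad_batch(
    [torch.arange(5.0), torch.arange(8.0), torch.arange(3.0)]
)
print(batch.shape)               # torch.Size([3, 8])
print(attention_mask.sum(dim=1)) # tensor([5, 8, 3])
```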

But if you fine-tune on data with context length up to N < 512, then you should also cap the context length at N at prediction time (as shown in the sketch below), since the model may no longer know what to do with longer contexts.
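
As a concrete illustration (a sketch assuming the `ChronosPipeline` API from this repository's README; the checkpoint path and N = 256 are placeholders), you can truncate each series to its most recent N observations before calling `predict()`:

```python
import torch
from chronos import ChronosPipeline

N = 256  # placeholder: the longest context length seen during fine-tuning

pipeline = ChronosPipeline.from_pretrained(
    "path/to/fine-tuned-checkpoint",  # placeholder checkpoint path
    device_map="cpu",
    torch_dtype=torch.float32,
)

series = torch.randn(1000)  # a series longer than the fine-tuning contexts
context = series[-N:]       # keep only the most recent N observations

# forecast shape: (num_series, num_samples, prediction_length)
forecast = pipeline.predict(context, prediction_length=24)
```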

Answer selected by dafmdev