Hi @rasbt , I achieved it. But what bothers me is that the validation accuracy is greater than the training accuracy. How is that even possible? Intuitively, training accuracy should always be higher than the validation (unseen data) accuracy, right? Could you please shed some light on this? I used …
That's an interesting one. It's rare, but it can happen. One explanation is randomness: it's still a relatively small dataset, with only ~270 instances in the validation set. For example, assume the model does not overfit and performs equally well on the training and validation sets. You will still see some slight variation depending on which exact examples land in the training set versus the validation set (i.e., how you split the dataset, which changes with the random seed).

Another explanation is that you have tuned the model so much (via hyperparameter choices) to improve the validation accuracy that it is slightly overtuned to the validation set. This is why it's important to have an additional test set for the final evaluation. (If I recall correctly, I cover this later in the course.)
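To make the randomness point concrete, here's a minimal sketch (a synthetic scikit-learn example, not the course's dataset or model; the dataset size and split are hypothetical stand-ins) showing how the split seed alone can flip which side scores higher:

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Hypothetical small dataset for illustration
X, y = make_classification(n_samples=1350, n_features=20, random_state=0)

for seed in range(5):
    # ~270 validation instances, mirroring the split size mentioned above
    X_train, X_val, y_train, y_val = train_test_split(
        X, y, test_size=270, random_state=seed
    )
    model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
    print(
        f"seed={seed}  "
        f"train acc={model.score(X_train, y_train):.3f}  "
        f"val acc={model.score(X_val, y_val):.3f}"
    )
```

Running this, you'll typically see the validation accuracy land above the training accuracy for at least one of the seeds, purely because of which examples ended up on which side of the split.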