Interesting results for Highway Nets on ImageNet with A2 #1197
Unanswered
flukeskywalker
asked this question in
Show and tell
Replies: 1 comment 2 replies
-
@flukeskywalker any code to look at re the model changes for the highway variant? I assume based on resnet models for depths / stages / but the widths above and 'highway network' style gating? It'd be of interest to include, especially if the runtime (throughput, memory) is competitive with ResNet and similar |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Thanks to timm and @rwightman's very nicely written paper on training recipes, I was able to directly compare Residual connections to Highways on ImageNet-val over the last couple of weeks, and got the results below. I find it interesting that just switching to Highway boosts the acc by 0.5% in each case, while the learned representations are smaller. Moreover, with Highway+A2 I get equal or better acc. than ResNet+A1. I'm curious to know what others think. Are these results interesting, unexpected, or useful to you?
top-1 accuracy
top-1 accuracy
Beta Was this translation helpful? Give feedback.
All reactions