show-and-tell

CS547 project implementing image captioning with the Show and Tell approach.

Things to try

  • Try pretrained GloVe embeddings instead of a randomly initialized nn.Embedding
  • Increase encoder capacity: resnet50 -> resnet152
  • Try GRU instead of LSTM
  • Beam search with variable k
  • DataParallel + Distributed training
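
As a starting point for the beam search item, here is a minimal sketch in plain Python. It is not this project's decoder: `beam_search`, `toy_step`, and the `<start>`/`<end>` token names are hypothetical, and the step function stands in for one decoder step that would return log-probabilities over the vocabulary. The toy distribution is made up to show how the result can change with `k`.

```python
import math
from typing import Callable, Dict, List, Sequence, Tuple

# Hypothetical interface: given the tokens generated so far, return a
# mapping from each candidate next token to its log-probability.
StepFn = Callable[[Sequence[str]], Dict[str, float]]


def beam_search(step_fn: StepFn, k: int, max_len: int = 20,
                start: str = "<start>", end: str = "<end>"
                ) -> List[Tuple[List[str], float]]:
    """Return hypotheses sorted by total log-probability, best first."""
    beams: List[Tuple[List[str], float]] = [([start], 0.0)]
    finished: List[Tuple[List[str], float]] = []
    for _ in range(max_len):
        # Expand every live hypothesis by every possible next token.
        candidates = [
            (tokens + [tok], score + logp)
            for tokens, score in beams
            for tok, logp in step_fn(tokens).items()
        ]
        if not candidates:
            break
        # Keep the k best expansions; k = 1 reduces to greedy decoding.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = []
        for tokens, score in candidates[:k]:
            if tokens[-1] == end:
                finished.append((tokens, score))
            else:
                beams.append((tokens, score))
        if not beams:
            break
    # Hypotheses that never emitted the end token are still returned.
    finished.extend(beams)
    return sorted(finished, key=lambda c: c[1], reverse=True)


def toy_step(tokens: Sequence[str]) -> Dict[str, float]:
    """Made-up next-token distribution, used only for illustration."""
    last = tokens[-1]
    if last == "<start>":
        return {"a": math.log(0.6), "the": math.log(0.4)}
    if last == "a":
        return {"dog": math.log(0.5), "cat": math.log(0.5)}
    if last == "the":
        return {"dog": math.log(0.9), "cat": math.log(0.1)}
    return {"<end>": 0.0}


greedy_tokens, greedy_score = beam_search(toy_step, k=1)[0]
beam_tokens, beam_score = beam_search(toy_step, k=2)[0]
print(" ".join(greedy_tokens), round(math.exp(greedy_score), 2))
print(" ".join(beam_tokens), round(math.exp(beam_score), 2))
```

On this toy distribution, `k=1` commits to "a" at the first step and ends with a sequence of probability 0.3, while `k=2` keeps "the" alive and finds "the dog" at 0.36. A real decoder would likely also want length normalization of the scores, which this sketch leaves out.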