multimodal-vqa

Multimodal models for visual question answering (VQA), developed and evaluated on datasets such as NLVR2 and GQA.