CLIP-TASS PyTorch implementation of our IJCV paper: CLIP-Powered TASS: Target-Aware Single-Stream Network for Audio-Visual Question Answering Our code is coming soon.