Yuanyuan Jiang, Jianqin Yin
Beijing University of Posts and Telecommunications
-
Clone this repo
git clone https://github.com/Bravo5542/TJSTG.git
-
Download data and extract feature
MUSIC-AVQA: https://gewu-lab.github.io/MUSIC-AVQA/
python net_tjstg/main.py --mode train
python net_tjstg/main.py --mode test
We improve our target-aware process to obtain a more robust performance. The experimental results based on the updated code are as follows:
@inproceedings{jiang2023avqa,
title={Target-Aware Spatio-Temporal Reasoning via Answering Questions in Dynamics Audio-Visual Scenarios},
author={Jiang, Yuanyuan and Yin, Jianqin},
booktitle={Findings of the Association for Computational Linguistics: EMNLP 2023},
year={2023},
pages = "9399--9409"
}