Skip to content

PyTorch implementation of our Fingdings of EMNLP2023 paper: Target-Aware Spatio-Temporal Reasoning via Answering Questions in Dynamic Audio-Visual Scenarios

License

Notifications You must be signed in to change notification settings

Bravo5542/TJSTG

Repository files navigation

Target-Aware Spatio-Temporal Reasoning via Answering Questions in Dynamic Audio-Visual Scenarios

Yuanyuan Jiang, Jianqin Yin
Beijing University of Posts and Telecommunications

[paper]

Preparation

  1. Clone this repo

    git clone https://github.com/Bravo5542/TJSTG.git
    
  2. Download data and extract feature

    MUSIC-AVQA: https://gewu-lab.github.io/MUSIC-AVQA/

Training

python net_tjstg/main.py --mode train

Testing

python net_tjstg/main.py --mode test

Notice

We improve our target-aware process to obtain a more robust performance. The experimental results based on the updated code are as follows:

image

Citation

@inproceedings{jiang2023avqa,
  title={Target-Aware Spatio-Temporal Reasoning via Answering Questions in Dynamics Audio-Visual Scenarios},
  author={Jiang, Yuanyuan and Yin, Jianqin},
  booktitle={Findings of the Association for Computational Linguistics: EMNLP 2023},
  year={2023},
  pages = "9399--9409"
}

About

PyTorch implementation of our Fingdings of EMNLP2023 paper: Target-Aware Spatio-Temporal Reasoning via Answering Questions in Dynamic Audio-Visual Scenarios

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages