GitHub - intuitivecomputing/Object_Permanence_through_AudioVisual_Representations

Object Permanence through Audio-Visual Representations

This is the repository accompanying the paper submission Object Permanence through Audio-Visual Representations. In this work, we proposed a multimodal neural network model, using partially observed trajectory and audio, to predict the trajectory and final position of a dropped object.

Dataset is available at https://intuitivecomputing.jhu.edu/openscience.html

Pretrained weights for combined model is provided in multimodal_pretrained_weights.pkl.

partition.txt provides a dictionary of indices we used for training, validation, and testing.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
Multimodal_weights.pkl		Multimodal_weights.pkl
README.md		README.md
audio_baseline.ipynb		audio_baseline.ipynb
multimodal_model.ipynb		multimodal_model.ipynb
partition.json		partition.json
pretrained_weights.pkl		pretrained_weights.pkl
vision_baseline.ipynb		vision_baseline.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Object Permanence through Audio-Visual Representations

About

Releases

Packages

Languages

intuitivecomputing/Object_Permanence_through_AudioVisual_Representations

Folders and files

Latest commit

History

Repository files navigation

Object Permanence through Audio-Visual Representations

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages