Few Code-Script Questions about Multimodal Late-Fusion #11
-
Hi, I hope you're doing well. While reviewing your GitHub repo and paper on the "Multimodal Late-Fusion" study, I had a few follow-up questions about the code:
Thanks in advance for any clarification!
Replies: 2 comments 6 replies
-
Thanks for your question @net-zero-2050! I've flagged this with our technical team and someone should be responding ASAP 😊
-
Hi @net-zero-2050, thanks for getting in contact. I'll try to answer these, but @Sukh-P and @AUdaltsova please do jump in on anything.
Hopefully these answers help a bit, but please let me know if you have any follow-up questions.
So the gsp_id is just one number, but it gets embedded into a higher dimension here. This should allow the model to learn specific behaviours for individual GSPs while also grouping together the behaviour of similar gsp_ids.
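To make the embedding idea concrete, here is a minimal sketch of an integer ID being mapped to a higher-dimensional vector. It uses a plain NumPy lookup table rather than the layer in the repo, and the names `NUM_GSPS`, `EMBEDDING_DIM`, `embedding_table` and `embed_gsp_ids`, along with their values, are assumptions made up for illustration. In a deep learning framework the table would typically be a trainable embedding layer whose rows are updated during training.

```python
import numpy as np

# Hypothetical sizes, chosen for illustration only (not taken from the repo).
NUM_GSPS = 320       # assumed number of distinct gsp_id values
EMBEDDING_DIM = 16   # assumed width of each embedding vector

rng = np.random.default_rng(seed=0)

# One row per gsp_id. In a real model these rows are trainable parameters;
# here they are only randomly initialised.
embedding_table = rng.normal(size=(NUM_GSPS, EMBEDDING_DIM))


def embed_gsp_ids(gsp_ids: np.ndarray) -> np.ndarray:
    """Look up one embedding vector per integer gsp_id."""
    return embedding_table[gsp_ids]


# A batch of four samples, each carrying a single integer gsp_id.
batch_gsp_ids = np.array([0, 5, 5, 319])
embedded = embed_gsp_ids(batch_gsp_ids)

print(batch_gsp_ids.shape)  # (4,)
print(embedded.shape)       # (4, 16)
```

Samples that share a gsp_id receive the same vector, and once the table is trained, GSPs that behave similarly can end up with nearby vectors, which is the grouping effect described above.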
Do you mind sharing where the `multimodal.yaml` file is? You are right that the channel dimension is the first one in that block of four. Our latest model is shared here with various configuration files.
Our batch size is normally a bit smaller than that, but it totally depends on your hardware and your model and data configuration. We work with 1 main …