You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Suppose I want to use your fusion method to fuse the image features [B * 3, 512] and text features [B, 512] extracted from open_clip. For this cross modal fusion, What changes do I need to make in your code to make it work?
Thank you for your great work!
The text was updated successfully, but these errors were encountered:
hi,
Suppose I want to use your fusion method to fuse the image features [B * 3, 512] and text features [B, 512] extracted from open_clip. For this cross modal fusion, What changes do I need to make in your code to make it work?
Thank you for your great work!
The text was updated successfully, but these errors were encountered: