How to only input text feature or video feature #40

tingchihc · 2022-08-03T22:49:45Z

I want to input only the text feature or only the video feature into UniVL. The paper says that one transformer combines the text representation T and the video representation V. Could you tell me how to change the model so that it takes only T or only V as input? Thanks.

ArrowLuo · 2022-08-08T05:00:10Z

Hi @ting-chih, sorry for the delayed reply. The model still needs both T and V, but either one can be masked if you only want to input the other. For example, for video only, T is just [CLS][SEP]; for text only, V is all zeros. Best~
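
A minimal sketch of the masking described above, in plain Python. Everything named here is an assumption for illustration and not UniVL's actual API: the helper names `video_only_text_inputs` and `text_only_video_inputs` are hypothetical, the token ids are those of the `bert-base-uncased` vocabulary, and the choice to set the video mask to 0 for text-only input is a guess, since the reply only says that V is all zeros. Check the mask convention against the model code before relying on it.

```python
# Hypothetical helpers: names, shapes, and ids are assumptions, not UniVL's API.
CLS_ID, SEP_ID, PAD_ID = 101, 102, 0  # bert-base-uncased ids (assumed tokenizer)


def video_only_text_inputs(max_words):
    """Text side for video-only input: T is just [CLS][SEP], then padding."""
    if max_words < 2:
        raise ValueError("max_words must leave room for [CLS] and [SEP]")
    pad_len = max_words - 2
    input_ids = [CLS_ID, SEP_ID] + [PAD_ID] * pad_len
    input_mask = [1, 1] + [0] * pad_len   # attend only to [CLS] and [SEP]
    segment_ids = [0] * max_words         # single segment
    return input_ids, input_mask, segment_ids


def text_only_video_inputs(max_frames, video_dim):
    """Video side for text-only input: V is an all-zero feature matrix."""
    video = [[0.0] * video_dim for _ in range(max_frames)]
    # Assumption: every frame is also masked out (0 = ignore this frame).
    video_mask = [0] * max_frames
    return video, video_mask


if __name__ == "__main__":
    ids, mask, seg = video_only_text_inputs(max_words=6)
    print(ids, mask, seg)
    video, vmask = text_only_video_inputs(max_frames=2, video_dim=4)
    print(video, vmask)
```

The real inputs would normally be batched tensors; the lists here only show which positions carry content and which are masked.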

tiesanguaixia · 2023-05-20T15:33:18Z

> I want to input only the text feature or only the video feature into UniVL. The paper says that one transformer combines the text representation T and the video representation V. Could you tell me how to change the model so that it takes only T or only V as input? Thanks.

Hi! Do you know how to download the raw videos of YouCook2? Thank you very much!
