Thank you for your idea and the repo. Since the box embedding and w_g stay the same across the stacked multi-head attention layers and do not depend on k, q, v, would it be proper to move the box-embedding step to the beginning of the encoder, so that the boxes are not re-embedded in every EncoderLayer? I have tried this and found it reduces XE training time from 22h to 18h (on a GTX 1080Ti) without obvious performance degradation (CIDEr 1.1495 → 1.1485).
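For concreteness, here is a minimal sketch of the refactor I mean, using hypothetical module names (`Encoder`, `EncoderLayer`, `box_embed`), not the repository's actual code:

```python
import torch.nn as nn

class Encoder(nn.Module):
    """Sketch: the relative-geometry embedding is computed once, up front,
    and reused by every layer instead of being recomputed per EncoderLayer."""
    def __init__(self, layers, box_embed):
        super().__init__()
        self.layers = nn.ModuleList(layers)   # stack of EncoderLayer modules
        self.box_embed = box_embed            # Emb(lambda): boxes -> (B, N, N, d_g)

    def forward(self, x, boxes, mask=None):
        box_emb = self.box_embed(boxes)       # computed once; constant across layers
        for layer in self.layers:
            x = layer(x, box_emb, mask=mask)  # each layer reuses the same box_emb
        return x
```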
Equations (6) and (7) in the paper show that the box embedding Emb(\lambda) is indeed just a function of the bounding-box displacements, and is therefore constant across all the self-attention layers of the transformer encoder.
Therefore, as you say, the computation of Emb(\lambda) can be moved out of the self-attention layer.
However, as you can see in equation (7), the geometric weights w_g are a function of a learnable weight matrix W_G.
These learnable matrices are allowed to be different for different self-attention layers.
Therefore, the computation of w_g cannot be moved out of the self-attention layer.
Here is the computation of w_g in our code (notice the linear layer l()):
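A minimal sketch of that per-layer computation, with hypothetical names rather than the verbatim repository code:

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class GeometricAttentionWeights(nn.Module):
    """Sketch of the per-layer geometric weights: w_g = ReLU(l(Emb(lambda))),
    where l() is a learnable linear projection (the W_G of Eq. (7)) that is
    instantiated separately inside every self-attention layer."""
    def __init__(self, d_g, num_heads):
        super().__init__()
        # one learnable projection per head; these weights differ across layers,
        # which is why w_g cannot be precomputed outside the layers
        self.l = nn.ModuleList([nn.Linear(d_g, 1) for _ in range(num_heads)])

    def forward(self, box_emb):
        # box_emb: (B, N, N, d_g) -- the precomputed, layer-independent Emb(lambda)
        w_g = torch.cat([proj(box_emb) for proj in self.l], dim=-1)  # (B, N, N, heads)
        return F.relu(w_g).permute(0, 3, 1, 2)                       # (B, heads, N, N)
```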