Question about hierarchical Structure #65

Titolasanta · 2024-01-11T01:34:13Z

Hi, researching your paper I read that the used architecture utilizes a hierarchical Structure in a similar way as in the Hourglass transformer ( https://arxiv.org/abs/2110.13711 ). But for your case you are utilizing a encoder-decoder transformer, so how do you reshape what would usually be the input for your decoders which consists of both the output of the encoder which I would consider to be downsampled and the original input, which would be in the original input shape?

Thank you

gaozhihan · 2024-01-11T03:44:43Z

Thank you for your interest and your question. It may help to refer to the following piece of our implementation:

earth-forecasting-transformer/src/earthformer/cuboid_transformer/cuboid_transformer.py

Lines 3192 to 3195 in 7732b03

    
           if self.num_global_vectors > 0: 
        
               dec_out = self.decoder(initial_z, mem_l, mem_global_vector_l) 
        
           else: 
        
               dec_out = self.decoder(initial_z, mem_l)

The decoder typically takes as input the outputs from each layer of the encoder, as well as a dummy input named initial_z that matches the shape of the encoder's final layer output.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about hierarchical Structure #65

Question about hierarchical Structure #65

Titolasanta commented Jan 11, 2024

gaozhihan commented Jan 11, 2024

Question about hierarchical Structure #65

Question about hierarchical Structure #65

Comments

Titolasanta commented Jan 11, 2024

gaozhihan commented Jan 11, 2024