chop down position embedding matrix by 2 (1+padding_idx) #68
