You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I was very impressed with your amazing work, which enables training dynamic document clustering through tokenization process.
I have carefully read your paper and the code to understand the details correctly but still have a question about the intent of reconstruction loss on initialization stage.
First, in your paper, it is said for building reconstruction loss $\mathcal L_{Rec}$ as:
warm-up the model by passing the continuous representation $d_T$ to the reconstruction model instead of the docid
representation $z_T$
But, I found that the code actually calculates reconstruction loss as below, and I cannot find any replacement described above in the code:
In summary, I found some inconsistency between the paper and the code in calculating reconstruction loss on codebook initialization stage.
If I understand correctly, the reconstruction is actually not being used on the initialization phase, is this correct?
Thanks for sharing your wonderful work.
Best regards,
JB
The text was updated successfully, but these errors were encountered:
Thank you for pointing out this issue. The code is inconsistent with the original paper in this part. Some analysis shows that with the query-doc contrastive loss, the model outputs document embeddings used for initialization; and the doc-doc contrastive loss (reconstruction loss) may be less useful and is omitted.
Dear authors.
I was very impressed with your amazing work, which enables training dynamic document clustering through tokenization process.
I have carefully read your paper and the code to understand the details correctly but still have a question about the intent of reconstruction loss on initialization stage.
First, in your paper, it is said for building reconstruction loss$\mathcal L_{Rec}$ as:
But, I found that the code actually calculates reconstruction loss as below, and I cannot find any replacement described above in the code:
GenRet/run.py
Lines 403 to 404 in d3c1609
Furthermore, in
main()
function, theloss_w
is configured as1
:GenRet/run.py
Line 1154 in d3c1609
GenRet/run.py
Lines 687 to 688 in d3c1609
In summary, I found some inconsistency between the paper and the code in calculating reconstruction loss on codebook initialization stage.
If I understand correctly, the reconstruction is actually not being used on the initialization phase, is this correct?
Thanks for sharing your wonderful work.
Best regards,
JB
The text was updated successfully, but these errors were encountered: