Support for Multi-GPU training? #6

Victorwz · 2022-08-16T15:47:19Z

Thank you so much for the great implementation. I would like to ask whether your implementation for Memorizing Transformer could support multi-card distributed training like original paper. If you distribute the memorizingtrransformer model you created to each GPU, then every GPU would hold a memory with a retrieval faiss index. Therefore, each model on different GPU holds different memory database and retrieval index, which is different from the original paper. I regard that each model on different GPU should share the same retrieval context. This problem confuses me a lot.

Thank you so much for your time. Looking forward to your response!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support for Multi-GPU training? #6

Support for Multi-GPU training? #6

Victorwz commented Aug 16, 2022

Support for Multi-GPU training? #6

Support for Multi-GPU training? #6

Comments

Victorwz commented Aug 16, 2022