
About GPU usage during inference #21

Open
yangzijia opened this issue Jun 19, 2024 · 1 comment

@yangzijia

Hi, thank you for sharing. I found that after pruning, the model size of SlimSAM-77 is only 38M, the same as EdgeSAM, yet SlimSAM's GPU memory usage during inference is still very high (3071 MiB) while EdgeSAM's is only 433 MiB. Why does this happen? I don't quite understand the technical reason behind it.

yangzijia changed the title from "GPU utilization" to "About GPU usage during inference" on Jun 19, 2024
@czg1225 (Owner) commented Jun 19, 2024

Hi @yangzijia,
SAM's memory footprint during inference is dominated by the global attention blocks in its image encoder, not by the weights themselves. SlimSAM's channel pruning greatly reduces the parameter count, but the number of image tokens is unchanged, and the attention activations scale with the square of the token count, so GPU memory stays high. If you want to reduce it further, you can try applying token merging or token pruning techniques on top of the pruned model.
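To make the scaling concrete, here is a minimal back-of-the-envelope sketch. The token and head counts are illustrative assumptions (a SAM-style ViT encoder on a 1024×1024 input with 16×16 patches), not SlimSAM's exact configuration:

```python
# Minimal sketch (not SlimSAM code): why global-attention memory
# tracks token count rather than channel width.
def attn_scores_mib(num_tokens: int, num_heads: int, dtype_bytes: int = 2) -> float:
    """Size of the (heads x N x N) attention-score tensor for one layer, in MiB."""
    return num_heads * num_tokens**2 * dtype_bytes / 2**20

n = 64 * 64  # assumed: 1024px image / 16px patches -> 4096 tokens
print(attn_scores_mib(n, num_heads=12))       # 384.0 MiB per global-attention layer
# Channel pruning shrinks the weights and the per-head dimension,
# but N is unchanged, so this tensor is just as large after pruning.
print(attn_scores_mib(n // 2, num_heads=12))  # 96.0 MiB: halving tokens gives a 4x saving
```

Kernels like FlashAttention avoid materializing the full score matrix, but the quadratic cost in N remains; token merging or token pruning reduces N itself, which is why it can help where channel pruning cannot.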
