Skip to content

Commit 0a41ffd

Browse files
2ez4bzdominicshanshan
authored andcommitted
[None][feat] Skip prefetching consolidated safetensors when appropriate (NVIDIA#7225)
* Why? Some models (e.g. anything produced by Mistral) can have both sharded safetensors and a consolidated safetensor in the same checkpoint directory. In such cases, prefetching both to memory is a waste of time, and memory. * What? This commit skips over consolidated safetensors when they are not the only safetensor file present in the checkpoint directory. Signed-off-by: William Zhang <[email protected]> Signed-off-by: Wangshanshan <[email protected]>
1 parent 905b598 commit 0a41ffd

File tree

0 file changed

+0
-0
lines changed

    0 file changed

    +0
    -0
    lines changed

    0 commit comments

    Comments
     (0)