[None][feat] Skip prefetching consolidated safetensors when appropriate (NVIDIA#7225)
* Why?
Some models (e.g. anything produced by Mistral) can have both sharded
safetensors and a consolidated safetensor in the same checkpoint
directory. In such cases, prefetching both into memory wastes both time
and memory.
* What?
This commit skips over consolidated safetensors when they are not the
only safetensor file present in the checkpoint directory.
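The skip logic described above can be sketched as follows. This is a hedged illustration, not the actual TensorRT-LLM implementation; the helper name `files_to_prefetch` and the `consolidated` filename prefix convention are assumptions for the example.

```python
from pathlib import Path


def files_to_prefetch(checkpoint_dir: str) -> list[str]:
    # Hypothetical helper sketching the commit's behavior: skip
    # consolidated safetensors when sharded shards are also present.
    safetensors = sorted(
        p.name for p in Path(checkpoint_dir).glob("*.safetensors"))
    consolidated = [f for f in safetensors if f.startswith("consolidated")]
    sharded = [f for f in safetensors if f not in consolidated]
    # Only skip the consolidated file(s) when they are not the sole
    # safetensors in the checkpoint directory.
    return sharded if sharded else consolidated
```

With both kinds of files present, only the sharded files are prefetched; with a consolidated file alone, it is still prefetched as before.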
Signed-off-by: William Zhang <[email protected]>
Signed-off-by: Wangshanshan <[email protected]>