Conversation

@TedThemistokleous (Collaborator)

Description

Add the quantization, tuning, and memory-limit settings as inputs to the final hashed output name for a model. This ensures that we're not reusing a differently quantized or tuned model from a previous session.
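
As a rough sketch of the idea (not the actual code in this PR), folding every compile-affecting option into the hash means two sessions with different settings can never collide on the same cached file. The names here (`SessionOptions`, `make_cache_name`) are hypothetical:

```cpp
#include <cstddef>
#include <functional>
#include <sstream>
#include <string>

// Hypothetical subset of the options that affect the compiled model.
struct SessionOptions {
    bool fp16_quantize = false;    // quantization
    bool exhaustive_tune = false;  // tuning
    std::size_t mem_limit = 0;     // memory limit in bytes
};

// Mix the model identity together with the compile options, so a model
// compiled under different options gets a different cached file name.
std::string make_cache_name(const std::string& model_key,
                            const SessionOptions& opts) {
    std::ostringstream key;
    key << model_key << '|' << opts.fp16_quantize << '|'
        << opts.exhaustive_tune << '|' << opts.mem_limit;
    return std::to_string(std::hash<std::string>{}(key.str()));
}
```

Recompiling with `exhaustive_tune` toggled or a different `mem_limit` then produces a new cache entry instead of silently reusing the stale one.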

Motivation and Context

TedThemistokleous self-assigned this on Oct 3, 2025
TedThemistokleous added the Bugfix label on Oct 3, 2025
Turns out we weren't passing the exhaustive-tune flags in for the recompile, along with some other flags like mem_limit.