Update llama-run to include temperature option #10899

ericcurtin · 2024-12-19T14:00:35Z

This commit updates the examples/run/README.md file to include a new option for setting the temperature and updates the run.cpp file to parse this option.

ngxson

Tbh I'm not sure what's the long-term plan for llama-run.

My thought is that if now we add --temp, I'm pretty sure someone will also add other sampling params like top-k, top-p, DRY, etc in the near future, to a point that it will defeat the initial goal of llama-run which is "just run".

examples/run/run.cpp

ericcurtin · 2024-12-19T17:21:24Z

Tbh I'm not sure what's the long-term plan for llama-run.

My thought is that if now we add --temp, I'm pretty sure someone will also add other sampling params like top-k, top-p, DRY, etc in the near future, to a point that it will defeat the initial goal of llama-run which is "just run".

I had a use case for --temp.

If somebody has a use case for extra arguments, I don't have an immediate issue with merging them. Yes, I'd hope it would be less complex than llama-cli, but personally I have no issue with people adding extra args if they need them.

This commit updates the `examples/run/README.md` file to include a new option for setting the temperature and updates the `run.cpp` file to parse this option. Signed-off-by: Eric Curtin <[email protected]>

ericcurtin · 2024-12-19T18:11:03Z

But of course, simple use cases like:

llama-run smollm:135m

should continue to work.

ericcurtin · 2024-12-20T13:24:55Z

This should be an easy review @slaren @ggerganov

ericcurtin mentioned this pull request Dec 19, 2024

Add llama-cpp-python server containers/ramalama#452

Draft

github-actions bot added the examples label Dec 19, 2024

ericcurtin force-pushed the llama-run-temp branch 3 times, most recently from ca259bd to cd61ea0 Compare December 19, 2024 14:43

ngxson reviewed Dec 19, 2024

View reviewed changes

examples/run/run.cpp Outdated Show resolved Hide resolved

examples/run/run.cpp Outdated Show resolved Hide resolved

ericcurtin force-pushed the llama-run-temp branch from cd61ea0 to ef5d16f Compare December 19, 2024 17:19

Update llama-run to include temperature option

d0c0945

This commit updates the `examples/run/README.md` file to include a new option for setting the temperature and updates the `run.cpp` file to parse this option. Signed-off-by: Eric Curtin <[email protected]>

ericcurtin force-pushed the llama-run-temp branch from ef5d16f to d0c0945 Compare December 19, 2024 17:22

ggerganov approved these changes Dec 21, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update llama-run to include temperature option #10899

Update llama-run to include temperature option #10899

ericcurtin commented Dec 19, 2024 •

edited

Loading

ngxson left a comment

ericcurtin commented Dec 19, 2024

ericcurtin commented Dec 19, 2024

ericcurtin commented Dec 20, 2024

Update llama-run to include temperature option #10899

Are you sure you want to change the base?

Update llama-run to include temperature option #10899

Conversation

ericcurtin commented Dec 19, 2024 • edited Loading

ngxson left a comment

Choose a reason for hiding this comment

ericcurtin commented Dec 19, 2024

ericcurtin commented Dec 19, 2024

ericcurtin commented Dec 20, 2024

ericcurtin commented Dec 19, 2024 •

edited

Loading