Skip to content

Commit

Permalink
✨🔧 configurable model by cli in server mode
Browse files Browse the repository at this point in the history
  • Loading branch information
chriamue committed Jan 8, 2024
1 parent e5b4380 commit 846ddd0
Show file tree
Hide file tree
Showing 2 changed files with 5 additions and 1 deletion.
2 changes: 2 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -99,6 +99,7 @@ The following table shows the performance metrics of the model on different syst
| 7b-open-chat-3.5 | AMD 7900X3D (12 Core) 64GB | 9.4 tokens/s |
| 7b-open-chat-3.5 | AMD 5600G (8 Core VM) 16GB | 2.8 tokens/s |
| 13b (llama2 13b) | AMD 7900X3D (12 Core) 64GB | 5.2 tokens/s |
| phi-2 | AMD 7900X3D (12 Core) 64GB | 20.6 tokens/s |

## Todo

Expand All @@ -108,3 +109,4 @@ The following table shows the performance metrics of the model on different syst
- [x] docker image and docker-compose
- [ ] add tests
- [ ] add documentation
- [ ] fix stop token
4 changes: 3 additions & 1 deletion src/main.rs
Original file line number Diff line number Diff line change
Expand Up @@ -90,7 +90,9 @@ async fn main() {
let opt = Opt::parse();

match load_config(&opt.config) {
Ok(config) => {
Ok(mut config) => {
config.model = opt.model.unwrap_or(config.model);

info!("Loaded config: {:?}", config);
if let Some(prompt) = opt.prompt {
let parameter = GenerateParameter {
Expand Down

0 comments on commit 846ddd0

Please sign in to comment.