
Dockerfile for llm and service in docker compose #98

Merged · 15 commits merged into main from Dockerfile_for_LLM on Feb 25, 2024

Conversation

@Kleczyk (Contributor) commented Feb 22, 2024

To run the LLM you need to download the model from Hugging Face:

wget -P ./llm/models https://huggingface.co/TheBloke/Llama-2-7B-GGUF/resolve/main/llama-2-7b.Q3_K_L.gguf

A smaller model does not work.

Endpoint for sending a prompt

POST 127.0.0.1:9000/v1/completions
Request body

{
  "prompt": "\n\n### Instructions:\nWhat is the capital of France?\n\n### Response:\n",
  "stop": [
    "\n",
    "###"
  ]
}
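The request above can be sent from Python using only the standard library. A minimal sketch, assuming the service from this compose setup is listening on 127.0.0.1:9000; the `complete` helper name is mine, not part of the project:

```python
import json
import urllib.request


def complete(prompt, stop, url="http://127.0.0.1:9000/v1/completions"):
    """POST a completion request (body shaped as in the PR description)
    and return the decoded JSON response."""
    body = json.dumps({"prompt": prompt, "stop": stop}).encode("utf-8")
    req = urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)
```

Calling `complete("\n\n### Instructions:\nWhat is the capital of France?\n\n### Response:\n", ["\n", "###"])` should then return the JSON document shown under "Successful Response" below.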

Successful Response

{
  "id": "string",
  "object": "text_completion",
  "created": 0,
  "model": "string",
  "choices": [
    {
      "text": "string",
      "index": 0,
      "logprobs": {
        "text_offset": [
          0
        ],
        "token_logprobs": [
          0,
          null
        ],
        "tokens": [
          "string"
        ],
        "top_logprobs": [
          {
            "additionalProp1": 0,
            "additionalProp2": 0,
            "additionalProp3": 0
          },
          null
        ]
      },
      "finish_reason": "stop"
    }
  ],
  "usage": {
    "prompt_tokens": 0,
    "completion_tokens": 0,
    "total_tokens": 0
  }
}
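Given a successful response shaped like the JSON above, the generated text lives at `choices[0].text`. A small illustration; the concrete values here are made up:

```python
# Example response in the shape documented above (values are made up).
response = {
    "id": "cmpl-1",
    "object": "text_completion",
    "created": 0,
    "model": "llama-2-7b.Q3_K_L.gguf",
    "choices": [
        {"text": "Paris.", "index": 0, "logprobs": None, "finish_reason": "stop"}
    ],
    "usage": {"prompt_tokens": 20, "completion_tokens": 3, "total_tokens": 23},
}

# The generated completion is the text of the first choice.
text = response["choices"][0]["text"]
print(text)  # Paris.
```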

Validation Error

{
  "detail": [
    {
      "loc": [
        "string",
        0
      ],
      "msg": "string",
      "type": "string"
    }
  ]
}
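The `detail` list in a validation error can be flattened into readable messages. A sketch with a hypothetical 422 payload; the field names match the schema above, the values are invented:

```python
# Hypothetical 422 payload in the documented shape.
error = {
    "detail": [
        {
            "loc": ["body", "prompt"],
            "msg": "field required",
            "type": "value_error.missing",
        }
    ]
}

# Join each error's location path and message into one readable line.
messages = [
    ".".join(str(part) for part in item["loc"]) + ": " + item["msg"]
    for item in error["detail"]
]
print(messages)  # ['body.prompt: field required']
```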

@Kleczyk Kleczyk mentioned this pull request Feb 22, 2024
@Kleczyk Kleczyk linked an issue Feb 22, 2024 that may be closed by this pull request
README.md Outdated
Comment on lines 78 to 80
```sh
wget https://huggingface.co/TheBloke/sheep-duck-llama-2-70B-v1.1-GGUF/resolve/main/sheep-duck-llama-2-70b-v1.1.Q4_K_S.gguf
wget -P ./llm/models https://huggingface.co/TheBloke/Llama-2-7B-GGUF/resolve/main/llama-2-7b.Q3_K_L.gguf
```
Member
suggestion: I'd add curl instructions as well (pro tip: the -o flag). Some systems lack wget and others lack curl, so providing both means nobody has to modify the command or install a new package.

llm/.gitignore Outdated (resolved)
llm/test/test_llm.py Outdated (resolved)
@pgronkievitz (Member) commented Feb 24, 2024
nitpick: this doesn't seem to be a proper pytest test, and it looks like a near-duplicate of the test_llm file

Contributor (author)
this is a quick check of what the LLM is responding with; it can sometimes be useful

Member
does it have any advantage over running a regular test that does exactly the same thing?

llm/test/test_llm.py (resolved)
@Kleczyk Kleczyk requested a review from pgronkievitz February 24, 2024 12:09
@pgronkievitz (Member) left a comment
in general lgtm

README.md Outdated
Comment on lines 78 to 80
```sh
wget https://huggingface.co/TheBloke/sheep-duck-llama-2-70B-v1.1-GGUF/resolve/main/sheep-duck-llama-2-70b-v1.1.Q4_K_S.gguf
curl -o ./llm/models/llama-2-7b.Q3_K_L.gguf -L https://huggingface.co/TheBloke/Llama-2-7B-GGUF/resolve/main/llama-2-7b.Q3_K_L.gguf
```
Member
suggestion (non-blocking): please include both the curl and wget instructions

@Kleczyk Kleczyk merged commit 5344c0e into main Feb 25, 2024
2 checks passed
@Kleczyk Kleczyk deleted the Dockerfile_for_LLM branch February 25, 2024 10:44
Development

Successfully merging this pull request may close these issues.

Postawienie LLMa (Polish: "Setting up the LLM")
2 participants