Add a FIM pipeline to Providers #102
Conversation
model_in_request = completion_request['model']
if not model_in_request.startswith('anthropic/'):
    completion_request['model'] = f'anthropic/{model_in_request}'
why do we need this block? I don't think any anthropic models start with anthropic. Either way, why do we change the model to have the anthropic prefix?
According to the LiteLLM docs, if we prepend `anthropic/` to the model name it will always force the request to Anthropic. Otherwise, it tries to map the model name to one in its registry. It accepts most names, but when I tried with `claude-3-5-sonnet-latest` it was not accepted as a valid Anthropic model. Hence, I made this change. But I agree, maybe this is being too careful and can be removed.
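For illustration, a minimal sketch of how the prefixed model name ends up being passed to LiteLLM. The request contents here are placeholders, not the actual provider code:

```python
import litellm

# Placeholder request; in the provider this comes from the client.
completion_request = {
    "model": "claude-3-5-sonnet-latest",
    "messages": [{"role": "user", "content": "Hello"}],
}

# Prefixing with "anthropic/" forces LiteLLM to route to the Anthropic provider
# instead of trying to resolve the bare model name from its registry.
model_in_request = completion_request["model"]
if not model_in_request.startswith("anthropic/"):
    completion_request["model"] = f"anthropic/{model_in_request}"

response = litellm.completion(**completion_request)
```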
Drafting because, as it stands, FIM is working for Anthropic but will not work with the other providers that we have: OpenAI and llama.cpp. I will wait to rebase the changes by Jakub to get things working for all providers.
Looks good!
This adds a pipeline that processes the request before the completion is run, where the request can either be changed or short-circuited. The pipeline consists of steps; for now we implement a single step, `CodegateVersion`, that responds with the codegate version if the verbatim `codegate-version` string is found in the input. The pipeline also passes along a context. For now that is unused, but I thought this would be where we store extracted code snippets etc. To avoid import loops, we also move the `BaseCompletionHandler` class to a new `completion` package. Since the shortcut replies are more or less simple strings, we add yet another package, `providers/formatting`, whose responsibility is to convert the string returned by the shortcut response to the format expected by the client, meaning either a reply or a stream of replies in the LLM-specific format. We use the `BaseCompletionHandler` as a way to convert to the LLM-specific format.
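A rough sketch of what such a step could look like. The class and attribute names below are illustrative only, not the actual codegate interfaces:

```python
from dataclasses import dataclass, field
from typing import Optional

# Hypothetical version constant for the sketch.
CODEGATE_VERSION = "0.1.0"


@dataclass
class PipelineContext:
    # Shared state passed between steps, e.g. extracted code snippets later on.
    metadata: dict = field(default_factory=dict)


@dataclass
class PipelineResult:
    # If shortcut_response is set, the pipeline stops and the provider formats
    # this string instead of forwarding the request to the LLM.
    request: dict
    shortcut_response: Optional[str] = None


class CodegateVersionStep:
    """Respond with the codegate version if 'codegate-version' appears in the input."""

    def process(self, request: dict, context: PipelineContext) -> PipelineResult:
        user_text = " ".join(
            m.get("content", "")
            for m in request.get("messages", [])
            if isinstance(m.get("content"), str)
        )
        if "codegate-version" in user_text:
            return PipelineResult(
                request, shortcut_response=f"codegate version: {CODEGATE_VERSION}"
            )
        return PipelineResult(request)
```

In this sketch the provider's `BaseCompletionHandler` would then be responsible for turning the shortcut string into either a single reply or a stream of replies in the LLM-specific format.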
Related: #87, #43

The PR adds a FIM pipeline independent from the chat completion pipeline. It could still be faulty since we need:
- Message normalizer. We now expect all messages to have the key `messages`; however, there are incoming messages with `prompt` (see the sketch below).
- Secrets detector. There's the skeleton of a class called `SecretAnalyzer` that is meant to analyze the messages and return a warning if it detects a secret.
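A possible shape for such a normalizer, assuming FIM requests arrive with a top-level `prompt` key. This is a sketch, not the actual codegate implementation:

```python
def normalize_request(request: dict) -> dict:
    """Ensure the request uses the 'messages' key, converting FIM-style 'prompt' requests."""
    if "messages" in request:
        return request
    normalized = dict(request)
    prompt = normalized.pop("prompt", "")
    # Wrap the bare prompt as a single user message so downstream steps
    # can always iterate over request["messages"].
    normalized["messages"] = [{"role": "user", "content": prompt}]
    return normalized
```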
Don't merge yet. You should use the Go port that uses the large signatures list we have. I will have this up in a minute.