Skip to content

Commit

Permalink
DEBUG
Browse files Browse the repository at this point in the history
  • Loading branch information
tgaddair committed Jun 3, 2024
1 parent ccc6c06 commit bfac3b8
Show file tree
Hide file tree
Showing 2 changed files with 12 additions and 0 deletions.
11 changes: 11 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -100,6 +100,17 @@ curl 127.0.0.1:8080/generate \
-H 'Content-Type: application/json'
```

Embed:

```shell
curl 127.0.0.1:8080/embed \
-X POST \
-d '{
"inputs": "[INST] Natalia sold clips to 48 of her friends in April, and then she sold half as many clips in May. How many clips did Natalia sell altogether in April and May? [/INST]"
}' \
-H 'Content-Type: application/json'
```

Prompt a LoRA adapter:

```shell
Expand Down
1 change: 1 addition & 0 deletions server/lorax_server/server.py
Original file line number Diff line number Diff line change
Expand Up @@ -96,6 +96,7 @@ async def Prefill(self, request: generate_pb2.PrefillRequest, context):
)

async def Embed(self, request: generate_pb2.EmbedRequest, context):
print("!!! EMBED")
if not self.model.supports_embeddings:
raise ValueError("Model does not support embeddings")

Expand Down

0 comments on commit bfac3b8

Please sign in to comment.