
Adaptive output and contextual dialogue capabilities of text-generation-inference #424

Open

MLikeWater opened this issue Sep 26, 2023 · 1 comment
Labels
bug Something isn't working

Comments

MLikeWater commented Sep 26, 2023

System Info

HL-SMI Version: hl-1.11.0-fw-45.1.1.1
Driver Version: 1.11.0-e6eb0fd

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

I deployed the Llama-2-7b-chat-hf model through text-generation-inference, but there is no adaptive output when using the following command; instead, the output length is always exactly max_new_tokens.

curl 127.0.0.1:8080/generate_stream -X POST -d '{"inputs":"What is Deep Learning?","parameters":{"max_new_tokens":200}}' -H 'Content-Type: application/json'
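
For reference: text-generation-inference treats max_new_tokens as an upper bound and stops earlier if the model emits its end-of-sequence token. Chat-tuned Llama-2 models are much more likely to emit EOS when the prompt follows the Llama-2 chat template, so a sketch like the following (the [INST] wrapping comes from the Llama-2 model card, not from this issue) may already give output of an adaptive length:

# Same request, but with the prompt wrapped in the Llama-2 chat template.
# The model should then emit </s> when its answer is complete, ending the
# stream before the max_new_tokens cap is reached.
curl 127.0.0.1:8080/generate_stream \
    -X POST \
    -H 'Content-Type: application/json' \
    -d '{"inputs":"<s>[INST] What is Deep Learning? [/INST]","parameters":{"max_new_tokens":200}}'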

Also, how can I implement chat functionality with context? Similar to GPT-4, the model should adaptively output content of an appropriate length and be able to carry on a dialogue with context.
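
Since the /generate endpoint is stateless, context is usually carried by resending the whole conversation in each request, concatenated with the model's chat template. A minimal sketch, again assuming the Llama-2 format (the assistant turn shown is placeholder text, not real model output):

# Each request contains the full history: earlier [INST] ... [/INST] pairs
# followed by the model's earlier answers, then the new user turn.
curl 127.0.0.1:8080/generate \
    -X POST \
    -H 'Content-Type: application/json' \
    -d '{"inputs":"<s>[INST] What is Deep Learning? [/INST] Deep Learning is a branch of machine learning based on deep neural networks. </s><s>[INST] Can you give a concrete application? [/INST]","parameters":{"max_new_tokens":200}}'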

Expected behavior

  1. adaptive output
  2. dialogue with context
@MLikeWater MLikeWater added the bug Something isn't working label Sep 26, 2023
regisss (Collaborator) commented Oct 13, 2023

@MLikeWater What do you mean exactly by adaptive output?
