Skip to content
This repository was archived by the owner on Oct 25, 2024. It is now read-only.

Conversation

letonghan
Copy link
Contributor

@letonghan letonghan commented Feb 19, 2024

Type of Change

feature
API added:

  • /v1/assist/chat
  • /v1/assist/decode
  • /v1/assist/data_transfer

Description

Support Assisted Generation on Multi-nodes.
The code framework is implemented. Details will be completed by Wangyi's team.
JIRA: https://jira.devtools.intel.com/browse/NLPTOOLKIU-1126

Expected Behavior & Potential Risk

The assisted generation restful api will be able to run on multi-nodes.

How has this PR been tested?

Local. Draft PR now.

Dependency Change?

None.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants