LFX-Proposal: Cloud-edge collaborative speculative decoding for LLM based on KubeEdge-Ianvs #156

FuryMartin · 2024-10-14T15:21:16Z

What type of PR is this?
/kind design

What this PR does / why we need it:

Proposal for LFX Project CNCF - KubeEdge: Cloud-Edge Speculative Decoding for LLM via KubeEdge-Ianvs

Which issue(s) this PR fixes:

Fixes #126

kubeedge-bot · 2024-10-14T15:21:26Z

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by:
To complete the pull request process, please assign moorezheng after the PR has been reviewed.
You can assign the PR to them by writing /assign @moorezheng in a comment when ready.

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these files:

OWNERS

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

MooreZheng

Got to fix the below CI errors for Pylint (3.9) before further actions, see CI logs

Run pylint '/home/runner/work/ianvs/ianvs/core'
core/testenvmanager/dataset/dataset.py:119:4: R0917: Too many positional arguments (8/5) (too-many-positional-arguments)
core/testenvmanager/dataset/dataset.py:206:4: R0917: Too many positional arguments (6/5) (too-many-positional-arguments)
core/testenvmanager/dataset/dataset.py:213:4: R0917: Too many positional arguments (7/5) (too-many-positional-arguments)
core/testenvmanager/dataset/dataset.py:246:4: R0917: Too many positional arguments (7/5) (too-many-positional-arguments)
core/testenvmanager/dataset/dataset.py:285:4: R0917: Too many positional arguments (7/5) (too-many-positional-arguments)
core/testenvmanager/dataset/dataset.py:329:4: R0917: Too many positional arguments (7/5) (too-many-positional-arguments)
core/testenvmanager/dataset/dataset.py:368:4: R0917: Too many positional arguments (6/5) (too-many-positional-arguments)
************* Module core.testcasecontroller.algorithm.paradigm.singletask_learning.singletask_learning_active_boost
core/testcasecontroller/algorithm/paradigm/singletask_learning/singletask_learning_active_boost.py:66:4: R0917: Too many positional arguments (7/5) (too-many-positional-arguments)

-----------------------------------
Your code has been rated at 9.95/10

Error: Process completed with exit code 8.

FuryMartin · 2024-10-24T05:25:09Z

Got to fix the below CI errors for Pylint (3.9) before further actions, see CI logs

This is fixed by #158

hsj576

The implementation of cloud-edge collaborative speculative decoding in the proposal needs to be further refined. According to what we discussed in the regular community meeting, speculative decoding can be implemented in the cloud by still adopting the hard example mining paradigm on the edge side.

MooreZheng

Overall it looks fine to me. Might need to highlight the difference against the OSPP proposal

Signed-off-by: Yu Fan <[email protected]> doc: modify proposal Signed-off-by: Yu Fan <[email protected]>

hsj576 · 2024-11-28T09:39:06Z

It is necessary to highlight why we use speculative decoding to accelerate LLM cloud-edge collaborative inference in the motivation section. The differences between this proposal and OSPP proposal should be further highlighted in the methods section.

MooreZheng

Overall it looks fine. As discussed at the routine meeting, there are a few points yet to be achieved.

The motivation for using the technique is not quite clear. I believe that it would make sense the improve the inference time. But the reason and potential of solving this problem could be further explored, e.g., by adding examples.
Need to highlight the difference against the existing design.

kubeedge-bot added the kind/design Categorizes issue or PR as related to design. label Oct 14, 2024

kubeedge-bot requested review from jaypume and MooreZheng October 14, 2024 15:21

kubeedge-bot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Oct 14, 2024

MooreZheng requested review from hsj576 and removed request for jaypume October 15, 2024 03:08

MooreZheng assigned FuryMartin and hsj576 Oct 15, 2024

MooreZheng requested changes Oct 15, 2024

View reviewed changes

kubeedge-bot assigned MooreZheng Oct 15, 2024

FuryMartin force-pushed the lfx-proposal branch from 27ab314 to d019f65 Compare October 24, 2024 05:23

hsj576 suggested changes Oct 26, 2024

View reviewed changes

FuryMartin force-pushed the lfx-proposal branch from d019f65 to 7c0e001 Compare November 7, 2024 08:21

kubeedge-bot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Nov 7, 2024

FuryMartin force-pushed the lfx-proposal branch from 7c0e001 to 6e86e1a Compare November 7, 2024 08:23

FuryMartin changed the title ~~Cloud-edge collaborative speculative decoding for LLM based on KubeEdge-Ianvs~~ LFX-Proposal: Cloud-edge collaborative speculative decoding for LLM based on KubeEdge-Ianvs Nov 7, 2024

MooreZheng requested changes Nov 7, 2024

View reviewed changes

FuryMartin force-pushed the lfx-proposal branch from 6e86e1a to 25fcccb Compare November 28, 2024 06:43

doc: add proposal for cloud-edge speculative decoding strategy

746c70d

Signed-off-by: Yu Fan <[email protected]> doc: modify proposal Signed-off-by: Yu Fan <[email protected]>

FuryMartin force-pushed the lfx-proposal branch from 25fcccb to 746c70d Compare November 28, 2024 07:44

MooreZheng requested changes Nov 28, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LFX-Proposal: Cloud-edge collaborative speculative decoding for LLM based on KubeEdge-Ianvs #156

LFX-Proposal: Cloud-edge collaborative speculative decoding for LLM based on KubeEdge-Ianvs #156

FuryMartin commented Oct 14, 2024

kubeedge-bot commented Oct 14, 2024

MooreZheng left a comment •

edited

Loading

FuryMartin commented Oct 24, 2024

hsj576 left a comment

MooreZheng left a comment

hsj576 commented Nov 28, 2024

MooreZheng left a comment

LFX-Proposal: Cloud-edge collaborative speculative decoding for LLM based on KubeEdge-Ianvs #156

Are you sure you want to change the base?

LFX-Proposal: Cloud-edge collaborative speculative decoding for LLM based on KubeEdge-Ianvs #156

Conversation

FuryMartin commented Oct 14, 2024

kubeedge-bot commented Oct 14, 2024

MooreZheng left a comment • edited Loading

Choose a reason for hiding this comment

FuryMartin commented Oct 24, 2024

hsj576 left a comment

Choose a reason for hiding this comment

MooreZheng left a comment

Choose a reason for hiding this comment

hsj576 commented Nov 28, 2024

MooreZheng left a comment

Choose a reason for hiding this comment

MooreZheng left a comment •

edited

Loading