Add llm-benchmarks proposal #113

IcyFeather233 · 2024-07-06T08:18:26Z

What type of PR is this?

OSPP proposal

What this PR does / why we need it:

Investigated various aspects of implementing the LLM Benchmark in Ianvs, introduced the integration plan of opencompass, and achieved the dataset map.

IcyFeather233 · 2024-07-06T08:56:36Z

By the way, my implementation is in https://github.com/IcyFeather233/ianvs/tree/dev, you can follow this readme to try llm single task learning and opencompass

hsj576 · 2024-07-18T10:03:56Z

The PR should be translated into English to facilitate understanding by people from other countries.

IcyFeather233 · 2024-07-18T10:10:09Z

https://github.com/IcyFeather233/ianvs/tree/dev

Thanks for the advice! I will translate it after the proposal content will not be changed.

MooreZheng · 2024-07-18T11:41:07Z

docs/proposals/scenarios/llm-benchmarks/llm-benchmarks.md

This proposal is related to #95.
Great to see a comprehensive proposal. This one is close to the final version. As discussed in the routine meeting, there might be some advices.

Though the directories are clear, an architecture is still needed to clarify what is modified in this project. E.g., what is in the TestEnv, TestCase. It seems to me that at least we have made changes on the data format to support NLP. You might want to take a look at https://github.com/kubeedge/ianvs/pull/122/files for an architecture example.

Prompts would have different forms, e.g., N shots. How shall we develop a prompt template to adapt all these? It looks interesting and challenging.

Now I add my changes to ianvs core, including 2 flowcharts showing the architecture. And a more complete benchmark format is updated in the doc.

Signed-off-by: IcyFeather <[email protected]>

hsj576 · 2024-08-01T09:53:57Z

/lgtm

MooreZheng

Overall looks good to me. Just a few comments would be considered:

The revision for index in env manager of ianvs core, which should be highlighted: The proposed design also aims to support both the old and new index.
The integration part of OpenCompass should also be highlighted in the structure.png. A tutorial is needed.
The prompt template is extendable, including the keys and prompts, which can be also highlighted in the proposal.

MooreZheng

/lgtm

Signed-off-by: IcyFeather <[email protected]>

kubeedge-bot · 2024-08-14T07:36:16Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: MooreZheng

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [MooreZheng]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

ojuschugh1 · 2024-08-20T06:27:47Z

Hi @IcyFeather233 , @hsj576 , @MooreZheng , I wanted to discuss one thing regarding this project. Is there any way I can chat/discuss this with you? I am unable to find anyone of you on slack channel

MooreZheng · 2024-08-29T10:55:38Z

Hi @IcyFeather233 , @hsj576 , @MooreZheng , I wanted to discuss one thing regarding this project. Is there any way I can chat/discuss this with you? I am unable to find anyone of you on slack channel

Hi, the slack channel currently has networking issues for Chinese users. You might want to leave your comment here or raise a issue.

jaypume · 2024-08-29T12:28:24Z

/lgtm

IcyFeather233 force-pushed the main branch from 8366629 to a1bda28 Compare July 6, 2024 08:21

IcyFeather233 force-pushed the main branch from 2124251 to 07e6090 Compare July 18, 2024 09:08

MooreZheng reviewed Jul 18, 2024

View reviewed changes

IcyFeather233 added 4 commits July 19, 2024 11:17

add llm-benchmarks proposal

8905586

Signed-off-by: IcyFeather <[email protected]>

update llm benchmark proposal

7754088

Signed-off-by: IcyFeather <[email protected]>

update llm benchmark proposal

f5d74a1

Signed-off-by: IcyFeather <[email protected]>

update llm benchmark proposal

6862f88

Signed-off-by: IcyFeather <[email protected]>

IcyFeather233 force-pushed the main branch from 838aee5 to 6862f88 Compare July 19, 2024 03:17

translate llm-benchmark proposal

4b6afa1

Signed-off-by: IcyFeather <[email protected]>

MooreZheng reviewed Aug 1, 2024

View reviewed changes

MooreZheng approved these changes Aug 1, 2024

View reviewed changes

update proposal, add opencompass tutorial

8115c14

Signed-off-by: IcyFeather <[email protected]>

MooreZheng requested a review from hsj576 August 13, 2024 03:16

MooreZheng added the proposal PR label Aug 14, 2024

MooreZheng closed this Aug 14, 2024

MooreZheng reopened this Aug 14, 2024

kubeedge-bot added the size/XXL Denotes a PR that changes 1000+ lines, ignoring generated files. label Aug 14, 2024

kubeedge-bot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Aug 14, 2024

MooreZheng added kind/design Categorizes issue or PR as related to design. and removed approved Indicates a PR has been approved by an approver from all required OWNERS files. labels Aug 29, 2024

MooreZheng removed the proposal PR label Aug 29, 2024

kubeedge-bot assigned jaypume Aug 29, 2024

kubeedge-bot added the lgtm Indicates that a PR is ready to be merged. label Aug 29, 2024

jaypume merged commit 1aae17f into kubeedge:main Aug 29, 2024
19 of 23 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add llm-benchmarks proposal #113

Add llm-benchmarks proposal #113

IcyFeather233 commented Jul 6, 2024

IcyFeather233 commented Jul 6, 2024

hsj576 commented Jul 18, 2024

IcyFeather233 commented Jul 18, 2024

MooreZheng Jul 18, 2024 •

edited

Loading

IcyFeather233 Jul 19, 2024

hsj576 commented Aug 1, 2024

MooreZheng left a comment

MooreZheng left a comment

kubeedge-bot commented Aug 14, 2024

ojuschugh1 commented Aug 20, 2024 •

edited

Loading

MooreZheng commented Aug 29, 2024

jaypume commented Aug 29, 2024

Add llm-benchmarks proposal #113

Add llm-benchmarks proposal #113

Conversation

IcyFeather233 commented Jul 6, 2024

IcyFeather233 commented Jul 6, 2024

hsj576 commented Jul 18, 2024

IcyFeather233 commented Jul 18, 2024

MooreZheng Jul 18, 2024 • edited Loading

Choose a reason for hiding this comment

IcyFeather233 Jul 19, 2024

Choose a reason for hiding this comment

hsj576 commented Aug 1, 2024

MooreZheng left a comment

Choose a reason for hiding this comment

MooreZheng left a comment

Choose a reason for hiding this comment

kubeedge-bot commented Aug 14, 2024

ojuschugh1 commented Aug 20, 2024 • edited Loading

MooreZheng commented Aug 29, 2024

jaypume commented Aug 29, 2024

MooreZheng Jul 18, 2024 •

edited

Loading

ojuschugh1 commented Aug 20, 2024 •

edited

Loading