Fetch from hub #1160

mikeshi80 · 2024-03-19T09:42:38Z

Add the function that user can register custom the model from model hub (HuggingFace, ModelScope) by selecting model type Hub, and input the model Id, and user can import basic info like context length, model size, quantization by click "Import Model" button.

The model will be download from model hub at first running.

TODO: the cached model will not show (cached) tag in model card in "Launch Model" page.

在自定义模型时加入了从Model Hub下载的功能，只要选择Model Source是Hub，并从HuggingFace或ModelScope中选择一个，然后指定Model Id就能注册新模型，并且可以通过点击"Import Model"从Hub中导入诸如context length， model size和量化信息。

注册后的模型会在第一次启动时从Model Hub下载。

TODO：在"Launch Model"页面的模型卡片中，自定义模型在模型下载后不会显示cached标签。

…get model hub infos.

ChengjieLi28 · 2024-03-19T10:23:00Z

Could you please paste some screenshots to show your PR effect?

mikeshi80 · 2024-03-20T01:57:36Z

Add the "Model Source" option in "Register Model" Page.

"Self Hosted" is same as the original design, "Hub" is the new function for registering new model by model hub and model id directly.

If the model format is "GGML" or "GGUF", the extra fields "Model File Name Template" and "Model File Name Split Template" will show out. If "Model File Name Split Template" is not empty, the extra "Quantization Parts" field shows.

When the correct model id and model hub are provided, click "IMPORT MODEL" button, it will fetch the model info and fill the corresponding fields, like "Context Length", "Model Size in Billions", "Model File Name Template", "Model File Name Split Template", "Quantization" and "Quantization Parts", you can modify them if there is anything wrong.

When the other options are selected correctly, the model can be registered like a built-in model, you do not download it until the first running.

mikeshi80 added 21 commits March 13, 2024 10:43

Split the language model register tab panel into the separated js file.

b7ea34c

WIP: 从HUB中获得模型信息

18f464e

Merge branch 'main' into fetch_from_hub

25f2b38

add more unit tests for getting model info.

cc903e1

fix the get model info correctly.

c108eb0

fix the wrong llm family build logic.

c24a459

Merge branch 'main' into fetch_from_hub

3e4f765

fix the bug that cannot process quantization with lower case.

4433f85

add support for GGUF, GGML of ModelScope

53ca758

add async runner to run function async, and add model hub utility to …

5dc38a1

…get model hub infos.

Did the code refactoring to get gguf and ggml model info.

3991ae6

when model size is decimals, replace dot with underscore.

c0936e8

when model size is unknown, not return "UNKNOWN", but 0

5081c48

add support of pytorch and awq format to fetch model info from hub.

f0bd9af

add support of pytorch and awq format to fetch model info from hub.

46e1626

add the rest api for fetch model info from model hub

97dbae6

Merge branch 'main' into fetch_from_hub

f915155

refactor the pytest case with pytest.raises to catch exception.

b853970

fix the but that '0.x' form model id cannot be processed correctly

b354cec

update caniuse-lite

e5ba02d

Finish LLM custom model register from hub.

9b774fe

XprobeBot added this to the v0.9.4 milestone Mar 19, 2024

XprobeBot modified the milestones: v0.9.4, v0.9.5 Mar 21, 2024

XprobeBot modified the milestones: v0.10.0, v0.10.1 Mar 29, 2024

mikeshi80 added 2 commits April 2, 2024 09:48

Merge branch 'main' into fetch_from_hub

8ed9ece

Finish embedding custom model register from hub.

c0a636c

XprobeBot modified the milestones: v0.10.2, v0.10.3, v0.11.0 Apr 19, 2024

XprobeBot modified the milestones: v0.11.0, v0.11.1, v0.11.2 May 11, 2024

XprobeBot modified the milestones: v0.11.2, v0.11.3 May 24, 2024

XprobeBot modified the milestones: v0.11.3, v0.11.4, v0.12.0, v0.12.1 May 31, 2024

XprobeBot modified the milestones: v0.12.1, v0.12.2 Jun 14, 2024

XprobeBot modified the milestones: v0.12.2, v0.12.4, v0.13.0, v0.13.1 Jun 28, 2024

XprobeBot modified the milestones: v0.13.1, v0.13.2 Jul 12, 2024

XprobeBot modified the milestones: v0.13.2, v0.13.4 Jul 26, 2024

XprobeBot modified the milestones: v0.14, v0.15 Sep 3, 2024

XprobeBot modified the milestones: v0.15, v0.16 Oct 30, 2024

XprobeBot modified the milestones: v0.16, v1.x Nov 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fetch from hub #1160

Fetch from hub #1160

mikeshi80 commented Mar 19, 2024

ChengjieLi28 commented Mar 19, 2024

mikeshi80 commented Mar 20, 2024

Fetch from hub #1160

Are you sure you want to change the base?

Fetch from hub #1160

Conversation

mikeshi80 commented Mar 19, 2024

ChengjieLi28 commented Mar 19, 2024

mikeshi80 commented Mar 20, 2024