Actions: InternLM/lmdeploy

publish-docker

841 workflow runs

Support baichuan2-chat chat template (#378)
publish-docker #66: Commit 55764e0 pushed by lvhan028
September 8, 2023 09:03 30d 0h 0m 3s main
fix exceed session len core dump for chat and generate (#366)
publish-docker #65: Commit ce21a31 pushed by lvhan028
September 7, 2023 10:38 30d 0h 0m 3s main
[Fix] Set max dynamic smem size for decoder MHA to support context le…
publish-docker #64: Commit 71ade77 pushed by lvhan028
September 7, 2023 10:38 30d 0h 0m 4s main
bug-fix: always use stream mode to enable persistent batching (#346)
publish-docker #63: Commit 57cf99b pushed by lvhan028
September 7, 2023 09:42 30d 0h 0m 3s main
bump version to v0.0.7 (#358)
publish-docker #61: Commit d065f3e pushed by lvhan028
September 4, 2023 06:38 30d 0h 0m 4s main
Fix profile_serving hung issue (#344)
publish-docker #60: Commit edb7c6e pushed by lvhan028
September 4, 2023 03:07 30d 0h 0m 3s main
Decode generated token_ids incrementally (#309)
publish-docker #59: Commit 9bfe03c pushed by lvhan028
September 1, 2023 08:39 30d 0h 0m 3s main
Package 'bin/llama_gemm' to wheel (#320)
publish-docker #58: Commit 22e8b2c pushed by lvhan028
September 1, 2023 04:16 30d 0h 0m 2s main
Add flashattention2 (#196)
publish-docker #57: Commit 452822a pushed by lvhan028
August 29, 2023 12:53 30d 0h 0m 2s main
Fix turbomind import error on windows (#316)
publish-docker #56: Commit d4d609b pushed by lvhan028
August 29, 2023 08:14 30d 0h 0m 3s main
fix(kvint8): update doc (#315)
publish-docker #55: Commit a48e2d2 pushed by lvhan028
August 29, 2023 05:50 30d 0h 0m 3s main
Fix readthedocs building (#321)
publish-docker #54: Commit 08b2812 pushed by lvhan028
August 29, 2023 02:28 30d 0h 0m 3s main
bump version to v0.0.6 (#283)
publish-docker #52: Commit cfabbbd pushed by lvhan028
August 25, 2023 13:27 30d 0h 0m 2s main
Import turbomind in gradio server only when it is needed (#303)
publish-docker #51: Commit 59f8e67 pushed by lvhan028
August 25, 2023 04:45 30d 0h 0m 3s main
Enable the Gradio server to call inference services through the RESTf…
publish-docker #50: Commit 4279d8c pushed by lvhan028
August 24, 2023 11:35 30d 0h 0m 3s main
[Feature] Support decode with DP in pytorch (#193)
publish-docker #49: Commit 81f2983 pushed by lvhan028
August 24, 2023 07:48 30d 0h 0m 3s main
Pad tok_embedding and output weights to make their shape divisible by…
publish-docker #48: Commit 4903d3c pushed by lvhan028
August 24, 2023 04:29 30d 0h 0m 3s main
[Fix] Fix llama2 70b & qwen quantization error (#273)
publish-docker #47: Commit d5cb0be pushed by lvhan028
August 24, 2023 02:58 30d 0h 0m 3s main
[Fix] Fix building with CUDA 11.3 (#280)
publish-docker #45: Commit 9e36648 pushed by lvhan028
August 22, 2023 12:51 30d 0h 0m 3s main
Update workflow for building docker image (#282)
publish-docker #44: Commit 0632735 pushed by lvhan028
August 22, 2023 06:17 30d 0h 0m 3s main
Add Restful API (#223)
publish-docker #43: Commit d5c10e7 pushed by lvhan028
August 22, 2023 05:59 30d 0h 0m 3s main
Pass chat template args including meta_prompt to model (#225)
publish-docker #42: Commit 7785142 pushed by lvhan028
August 21, 2023 13:20 30d 0h 0m 3s main
add readthedocs (#208)
publish-docker #41: Commit c238f1c pushed by lvhan028
August 21, 2023 04:20 30d 0h 0m 3s main
Support TP for w4a16 (#262)
publish-docker #40: Commit 89f3d32 pushed by lvhan028
August 18, 2023 11:04 30d 0h 0m 2s main
[Feature] Support Qwen-7B, dynamic NTK scaling and logN scaling in tu…
publish-docker #39: Commit 4a60b45 pushed by lvhan028
August 18, 2023 09:49 30d 0h 0m 2s main