Release Plan of BitBLAS 0.0.1 #150
Looking ahead, our plan for v0.0.2 should include at least support for the Marlin template, quantized Flash Attention, and Group MoE :)
PR #153 serializes the kernel name together with the operator config and hint.
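As a rough illustration (not the actual implementation in PR #153), a kernel name could be derived deterministically from the operator config plus a scheduling hint, so cached kernels can be looked up across runs. The `MatmulConfig`, `ScheduleHint`, and `serialize_kernel_name` names below are hypothetical:

```python
# Hypothetical sketch: deriving a stable kernel name from an operator config
# and a scheduling hint. All names here are illustrative, not BitBLAS API.
import hashlib
import json
from dataclasses import asdict, dataclass


@dataclass(frozen=True)
class MatmulConfig:
    M: int
    N: int
    K: int
    in_dtype: str = "float16"
    out_dtype: str = "float16"


@dataclass(frozen=True)
class ScheduleHint:
    block_m: int = 128
    block_n: int = 128
    num_stages: int = 2


def serialize_kernel_name(op: str, config: MatmulConfig, hint: ScheduleHint) -> str:
    # Serialize config and hint deterministically (sorted keys), then hash so the
    # identifier stays short enough to use as a symbol or file name.
    payload = json.dumps(
        {"op": op, "config": asdict(config), "hint": asdict(hint)}, sort_keys=True
    )
    digest = hashlib.sha256(payload.encode()).hexdigest()[:16]
    return f"{op}_{config.M}x{config.N}x{config.K}_{config.in_dtype}_{digest}"


if __name__ == "__main__":
    name = serialize_kernel_name("matmul", MatmulConfig(1024, 1024, 1024), ScheduleHint())
    print(name)  # e.g. matmul_1024x1024x1024_float16_<hash>
```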
From a policy perspective, I think we should currently use LOP3 only for weight propagation. This approach is compatible not only with A100 devices but also with other common targets, such as SM70 or AMD GPUs (weight propagation is not yet implemented for AMD, but it could be). For Stage3 performance, we can provide an option to enable it. Moreover, the incoming stream_k template should share the same weight transformation function with Stage3.
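A minimal sketch of that policy, using purely illustrative names (`WeightPropagation`, `select_weight_transform`, `stage3_weight_transform` are assumptions, not BitBLAS code): LOP3 as the portable default, Stage3 as an opt-in, and stream_k reusing the same Stage3 weight transform rather than defining its own:

```python
# Hypothetical sketch of the propagation policy; names are illustrative only.
from enum import Enum, auto
from typing import Callable

import numpy as np


class WeightPropagation(Enum):
    LOP3 = auto()    # portable default: works on A100, SM70, and could be ported to AMD
    STAGE3 = auto()  # optional, enabled only when the user opts in


def stage3_weight_transform(weight: np.ndarray) -> np.ndarray:
    # Placeholder for the offline layout transformation a Stage3 kernel would
    # expect; the real transform would repack the weight tensor accordingly.
    return np.ascontiguousarray(weight)


def select_weight_transform(
    mode: WeightPropagation, enable_stage3: bool = False
) -> Callable[[np.ndarray], np.ndarray]:
    # The stream_k template would call this same selector, so both stream_k and
    # Stage3 consume identically transformed weights.
    if mode is WeightPropagation.STAGE3 and enable_stage3:
        return stage3_weight_transform
    return lambda w: w  # LOP3 path: no extra offline transformation in this sketch


if __name__ == "__main__":
    w = np.random.randn(256, 256).astype(np.float16)
    transform = select_weight_transform(WeightPropagation.LOP3)
    print(transform(w).shape)
```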
I think vLLM PR vllm-project/vllm#6036 requires no further modifications to BitBLAS, so we should consider publishing the formal release.
PR #249 has successfully passed all test cases; we should now proceed to review the benchmark scripts.
Hi all, it's time for us to consider the official release of BitBLAS v0.0.1. Here are some todo items before this release: