sparse gemm kernels are not supported in ACL #1084

snadampal · 2023-12-19T18:22:44Z

Output of 'strings libarm_compute.so | grep arm_compute_version':
arm_compute_version=v23.11 Build options: {'Werror': '0', 'debug': '0', 'neon': '1', 'opencl': '0', 'embed_kernels': '0', 'os': 'linux', 'arch': 'armv8a', 'build': 'native', 'multi_isa': '1', 'fixed_format_kernels': '1', 'openmp': '1', 'cppthreads': '0'} Git hash=b'add70ace1e57f65d1ae4d0cedaec6e4578cf87ff'

Platform:
AWS c7g.16xl

Operating System:
Ubuntu 22.04

Problem description:

PyTorch supports sparse tensor formats The request is to provide aarch64 gemm kernels that accept these sparse formatted tensors and does sparse gemm implementation to achieve better performance.

morgolock · 2024-01-17T14:51:04Z

Hi @snadampal

Thanks for raising this. We will discuss the feature request with the team.

morgolock · 2024-01-30T11:29:39Z

Hi @snadampal

We discussed this with the team, we are considering exploring sparse tensors support in the context of GenAI but this is not officially in the roadmap for ACL. There are no plans to implement this feature.

We would be interested in specific use cases for ACL which you could share with us.

Hope this helps.

snadampal · 2024-03-16T20:21:42Z

Hi @morgolock , isn't ACL targeted for GenAI use cases? My requirement is mainly to accelerate sparse LLMs inference with ACL gemm kernels. Are you planning a different GEMM library for GenAI ?

morgolock · 2024-03-20T15:44:53Z

Hi @snadampal

Apologies if I was not clear. we're exploring ways to accelerate GenAI workloads with ACL. This means that we may consider exploring sparse tensors support in ACL to accelerate these workloads in the future but we don't have any work planned for this feature in our roadmap.

My requirement is mainly to accelerate sparse LLMs inference with ACL gemm kernels.

Could you please share more details about the models you would like to accelerate?

Hope this helps

snadampal · 2024-03-21T02:18:47Z

Hi @morgolock , thanks for the clarification. will share the details with you.

Arnav0400 · 2024-09-09T13:06:49Z

Hi @morgolock, any updates on the plan for sparse GEMM kernels in ACL? This is particularly interesting as sparse LLMs are able to match performance of dense counterparts on task-specific applications. For ref. - https://arxiv.org/pdf/2310.06927

morgolock · 2024-09-23T13:00:31Z

Hi @Arnav0400

There is no work planned to implement this feature at the moment. I'll discuss it again with the team.

morgolock added the Feature Request label Dec 20, 2023

morgolock closed this as completed Mar 15, 2024

morgolock reopened this Mar 20, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

sparse gemm kernels are not supported in ACL #1084

sparse gemm kernels are not supported in ACL #1084

snadampal commented Dec 19, 2023

morgolock commented Jan 17, 2024

morgolock commented Jan 30, 2024

snadampal commented Mar 16, 2024

morgolock commented Mar 20, 2024

snadampal commented Mar 21, 2024

Arnav0400 commented Sep 9, 2024 •

edited

Loading

morgolock commented Sep 23, 2024

sparse gemm kernels are not supported in ACL #1084

sparse gemm kernels are not supported in ACL #1084

Comments

snadampal commented Dec 19, 2023

morgolock commented Jan 17, 2024

morgolock commented Jan 30, 2024

snadampal commented Mar 16, 2024

morgolock commented Mar 20, 2024

snadampal commented Mar 21, 2024

Arnav0400 commented Sep 9, 2024 • edited Loading

morgolock commented Sep 23, 2024

Arnav0400 commented Sep 9, 2024 •

edited

Loading