gemm+silu fused operator is not supported in ACL #1125

Open
TianyuLi0 opened this issue Jul 18, 2024 · 1 comment

Comments

@TianyuLi0

Output of 'strings libarm_compute.so | grep arm_compute_version':
arm_compute_version=v24.04 Build options: {'Werror': '1', 'build_dir': '//acl/build', 'debug': '0', 'neon': '1', 'opencl': '0', 'os': 'linux', 'openmp': '1', 'cppthreads': '0', 'arch': 'armv8.2-a', 'multi_isa': '1', 'fixed_format_kernels': '1', 'build': 'native'} Git hash=b'4fda7a803eaadf00ba36bd532481a33c18952089'

Platform:
Neoverse N2

Operating System:
Ubuntu 22.04

Problem description:
gemm+silu fused operator is not supported in ACL

LLM use case: Llama-like models apply the SiLU activation to GEMM outputs.
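To make the request concrete, here is a minimal pure-Python sketch of what "gemm+silu fused" means, assuming the usual definition SiLU(x) = x * sigmoid(x). It is not ACL code and does not use any ACL API; the function names (`silu`, `gemm`, `gemm_then_silu`, `gemm_silu_fused`) are hypothetical and chosen only for illustration. The unfused path writes the full GEMM result and then makes a second pass for the activation, while the fused path applies SiLU to each accumulator before it is stored.

```python
import math


def silu(x):
    """SiLU (a.k.a. swish): x * sigmoid(x), written to avoid exp overflow."""
    if x >= 0.0:
        return x / (1.0 + math.exp(-x))
    e = math.exp(x)  # x < 0, so exp(x) cannot overflow
    return x * e / (1.0 + e)


def gemm(a, b):
    """Naive matrix multiply of a (M x K) by b (K x N), both nested lists."""
    k, n = len(b), len(b[0])
    return [[sum(row[p] * b[p][j] for p in range(k)) for j in range(n)]
            for row in a]


def gemm_then_silu(a, b):
    """Unfused: materialize the GEMM output, then a second pass for SiLU."""
    c = gemm(a, b)
    return [[silu(v) for v in row] for row in c]


def gemm_silu_fused(a, b):
    """Fused: apply SiLU to each accumulator right before it is stored."""
    m, k, n = len(a), len(b), len(b[0])
    out = [[0.0] * n for _ in range(m)]
    for i in range(m):
        for j in range(n):
            acc = 0.0
            for p in range(k):
                acc += a[i][p] * b[p][j]
            out[i][j] = silu(acc)  # activation in the epilogue, no extra pass
    return out
```

Both paths produce the same values; the point of fusing is to skip the extra read and write of the intermediate matrix, which is what a fused operator in a library would be expected to save.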

Similar issue
#1083

@morgolock

Hi @TianyuLi0

We currently don't have plans to implement silu in ACL.

KleidiAI is better suited to accelerate the type of workloads you find in these models than ACL. Please have a look at this article discussing LLM acceleration on mobile CPUs.

Alternatively, you are welcome to submit a patch to ACL. Please see our contribution guide for more information on how to add a new operator.

Hope this helps
