-
Notifications
You must be signed in to change notification settings - Fork 22
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How does gemini nano work on qcom Npu/Gpu #7
Comments
@zhouyiself Is AICORE based on liteRT to connect to the Qualcomm acceleration library or is there another implementation in AICORE? Thank you very much |
AICore on Galaxy device:
AICore thirdpartyeap:
That's enough to answer this issue. |
Thanks a lot, but it looks like the pixel and sm8650 have different software paths? pixel uses ggml, Qualcomm npu as far as I know there are only two ways of tflite qnnhtp delegate and qnn offline compilation, and all signs show that aicore does not seem to use tflite runtime, which is where I do not understand. Thank you for your reply |
AICore has several versions depending on the SoC, on Galaxy devices they may wrapped Qualcomm Genie SDK because that's the fastest API on a modern Qualcomm device with NPU, and they might not care about compabilities, so they probably won't use TFlite or GPUs. |
I have a question. From the documentation and introduction, ai edge sdk api is used to access AICORE, while gemini nano is in AICORE. How does AICORE connect to Qualcomm npu hardware? Is it through runtime in AICORE or liteRT gpu delegate(tflite runtime) or Qualcomm's own qnn htp/gpu delegate? It's not mentioned in the document or the video.
thanks
The text was updated successfully, but these errors were encountered: