Deploy AIMET quantized model with quantized encodings to QNN #3878
Unanswered
Anurag Ranjan (anuragranj) asked this question in Q&A
Hi Anurag Ranjan (@anuragranj), this is a two-step process.
We are actively working on improving our docs to make them easier to use. Any feedback would be appreciated.
I am currently working on deploying an AIMET-quantized ONNX model with its quantization encodings on an Android device using QNN. However, I am facing discrepancies in the output, and I am unable to find any tutorial or documentation that outlines the complete workflow for this process.
Is there any official guide or example demonstrating how to take a standard model (such as MobileNet) through AIMET quantization and encodings, and successfully run it on an Android device using QNN? Any pointers or documentation on this would be greatly appreciated.
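To make the discrepancy concrete, a small comparison script can help. The following is only a minimal sketch and is not part of AIMET or QNN: it assumes the reference output (for example from the AIMET quantization simulation) and the on-device output have already been dumped and flattened into plain lists of floats. The function name `compare_outputs` and the sample values are hypothetical.

```python
import math


def compare_outputs(reference, candidate):
    """Return (max_abs_diff, sqnr_db) for two equal-length float sequences.

    `reference` is the output treated as ground truth and `candidate` is the
    output under test. Both are assumed to be flattened already.
    """
    if len(reference) != len(candidate):
        raise ValueError("outputs must have the same number of elements")
    if len(reference) == 0:
        raise ValueError("outputs must not be empty")

    # Largest element-wise deviation between the two outputs.
    max_abs_diff = max(abs(r - c) for r, c in zip(reference, candidate))

    # Signal-to-quantization-noise ratio in dB (higher means closer match).
    signal_power = sum(r * r for r in reference)
    noise_power = sum((r - c) ** 2 for r, c in zip(reference, candidate))
    if noise_power == 0.0:
        return max_abs_diff, math.inf
    if signal_power == 0.0:
        return max_abs_diff, -math.inf
    return max_abs_diff, 10.0 * math.log10(signal_power / noise_power)


if __name__ == "__main__":
    # Made-up values purely for illustration.
    sim_output = [0.12, 0.85, 0.03]
    device_output = [0.10, 0.86, 0.04]
    diff, sqnr = compare_outputs(sim_output, device_output)
    print(f"max abs diff: {diff:.4f}, SQNR: {sqnr:.2f} dB")
```

Reporting numbers like these for the same input tensor makes it easier to tell whether the mismatch is ordinary quantization noise or a sign that the encodings are not being applied on the device.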
cc: Abhi Khobare (@quic-akhobare)