QNN HTP

The following tutorial demonstrates running the Llama 3 3B model on the QNN HTP backend using genie-t2t-run.

Note

This section assumes that the QNN HTP context binaries have been obtained via the QNN workflow.

An example backend_ext_config.json can be found at ${QNN_SDK_ROOT}/examples/Genie/configs/htp_backend_ext_config.json.

For more information on the QNN HTP backend extension configuration options, refer to ${QNN_SDK_ROOT}/docs/QNN/general/htp/htp_backend.html.
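Before editing the backend extension configuration, it can help to confirm that the example file is present and parses as JSON. The following is a minimal sketch, not part of the SDK: the helper name load_backend_ext_config is hypothetical, and it assumes only that the QNN_SDK_ROOT environment variable points at the SDK installation and that the example file lives at the path given above.

```python
import json
import os
from pathlib import Path

# Relative path of the example config, taken from the text above.
EXAMPLE_CONFIG = Path("examples/Genie/configs/htp_backend_ext_config.json")


def load_backend_ext_config(sdk_root=None):
    """Hypothetical helper: load the example HTP backend extension config."""
    # Fall back to the QNN_SDK_ROOT environment variable when no root is given.
    root = sdk_root or os.environ.get("QNN_SDK_ROOT")
    if not root:
        raise RuntimeError("QNN_SDK_ROOT is not set")
    config_path = Path(root) / EXAMPLE_CONFIG
    if not config_path.is_file():
        raise FileNotFoundError(f"Example config not found: {config_path}")
    with config_path.open(encoding="utf-8") as handle:
        return json.load(handle)


if __name__ == "__main__":
    # Pretty-print the config so its options can be compared with htp_backend.html.
    print(json.dumps(load_backend_ext_config(), indent=2))
```

The script only reads the file; it makes no assumption about which options the configuration contains.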

Please select your target platform: